LLM Index

Supported providers and models

ProviderModelStatusDistributionReleaseDeprecationParametersContext lengthMax tokensPrice in/out [$/1Mt]Links
AI21
Jamba 1.5 Large
Aug 22, 2024May 06, 2025256,0004,096$2 / $8
Jamba 1.5 Mini
Aug 22, 2024May 06, 2025256,0004,096$0.2 / $0.4
Jamba 1.6 Large
Mar 06, 2025Mar 08, 2025256,0004,096$2 / $8
Jamba 1.6 Mini
Mar 06, 2025Mar 08, 2025256,0004,096$0.2 / $0.4
Jamba 1.7 Large
Jul 03, 2025256,0004,096$2 / $8
Jamba 1.7 Mini
Jul 03, 2025256,0004,096$0.2 / $0.4
Jurassic 2 Light
Mar 09, 20238,1928,192$0.1 / $0.5
Jurassic 2 Mid
Mar 09, 20238,1928,192$0.25 / $1.25
Jurassic 2 Ultra
Mar 09, 20238,1928,192$2 / $10
Aleph Alpha
Luminous Base
Apr 14, 20232,0482,048$30
Luminous Base Control
Apr 14, 20232,0482,048$37.5
Luminous Extended
Apr 14, 20232,0482,048$45
Luminous Extended Control
Apr 14, 20232,0482,048$56.25
Luminous Supreme
Apr 14, 20232,0482,048$175
Luminous Supreme Control
Apr 14, 20232,0482,048$218.75
Anthropic
Claude 2
Jul 11, 2023100,0004,096$8 / $24
Claude 2.1
Nov 21, 2023200,0004,096$8 / $24
Claude 3 Haiku
Mar 15, 2024200,0004,096$0.25 / $1.25
Claude 3 Opus
Mar 04, 2024Jan 30, 2025200,0004,096$15 / $75
Claude 3 Sonnet
Mar 04, 2024Jul 21, 2025200,0004,096$3 / $15
Claude 3.5 Haiku
2024/10/22
Oct 22, 2024200,0008,192$0.8 / $4
Claude 3.5 Sonnet
2024/06/20
Jun 20, 2024Oct 22, 2025200,0008,192$3 / $15
Claude 3.5 Sonnet
2024/10/22
Oct 22, 2024Oct 22, 2025200,0008,192$3 / $15
Claude 3.7 Sonnet
2025/02/19
Feb 24, 2025200,0008,192$3 / $15
Claude 4 Opus
2025/05/14
May 22, 2025200,00064,000$15 / $75
Claude 4 Sonnet
2025/05/14
May 22, 2025200,00064,000$3 / $15
Claude 4.1 Opus
2025/08/05
Aug 05, 2025200,00032,000$15 / $75
Claude 4.5 Haiku
2025/10/01
Oct 15, 2025200,00064,000$1 / $5
Claude 4.5 Opus
2025/11/01
Nov 25, 2025200,00064,000$5 / $25
Claude 4.5 Sonnet
2025/09/29
Sep 29, 2025200,00064,000$3 / $15
Claude Instant 1
Mar 14, 2023100,0002,048$1.63 / $5.51
Claude Instant 1.2
Sep 21, 2023100,0002,048$1.63 / $5.51
Cohere
Aya Expanse 32B
Oct 24, 2024
32B
128,0004,096$0
Aya Expanse 8B
Oct 24, 2024
8B
8,1924,096$0
Command
Feb 07, 2024Sep 15, 20254,0964,096$15
Command A
03/2025
Mar 13, 2025256,0008,000$2.5 / $10
Command A Reasoning
08/2025
Aug 21, 2025256,0008,000$2.5 / $10
Command Light
Feb 07, 2024Sep 15, 20254,0964,096$15
Command Nightly
Feb 07, 2024Sep 15, 20254,0964,096$15
Command R
08/2024
Aug 30, 2024128,0004,096$0.15 / $0.6
Command R 7b
12/2024
Dec 13, 2024
7B
128,0004,096$0.04 / $0.15
Command R+
08/2024
Aug 30, 2024128,0004,096$2.5 / $10
DeepInfra
Code Llama 34B
Instruct HF
Aug 24, 2023
34B
16,3844,096$0.6
DeepSeek R1
Jan 20, 2025
671B
128,0008,192$0.55 / $2.19
DeepSeek R1 Turbo
Mar 26, 2025
671B
128,0008,192$1 / $3
DeepSeek V3
Dec 26, 2024
671B
128,0004,096$0.85 / $0.9
DeepSeek V3.1
Aug 21, 2025
671B
163,840163,840$0.3 / $1
DeepSeek V3.2 Exp
Sep 29, 2025
671B
163,840163,840$0.27 / $0.4
Gemma 2 27B
Jun 27, 2024
27B
8,1928,192$0.27
Gemma 2 9B
Jun 27, 2024
9B
8,1928,192$0.06
Gemma 3 12B
Mar 10, 2025
12B
131,0728,192$0.05 / $0.1
Gemma 3 27B
Mar 10, 2025
27B
131,0728,192$0.1 / $0.2
Gemma 3 4B
Mar 10, 2025
4B
131,0728,192$0.02 / $0.04
Kimi K2 Instruct
Jul 11, 2025
32B
131,07216,384$0.55 / $2.2
Kimi K2 Thinking
Nov 11, 2025
32B
262,144262,144$0.55 / $2.5
Llama 2 13B
Chat HF
Jul 18, 2023
13B
4,0964,096$0.35
Llama 2 70B
Chat HF
Jul 18, 2023
70B
4,0964,096$1.88
Llama 2 7B
Chat HF
Jul 18, 2023
7B
4,0964,096$0.2
Llama 3.3 70B
Instruct
Dec 06, 2024
70B
128,0008,192$0.23 / $0.4
Llama 3.3 70B Turbo
Instruct
Dec 06, 2024
70B
128,0008,192$0.12 / $0.3
Llama 4 Maverick 17B 128e
Instruct FP8
Apr 05, 2025
17B
131,0728,192$0.2 / $0.6
Llama 4 Scout 17B 16e
Instruct
Apr 05, 2025
17B
131,0728,192$0.1 / $0.3
Meta Llama 3.1 405B
Instruct
Jul 23, 2024
405B
128,0008,192$1.79
Meta Llama 3.1 70B
Instruct
Jul 23, 2024
70B
128,0008,192$0.35 / $0.4
Meta Llama 3.1 8B
Instruct
Jul 23, 2024
8B
128,0008,192$0.06
Meta Llama 3.2 1B
Instruct
Sep 25, 2024
1B
128,0008,192$0.01 / $0.02
Meta Llama 3.2 3B
Instruct
Sep 25, 2024
3B
128,0008,192$0.03 / $0.05
MiniMax M2
Nov 11, 2025
10B
262,144262,144$0.27 / $1.15
Mistral 7B
Instruct v0.1
Sep 27, 2023
7B
32,7688,192$0.2
Mistral Nemo
Instruct 24/07
Jul 17, 2024
12B
131,0728,192$0.13
Mixtral 8x22B
Instruct v0.1
Sep 19, 2024
22B
65,5368,192$0.65
Mixtral 8x7B
Instruct v0.1
Sep 19, 2024
56B
32,7688,192$0.24
OpenAI GPT-OSS 120B
Aug 05, 2025
117B
131,07232,766$0.09 / $0.45
OpenAI GPT-OSS 20B
Aug 05, 2025
21B
131,07232,766$0.04 / $0.16
Phi 4
Dec 12, 2024
14B
16,3848,192$0.07 / $0.14
Qwen 2 72B
Instruct
Jul 23, 2024
72B
128,0008,192$0.35 / $0.4
Qwen 2.5 72B
Instruct
Sep 19, 2024
72B
131,0728,192$0.23 / $0.4
Qwen 3 14B
Apr 29, 2025
14B
128,0008,192$0.08 / $0.24
Qwen 3 30B A3B
Apr 29, 2025
30B
128,0008,192$0.1 / $0.3
Qwen 3 32B
Apr 29, 2025
32B
128,0008,192$0.1 / $0.3
Qwen3 235B A22B
Apr 29, 2025
325B
128,0008,192$0.2 / $0.6
WizardLM 2 7B
Apr 16, 2024
7B
32,7688,192$0.06
WizardLM 2 8x22B
Apr 16, 2024
176B
65,5368,192$0.5
DeepMind
Chat Bison
May 10, 20238,1961,024$0.5
Code Bison
May 10, 20238,1961,024$0.5
Gemini 1.0 Pro
Dec 13, 202332,7688,192$0.5 / $1.5
Gemini 1.5 Flash
exp 08/27
Aug 27, 20241,048,5768,192$0.08 / $0.3
Gemini 1.5 Flash
May 14, 2024Sep 24, 20251,048,5768,192$0.08 / $0.3
Gemini 1.5 Flash 8B
Oct 03, 2024Sep 24, 2025
8B
1,048,5768,192$0.08 / $0.3
Gemini 1.5 Flash 8B
exp 08/27
Aug 27, 2024
8B
1,048,5768,192$0.04 / $0.15
Gemini 1.5 Pro
exp 08/27
Aug 27, 20242,097,1528,192$1.25 / $5
Gemini 1.5 Pro
exp 08/01
Aug 27, 20242,097,1528,192$1.25 / $5
Gemini 1.5 Pro
Feb 15, 20242,097,1528,192$1.25 / $5
Gemini 1.5 Pro
Feb 15, 2024May 24, 20252,097,1528,192$1.25 / $5
Gemini 2.0 Flash
Dec 11, 20241,048,5768,192$0.1 / $0.4
Gemini 2.0 Flash
exp
Dec 11, 20241,048,5768,192$0
Gemini 2.0 Flash Lite
Feb 05, 20251,048,5768,192$0.08 / $0.3
Gemini 2.0 Flash Thinking
exp 12/19
Dec 19, 20241,048,5768,192$0
Gemini 2.0 Pro
exp 02/05
Feb 05, 20251,048,5768,192$0
Gemini 2.5 Flash
preview (05/20)
May 20, 20251,048,57665,536$0.02 / $0.6
Gemini 2.5 Flash
May 20, 20251,048,57665,536$0.3 / $2.5
Gemini 2.5 Flash Lite
Jul 22, 20251,048,57665,536$0.1 / $0.4
Gemini 2.5 Pro
preview (05/06)
May 06, 20251,048,57665,536$1.25 / $10
Gemini 2.5 Pro
May 06, 20251,048,57665,536$1.25 / $10
Gemini 2.5 Pro
preview (03/25)
Mar 25, 20251,048,57665,536$1.25 / $10
Gemini 3 Pro
preview
Nov 18, 20251,048,57665,536$2 / $12
Text Bison
May 10, 20238,1921,024$0.5
DeepSeek
Chat
v3.2
Sep 29, 2025
671B
128,0008,000$0.28 / $0.42
Reasoner
v3.2
Sep 29, 2025
671B
128,00064,000$0.28 / $0.42
FetchAI
ASI:One extended
Apr 22, 202564,00064,000$0
ASI:One fast
Apr 30, 202564,00064,000$0
ASI:One mini
Feb 25, 2025128,000128,000$0
Groq
Alibaba Qwen 2.5 32B
Sep 19, 2024
32B
128,0008,000$0.79
Alibaba Qwen 3 32B
Apr 29, 2025
32B
131,07240,960$0.29 / $0.59
Alibaba Qwen QwQ 32B
Mar 05, 2025
32B
131,0728,000$0.29 / $0.39
Compound
Apr 15, 2025131,0728,192$0
Compound Mini
Apr 15, 2025131,0728,192$0
DeepSeek R1 Distill Llama 70B
Jan 20, 2025Oct 02, 2025
70B
128,0001,024$0.75 / $0.99
DeepSeek R1 Distill Qwen 32B
Jan 20, 2025Apr 14, 2025
671B
128,00016,384$0.69
Google Gemma 2 9B
Jun 27, 2024Oct 08, 2025
9B
8,1928,192$0.2
Google Gemma 7B
Jan 15, 2024
7B
8,1928,192$0
Meta Llama 2 70B
Jan 15, 2024
70B
4,0964,096$0
Meta Llama 3 70B
Apr 18, 2024
70B
128,0002,048$0.59 / $0.79
Meta Llama 3 8B
Apr 18, 2024
8B
128,0002,048$0.05 / $0.08
Meta Llama 3.1 405B
preview
Jul 23, 2024
405B
128,0002,048$0.59 / $0.79
Meta Llama 3.1 70B
preview
Jul 23, 2024
70B
128,0002,048$0.59 / $0.79
Meta Llama 3.1 8B
Jul 23, 2024
8B
128,0002,048$0.05 / $0.08
Meta Llama 3.2 1B
preview
Sep 25, 2024
1B
128,0002,048$0.04
Meta Llama 3.2 3B
preview
Sep 25, 2024
3B
128,0002,048$0.06
Meta Llama 3.3 70B
Dec 06, 2024
70B
128,0002,048$0.59 / $0.79
Meta Llama 4 Maverick 17B 128e
Apr 05, 2025
17B
131,0728,192$0.2 / $0.6
Meta Llama 4 Scout 17B 16e
Apr 05, 2025
17B
131,0728,192$0.11 / $0.34
Mistral Saba 24B
Feb 17, 2025Jul 30, 2025
24B
32,76832,768$0.79
Mixtral 8x7B
Jan 15, 2024
7B
32,7688,192$0.24
Moonshot AI Kimi K2
09/05
Sep 05, 2025
32B
262,14416,384$1 / $3
Moonshot AI Kimi K2 Instruct
Jul 11, 2025Oct 10, 2025
32B
131,07216,384$1 / $3
OpenAI GPT-OSS 120B
Aug 05, 2025
117B
131,07232,766$0.15 / $0.6
OpenAI GPT-OSS 20B
Aug 05, 2025
21B
131,07232,766$0.08 / $0.3
Mistral
Codestral
25.08
Jul 30, 2025128,0008,192$0.3 / $0.9
Magistral Medium
25.06
Jun 10, 2025128,000128,000$2 / $5
Magistral Small
25.06
Jun 10, 2025128,000128,000$0.5 / $1.5
Ministral 3B
latest
Oct 16, 2024
3B
131,0008,192$0.04
Ministral 8B
latest
Oct 16, 2024
8B
131,0008,192$0.1
Mistral Large
24.02
Feb 26, 2024Jun 16, 202532,0008,192$4 / $12
Mistral Large 2.1
24.11
Nov 18, 202412,8004,096$2 / $6
Mistral Large 3
25.12
Dec 02, 2025256,0008,192$0.5 / $1.5
Mistral Medium
latest
May 07, 2025128,0008,192$0.4 / $2
Mistral Medium
23.12
Dec 11, 2023Jun 16, 202532,0008,192$2.7 / $8.1
Mistral Medium 3
25.05
May 07, 2025128,0008,192$0.4 / $2
Mistral Medium 3.1
25.08
Aug 12, 2025128,0008,192$0.4 / $2
Mistral Small
Mixtral 8x7B
Dec 11, 2023
56B
32,0008,192$0.7
Mistral Small
latest
Dec 11, 202332,0008,192$1 / $3
Mistral Small
24.02
Nov 30, 2024Jun 16, 2025
24B
32,0008,192$1 / $3
Mistral Small 3.2
25.06
Jun 10, 2025
24B
128,0008,192$1 / $3
Mistral Tiny
Mistral 7B
Dec 11, 2023Mar 30, 2025
7B
32,0008,192$0.25
Nemo
24.07
Jul 18, 2024
12B
131,0008,192$0.3
Open Mixtral 8x22B
Apr 17, 2024Mar 30, 2025
176B
64,0008,192$2 / $6
Saba
25.02
Feb 17, 2025Sep 30, 2025
24B
32,0008,192$0.2 / $0.6
Moonshot AI
Kimi K2
07/11 preview
Jul 11, 2025
32B
131,072131,072$0.6 / $2.5
Kimi K2
09/05 preview
Sep 05, 2025
32B
262,144262,144$0.6 / $2.5
Kimi K2 thinking
Nov 06, 2025
32B
262,144262,144$0.6 / $2.5
Kimi K2 thinking turbo
Nov 06, 2025
32B
262,144262,144$1.15 / $8
Kimi K2 turbo
preview
Aug 03, 2025
32B
262,144262,144$1.15 / $8
NLP Cloud
Chat Dolphin
65,5368,192$0.5
Dolphin
65,5368,192$0.5
OpenAI
Ada 001
Jun 01, 2020Jan 04, 20242,0482,048$0.4
Babbage 001
Jun 01, 2020Jan 04, 20242,0482,048$0.5
Babbage 002
Jan 04, 2024Jan 27, 202516,38416,384$1.6
DaVinci 002
Jan 04, 2024Jan 27, 202516,38416,384$12
DaVinci 003
Nov 28, 2022Jan 04, 20244,0964,096$20
GPT-3.5 Turbo
11/06
Nov 06, 202316,3854,096$1 / $2
GPT-3.5 Turbo
01/25
Jan 25, 202416,3854,096$0.5 / $1.5
GPT-3.5 Turbo
06/13
Sep 13, 202416,3854,096$1.5 / $2
GPT-3.5 Turbo
Instruct
Nov 30, 202216,3854,096$1.5 / $2
GPT-3.5 Turbo
03/01
Jun 13, 202416,3854,096$1.5 / $2
GPT-3.5 Turbo
Nov 30, 202216,3854,096$1.5 / $2
GPT-3.5 Turbo 16k
03/01
Jun 13, 202416,3854,096$3 / $4
GPT-3.5 Turbo 16k
Nov 30, 202216,3854,096$3 / $4
GPT-3.5 Turbo 16k
06/13
Jun 13, 202416,3854,096$3 / $4
GPT-4
06/13
Jul 19, 20238,1924,096$30 / $60
GPT-4
Mar 14, 20238,1924,096$30 / $60
GPT-4
03/14
Jun 13, 20248,1924,096$30 / $60
GPT-4 0125 Turbo
Jan 25, 2024128,0004,096$10 / $30
GPT-4 32k
06/13
Jun 06, 202532,7684,096$60 / $120
GPT-4 32k
03/14
Jun 06, 202532,7684,096$60 / $120
GPT-4 32k
Mar 14, 2023Jun 06, 202532,7684,096$60 / $120
GPT-4 Turbo
2024/04/09
Apr 09, 2024128,0004,096$10 / $30
GPT-4 Turbo
Jan 25, 2024128,0004,096$10 / $30
GPT-4 Turbo
11/06
Nov 06, 2023128,0004,096$10 / $30
GPT-4.1
Apr 14, 20251,047,57632,768$2 / $8
GPT-4.1
2025/04/14
Apr 14, 20251,047,57632,768$2 / $8
GPT-4.1 mini
2025/04/14
Apr 14, 20251,047,57632,768$0.4 / $1.6
GPT-4.1 mini
Apr 14, 20251,047,57632,768$0.4 / $1.6
GPT-4.1 nano
Apr 14, 20251,047,57632,768$0.1 / $0.4
GPT-4.1 nano
2025/04/14
Apr 14, 20251,047,57632,768$0.1 / $0.4
GPT-4.5
preview
Feb 27, 2025Jul 14, 2025128,00016,384$75 / $150
GPT-4o
2024/08/06
Aug 06, 2024128,0004,096$5 / $15
GPT-4o
May 13, 2024128,0004,096$5 / $15
GPT-4o
2024/11/20
Nov 20, 2024128,0004,096$5 / $15
GPT-4o
2024/05/13
May 13, 2024128,0004,096$5 / $15
GPT-4o Search
preview
Jul 19, 2025128,00016,384$2.5 / $10
GPT-4o mini
Jul 18, 2024128,0004,096$0.15 / $0.6
GPT-4o mini
2024/07/18
Jul 18, 2024128,0004,096$0.15 / $0.6
GPT-4o mini Search
preview
Mar 11, 2025128,00016,384$0.15 / $0.6
GPT-5
2025/08/07
Aug 07, 2025400,000128,000$1.25 / $10
GPT-5 mini
2025/08/07
Aug 07, 2025400,000128,000$0.25 / $2
GPT-5 nano
2025/08/07
Aug 07, 2025400,000128,000$0.05 / $0.4
GPT-5 pro
2025/10/06
Oct 06, 2025400,000128,000$15 / $120
GPT-5.1
2025/11/13
Nov 13, 2025400,000128,000$1.25 / $10
o1
2024/12/17
Sep 12, 2024200,000100,000$15 / $60
o1
Sep 12, 2024200,000100,000$15 / $60
o1 mini
2024/09/12
Sep 12, 2024128,00065,536$3 / $12
o1 mini
Sep 12, 2024128,00065,536$3 / $12
o1 preview
2024/09/12
Sep 12, 2024128,00032,768$15 / $60
o1 preview
Sep 12, 2024128,00032,768$15 / $60
o1-pro
Mar 19, 2025200,000100,000$150 / $600
o3
Apr 16, 2025200,000100,000$10 / $40
o3
2025/04/16
Apr 16, 2025200,000100,000$10 / $40
o3 mini
Jan 31, 2025200,000100,000$1.1 / $4.4
o3 mini
2025/01/31
Jan 31, 2025200,000100,000$1.1 / $4.4
o3 pro
Jun 10, 2025200,000100,000$20 / $80
o4 mini
2025/04/16
Apr 16, 2025200,000100,000$1.1 / $4.4
o4 mini
Apr 16, 2025200,000100,000$1.1 / $4.4
o4 mini deep research
2025/06/26
Jun 26, 2025200,000100,000$2 / $8
OpenRouter
Claude 4.5 Sonnet
Sep 29, 20251,000,0001,000,000$3 / $15
DeepSeek V3.1
Aug 21, 2025
671B
163,840163,840$0.2 / $0.8
DeepSeek V3.2 Exp
Sep 29, 2025
671B
163,840163,840$0.27 / $0.4
Ernie 4.5 300B A47B
Jun 30, 2025
300B
123,000123,000$0.28 / $1.1
Gemini 3 Pro
preview
Nov 18, 20251,048,5761,048,576$2 / $12
Grok 4.1 Fast
Nov 19, 20252,000,00030,000$0
Hunyuan A13B
Jun 27, 2025
80B
32,7684,096$0.03
Moonshot AI Kimi K2
Jul 11, 2025
32B
131,07216,384$0.14 / $2.49
OpenAI GPT-OSS 120B
Aug 05, 2025
117B
131,07232,766$0.09 / $0.45
OpenAI GPT-OSS 20B
Aug 05, 2025
21B
131,07232,766$0.04 / $0.16
Perplexity
Code Llama 34B
Instruct
Oct 04, 2023
34B
16,3844,096$0.35 / $1.4
Code Llama 70B
Instruct
Oct 04, 2023
70B
16,3844,096$0.7 / $2.8
Llama 2 70B
Oct 04, 2023
70B
4,0964,096$0.7 / $2.8
Llama 3 70B
Instruct
May 14, 2024Aug 12, 2024
70B
8,1928,192$0 / $1
Llama 3 8B
Instruct
May 14, 2024Aug 12, 2024
8B
8,1928,192$0 / $0.2
Llama 3 Sonar large 32k
Online
May 14, 2024Aug 12, 2024127,0724,096$0 / $1
Llama 3 Sonar large 32k
Chat
May 14, 2024Aug 12, 2024127,0724,096$0 / $1
Llama 3 Sonar small 32k
Online
May 14, 2024Aug 12, 2024127,0724,096$0 / $0.2
Llama 3 Sonar small 32k
Chat
May 14, 2024Aug 12, 2024127,0724,096$0 / $0.2
Llama 3.1 Sonar huge 128k
Online
Aug 14, 2024Feb 22, 2025127,0724,096$0 / $5
Llama 3.1 Sonar large 128k
Online
Jul 31, 2024Feb 22, 2025127,0724,096$0 / $1
Llama 3.1 Sonar large 128k
Chat
Jul 31, 2024Feb 22, 2025127,0724,096$0 / $1
Llama 3.1 Sonar small 128k
Online
Jul 31, 2024Feb 22, 2025127,0724,096$0 / $0.2
Llama 3.1 Sonar small 128k
Chat
Jul 31, 2024Feb 22, 2025127,0724,096$0 / $0.2
Mistral 7B
Oct 04, 2023Aug 12, 2024
7B
32,7688,192$0.07 / $0.28
Mixtral 8x7B
Instruct
Oct 04, 2023Aug 12, 2024
7B
32,7688,192$0 / $0.6
R1 1776
Feb 18, 2025Aug 01, 2025
671B
128,0008,192$2 / $8
Sonar
Jan 25, 2025128,0008,196$1
Sonar Deep Research
Feb 14, 2025200,0008,196$2 / $8
Sonar Pro
Jan 25, 2025200,0008,196$3 / $15
Sonar Reasoning
Jan 30, 2025Dec 15, 2025128,0008,196$1 / $5
Sonar Reasoning Pro
Jan 30, 2025200,0008,196$2 / $8
Sonar medium
Chat
Feb 24, 2024127,0724,096$0.6 / $1.8
Sonar medium
Online
Feb 24, 2024127,0724,096$0 / $1.8
Sonar small
Online
Feb 24, 2024127,0724,096$0 / $0.28
Sonar small
Chat
Feb 24, 2024127,0724,096$0.07 / $0.28
pplx 70B
Chat
Oct 27, 2023
70B
16,3844,096$0.7 / $2.8
pplx 70B
Online
Nov 29, 2023
70B
16,3844,096$0 / $2.8
pplx 7B
Chat
Oct 27, 2023
7B
16,3844,096$0.07 / $0.28
pplx 7B
Online
Nov 29, 2023
7B
16,3844,096$0 / $0.28
Replicate
Llama 2 13B
13B
4,0964,096$0.1 / $0.5
Llama 2 13B chat
13B
4,0964,096$0.1 / $0.5
Meta Llama 3 8B
8B
4,0964,096$0.05 / $0.25
Venice
DeepSeek R1
Jan 20, 2025
671B
128,0008,192$3.5 / $14
GLM 4.6
Sep 30, 2025
357B
202,752202,752$0.85 / $2.75
Qwen 3 235B A22B Instruct
25/07
Jul 21, 2025
235B
131,072131,072$0.15 / $0.75
Qwen 3 235B A22B Thinking
25/07
Jul 21, 2025
235B
131,072131,072$0.45 / $3.5
Venice Large (Qwen 3 235B)
1.1 (D)
Apr 28, 2025
235B
131,072131,072$0.45 / $3.5
Venice Medium (Mistral 3.1 24B)
3.1
Apr 28, 2025
24B
131,072131,072$0.5 / $2
Venice Small (Qwen 3 4B)
May 12, 2025
4B
32,76832,768$0.05 / $0.15
Venice Uncensored
1.1
Jul 03, 202532,76832,768$0.2 / $0.9
xAI
Grok
beta
Nov 04, 2024131,0728,192$5 / $15
Grok 1
Nov 06, 2020
314B
8,1928,192$0
Grok 1.5
Mar 28, 2024128,0008,192$0
Grok 2
12/12
Aug 13, 2024Sep 15, 2025
314B
131,0728,192$2 / $10
Grok 2 Mini
Aug 13, 2024Sep 15, 2025
35B
131,0728,192$0
Grok 3
Feb 19, 2025131,0728,192$3 / $15
Grok 3 Fast
Feb 19, 2025Sep 15, 2025131,0728,192$5 / $25
Grok 3 Mini
Feb 19, 2025131,0728,192$0.3 / $0.5
Grok 3 Mini Fast
Feb 19, 2025Sep 15, 2025131,0728,192$0.6 / $4
Grok 4
07/09
Jul 09, 2025128,0008,192$3 / $15
Grok 4 Fast (non-reasoning)
Sep 19, 20252,000,0002,000,000$0.2 / $0.5
Grok 4 Fast (reasoning)
Sep 19, 20252,000,0002,000,000$0.2 / $0.5
Grok 4.1 Fast (non-reasoning)
Nov 19, 20252,000,0002,000,000$0.2 / $0.5
Grok 4.1 Fast (reasoning)
Nov 19, 20252,000,0002,000,000$0.2 / $0.5
Grok Code Fast 1
Aug 28, 2020256,000256,000$0.2 / $1.5

Promptmetheus currently supports 15 providers and 160 models.

Legend

Status:ย ย 
Supportedย ย  Deprecatedย ย  Not available

Distribution:ย ย 
Proprietaryย ย  Open weightsย ย  Open source

Links:ย ย 
Infoย ย  Announcementย ย  Benchmark

Disclaimer

Even though we try to keep this list accurate and up-to-date, please do not rely on the presented information for critical use cases and always double check official sources.

Model Parameters

Promptmetheus uses LiteLLM to connect to the different LLM APIs. You can find all relevant details for model parameter support in their documentation.

LLM Support

f We aim to support all major providers that have a public inference API and to add new models as soon as they become available. If a provider or model that you need is missing, please don't hesitate to request it.

LLM Benchmarks

We currently don't do any model benchmarking ourselves. Please consult the listed links for official benchmarks of each specific model and the LLM Benchmarks page for a list of trusted sources and leaderboards.

Promptmetheus ยฉ 2023-present