LLM Index

The list of supported providers and models

Provider | Model | Status | Distribution | Release | Deprecation | Parameters | Context length | Max tokens | Price in/out [$/1Mt] | Links
AI21
Jamba 1.5 Large | Aug 22, 2024 | May 06, 2025 | $2 / $8
Jamba 1.5 Mini | Aug 22, 2024 | May 06, 2025 | $0.2 / $0.4
Jamba 1.6 Large | Mar 06, 2025 | Mar 08, 2025 | $2 / $8
Jamba 1.6 Mini | Mar 06, 2025 | Mar 08, 2025 | $0.2 / $0.4
Jamba 1.7 Large | Jul 03, 2025 | $2 / $8
Jamba 1.7 Mini | Jul 03, 2025 | $0.2 / $0.4
Jurassic 2 Light | Mar 09, 2023 | $0.1 / $0.5
Jurassic 2 Mid | Mar 09, 2023 | $0.25 / $1.25
Jurassic 2 Ultra | Mar 09, 2023 | $2 / $10
Aleph Alpha
Luminous Base | Apr 14, 2023 | $30
Luminous Base Control | Apr 14, 2023 | $37.5
Luminous Extended | Apr 14, 2023 | $45
Luminous Extended Control | Apr 14, 2023 | $56.25
Luminous Supreme | Apr 14, 2023 | $175
Luminous Supreme Control | Apr 14, 2023 | $218.75
Anthropic
Claude 2 | Jul 11, 2023 | $8 / $24
Claude 2.1 | Nov 21, 2023 | $8 / $24
Claude 3 Haiku | Mar 15, 2024 | $0.25 / $1.25
Claude 3 Opus | Mar 04, 2024 | $15 / $75
Claude 3 Sonnet | Mar 04, 2024 | Jul 21, 2025 | $3 / $15
Claude 3.5 Haiku (2024/10/22) | Oct 22, 2024 | $0.8 / $4
Claude 3.5 Sonnet (2024/10/22) | Oct 22, 2024 | Oct 22, 2025 | $3 / $15
Claude 3.5 Sonnet (2024/06/20) | Jun 20, 2024 | Oct 22, 2025 | $3 / $15
Claude 3.7 Sonnet (latest) | Feb 24, 2025 | $3 / $15
Claude 4 Opus (2025/05/14) | May 22, 2025 | $15 / $75
Claude 4 Sonnet (2025/05/14) | May 22, 2025 | $3 / $15
Claude 4.1 Opus (2025/08/05) | Aug 05, 2025 | $15 / $75
Claude 4.5 Haiku (2025/10/01) | Oct 15, 2025 | $1 / $5
Claude 4.5 Sonnet (2025/09/29) | Sep 29, 2025 | $3 / $15
Claude Instant 1 | Mar 14, 2023 | $1.63 / $5.51
Claude Instant 1.2 | Sep 21, 2023 | $1.63 / $5.51
Cohere
Command | Feb 07, 2024 | $15
Command A (03/2025) | Mar 13, 2025 | $2.5 / $10
Command Light | Feb 07, 2024 | $15
Command Nightly | Feb 07, 2024 | $15
Command R | Mar 11, 2024 | $0.5 / $1.5
Command R 7b (12/2024) | Dec 13, 2024 | 7B | $0.04 / $0.15
Command R+ | Apr 04, 2024 | $3 / $15
DeepInfra
Code Llama 34B (Instruct HF) | Aug 24, 2023 | 34B | $0.6
DeepSeek Chat 3.1 | Aug 21, 2025 | 671B | $0.3 / $1
DeepSeek R1 | Jan 20, 2025 | 671B | $0.55 / $2.19
DeepSeek R1 Turbo | Mar 26, 2025 | 671B | $1 / $3
DeepSeek V3 | Dec 26, 2024 | 671B | $0.85 / $0.9
Gemma 2 27B | Jun 27, 2024 | 27B | $0.27
Gemma 2 9B | Jun 27, 2024 | 9B | $0.06
Gemma 3 12B | Mar 10, 2025 | 12B | $0.05 / $0.1
Gemma 3 27B | Mar 10, 2025 | 27B | $0.1 / $0.2
Gemma 3 4B | Mar 10, 2025 | 4B | $0.02 / $0.04
Llama 2 13B (Chat HF) | Jul 18, 2023 | 13B | $0.35
Llama 2 70B (Chat HF) | Jul 18, 2023 | 70B | $1.88
Llama 2 7B (Chat HF) | Jul 18, 2023 | 7B | $0.2
Llama 3.3 70B (Instruct) | Dec 06, 2024 | 70B | $0.23 / $0.4
Llama 3.3 70B Turbo (Instruct) | Dec 06, 2024 | 70B | $0.12 / $0.3
Llama 4 Maverick 17B 128e (Instruct FP8) | Apr 05, 2025 | 17B | $0.2 / $0.6
Llama 4 Scout 17B 16e (Instruct) | Apr 05, 2025 | 17B | $0.1 / $0.3
Meta Llama 3.1 405B (Instruct) | Jul 23, 2024 | 405B | $1.79
Meta Llama 3.1 70B (Instruct) | Jul 23, 2024 | 70B | $0.35 / $0.4
Meta Llama 3.1 8B (Instruct) | Jul 23, 2024 | 8B | $0.06
Meta Llama 3.2 1B (Instruct) | Sep 25, 2024 | 1B | $0.01 / $0.02
Meta Llama 3.2 3B (Instruct) | Sep 25, 2024 | 3B | $0.03 / $0.05
Mistral 7B (Instruct v0.1) | Sep 27, 2023 | 7B | $0.2
Mistral Nemo (Instruct 24/07) | Jul 17, 2024 | 12B | $0.13
Mixtral 8x22B (Instruct v0.1) | Sep 19, 2024 | 22B | $0.65
Mixtral 8x7B (Instruct v0.1) | Sep 19, 2024 | 56B | $0.24
Moonshot AI Kimi K2 | Jul 11, 2025 | 32B | $0.55 / $2.2
OpenAI GPT-OSS 120B | Aug 05, 2025 | 117B | $0.09 / $0.45
OpenAI GPT-OSS 20B | Aug 05, 2025 | 21B | $0.04 / $0.16
Phi 4 | Dec 12, 2024 | 14B | $0.07 / $0.14
Qwen 2 72B (Instruct) | Jul 23, 2024 | 72B | $0.35 / $0.4
Qwen 2.5 72B (Instruct) | Sep 19, 2024 | 72B | $0.23 / $0.4
Qwen 3 14B | Apr 29, 2025 | 14B | $0.08 / $0.24
Qwen 3 30B A3B | Apr 29, 2025 | 30B | $0.1 / $0.3
Qwen 3 32B | Apr 29, 2025 | 32B | $0.1 / $0.3
Qwen 3 235B A22B | Apr 29, 2025 | 235B | $0.2 / $0.6
WizardLM 2 7B | Apr 16, 2024 | 7B | $0.06
WizardLM 2 8x22B | Apr 16, 2024 | 176B | $0.5
DeepMind
Chat Bison | May 10, 2023 | $0.5
Code Bison | May 10, 2023 | $0.5
Gemini 1.0 Pro | Dec 13, 2023 | $0.5 / $1.5
Gemini 1.5 Flash | May 14, 2024 | $0.07 / $0.3
Gemini 1.5 Flash (exp 08/27) | Aug 27, 2024 | $0.07 / $0.3
Gemini 1.5 Flash 8B (exp 08/27) | Aug 27, 2024 | 8B | $0.04 / $0.15
Gemini 1.5 Flash 8B | Oct 03, 2024 | 8B | $0.07 / $0.3
Gemini 1.5 Pro | Feb 15, 2024 | $1.25 / $5
Gemini 1.5 Pro (exp 08/01) | Aug 27, 2024 | $1.25 / $5
Gemini 1.5 Pro (exp 08/27) | Aug 27, 2024 | $1.25 / $5
Gemini 1.5 Pro | Feb 15, 2024 | $1.25 / $5
Gemini 2.0 Flash | Dec 11, 2024 | $0.1 / $0.4
Gemini 2.0 Flash (exp) | Dec 11, 2024 | $0
Gemini 2.0 Flash Lite | Feb 05, 2025 | $0.07 / $0.3
Gemini 2.0 Flash Thinking (exp 12/19) | Dec 19, 2024 | $0
Gemini 2.0 Pro (exp 02/05) | Feb 05, 2025 | $0
Gemini 2.5 Flash | May 20, 2025 | $0.02 / $0.6
Gemini 2.5 Flash (preview 05/20) | May 20, 2025 | $0.02 / $0.6
Gemini 2.5 Pro | May 06, 2025 | $1.25 / $10
Gemini 2.5 Pro (preview 03/25) | Mar 25, 2025 | $1.25 / $10
Gemini 2.5 Pro (preview 05/06) | May 06, 2025 | $1.25 / $10
Text Bison | May 10, 2023 | $0.5
DeepSeek
Chat (v3.1) | Aug 21, 2025 | 671B | $0.56 / $1.68
Reasoner (v3.1) | Aug 21, 2025 | 671B | $0.56 / $1.68
FetchAI
ASI-1 mini | Feb 25, 2025 | $0
Groq
Alibaba Qwen 2.5 32B | Sep 19, 2024 | 32B | $0.79
Alibaba Qwen 3 32B | Apr 29, 2025 | 32B | $0.29 / $0.59
Alibaba Qwen QwQ 32B | Mar 05, 2025 | 32B | $0.29 / $0.39
Compound Beta (preview) | Apr 15, 2025 | $0
Compound Beta Mini (preview) | Apr 15, 2025 | $0
DeepSeek R1 Distill Llama 70B | Jan 20, 2025 | 70B | $0.75 / $0.99
DeepSeek R1 Distill Qwen 32B | Jan 20, 2025 | 32B | $0.69
Google Gemma 2 9B | Jun 27, 2024 | 9B | $0.2
Google Gemma 7B | Jan 15, 2024 | 7B | $0
Meta Llama 2 70B | Jan 15, 2024 | 70B | $0
Meta Llama 3 70B | Apr 18, 2024 | 70B | $0.59 / $0.79
Meta Llama 3 8B | Apr 18, 2024 | 8B | $0.05 / $0.08
Meta Llama 3.1 405B (preview) | Jul 23, 2024 | 405B | $0.59 / $0.79
Meta Llama 3.1 70B (preview) | Jul 23, 2024 | 70B | $0.59 / $0.79
Meta Llama 3.1 8B (preview) | Jul 23, 2024 | 8B | $0.05 / $0.08
Meta Llama 3.2 1B (preview) | Sep 25, 2024 | 1B | $0.04
Meta Llama 3.2 3B (preview) | Sep 25, 2024 | 3B | $0.06
Meta Llama 3.3 70B (versatile) | Dec 06, 2024 | 70B | $0.59 / $0.79
Meta Llama 4 Maverick 17B 128e | Apr 05, 2025 | 17B | $0.5 / $0.77
Meta Llama 4 Scout 17B 16e | Apr 05, 2025 | 17B | $0.11 / $0.34
Mistral Saba 24B | Feb 17, 2025 | 24B | $0.79
Mixtral 8x7B | Jan 15, 2024 | 7B | $0.24
Moonshot AI Kimi K2 | Jul 11, 2025 | 32B | $1 / $3
OpenAI GPT-OSS 120B | Aug 05, 2025 | 117B | $0.15 / $0.75
OpenAI GPT-OSS 20B | Aug 05, 2025 | 21B | $0.1 / $0.5
Mistral
Large (24.02) | Feb 26, 2024 | Jun 16, 2025 | $4 / $12
Large (latest) | Nov 18, 2024 | $2 / $6
Large 2.1 (24.11) | Nov 18, 2024 | $2 / $6
Magistral Medium (25.06) | Jun 10, 2025 | $2 / $5
Magistral Small (25.06) | Jun 10, 2025 | $0.5 / $1.5
Medium (23.12) | Dec 11, 2023 | Jun 16, 2025 | $2.7 / $8.1
Medium (latest) | May 07, 2025 | $0.4 / $2
Medium 3 (25.05) | May 07, 2025 | $0.4 / $2
Ministral 3B (latest) | Oct 16, 2024 | 3B | $0.04
Ministral 8B (latest) | Oct 16, 2024 | 8B | $0.1
Nemo (24.07) | Jul 18, 2024 | 12B | $0.3
Open Mixtral 8x22B | Apr 17, 2024 | Mar 30, 2025 | 176B | $2 / $6
Saba (25.02) | Feb 17, 2025 | Sep 30, 2025 | 24B | $0.2 / $0.6
Small (latest) | Dec 11, 2023 | $1 / $3
Small (Mixtral 8x7B) | Dec 11, 2023 | 56B | $0.7
Small (24.02) | Nov 30, 2024 | Jun 16, 2025 | 24B | $1 / $3
Small 3.2 (25.06) | Jun 10, 2025 | 24B | $1 / $3
Tiny (Mistral 7B) | Dec 11, 2023 | Mar 30, 2025 | 7B | $0.25
Moonshot AI
Kimi K2 (0711 preview) | Jul 11, 2025 | 32B | $0.15 / $2.5
NLP Cloud
Chat Dolphin | $0.5
Dolphin | $0.5
OpenAI
Ada 001 | Jun 01, 2020 | Jan 04, 2024 | $0.4
Babbage 001 | Jun 01, 2020 | Jan 04, 2024 | $0.5
Babbage 002 | Jan 04, 2024 | Jan 27, 2025 | $1.6
DaVinci 002 | Jan 04, 2024 | Jan 27, 2025 | $12
DaVinci 003 | Nov 28, 2022 | Jan 04, 2024 | $20
GPT-3.5 Turbo (Instruct) | Nov 30, 2022 | $1.5 / $2
GPT-3.5 Turbo (01/25) | Jan 25, 2024 | $0.5 / $1.5
GPT-3.5 Turbo (11/06) | Nov 06, 2023 | $1 / $2
GPT-3.5 Turbo (03/01) | Jun 13, 2024 | $1.5 / $2
GPT-3.5 Turbo (06/13) | Sep 13, 2024 | $1.5 / $2
GPT-3.5 Turbo | Nov 30, 2022 | $1.5 / $2
GPT-3.5 Turbo 16k (03/01) | Jun 13, 2024 | $3 / $4
GPT-3.5 Turbo 16k (06/13) | Jun 13, 2024 | $3 / $4
GPT-3.5 Turbo 16k | Nov 30, 2022 | $3 / $4
GPT-4 (03/14) | Jun 13, 2024 | $30 / $60
GPT-4 | Mar 14, 2023 | $30 / $60
GPT-4 (06/13) | Jul 19, 2023 | $30 / $60
GPT-4 0125 Turbo | Jan 25, 2024 | $10 / $30
GPT-4 32k | Mar 14, 2023 | Jun 06, 2025 | $60 / $120
GPT-4 32k (03/14) | Jun 06, 2025 | $60 / $120
GPT-4 32k (06/13) | Jun 06, 2025 | $60 / $120
GPT-4 Turbo | Jan 25, 2024 | $10 / $30
GPT-4 Turbo (11/06) | Nov 06, 2023 | $10 / $30
GPT-4 Turbo (2024/04/09) | Apr 09, 2024 | $10 / $30
GPT-4.1 (2025/04/14) | Apr 14, 2025 | $2 / $8
GPT-4.1 | Apr 14, 2025 | $2 / $8
GPT-4.1 mini | Apr 14, 2025 | $0.4 / $1.6
GPT-4.1 mini (2025/04/14) | Apr 14, 2025 | $0.4 / $1.6
GPT-4.1 nano | Apr 14, 2025 | $0.1 / $0.4
GPT-4.1 nano (2025/04/14) | Apr 14, 2025 | $0.1 / $0.4
GPT-4.5 (preview) | Feb 27, 2025 | Jul 14, 2025 | $75 / $150
GPT-4o (2024/11/20) | Nov 20, 2024 | $5 / $15
GPT-4o | May 13, 2024 | $5 / $15
GPT-4o (2024/08/06) | Aug 06, 2024 | $5 / $15
GPT-4o (2024/05/13) | May 13, 2024 | $5 / $15
GPT-4o Search (preview) | Jul 19, 2025 | $2.5 / $10
GPT-4o mini (2024/07/18) | Jul 18, 2024 | $0.15 / $0.6
GPT-4o mini | Jul 18, 2024 | $0.15 / $0.6
GPT-4o mini Search (preview) | Mar 11, 2025 | $0.15 / $0.6
GPT-5 (2025/08/07) | Aug 07, 2025 | $1.25 / $10
GPT-5 mini (2025/08/07) | Aug 07, 2025 | $0.25 / $2
GPT-5 nano (2025/08/07) | Aug 07, 2025 | $0.05 / $0.4
GPT-5 pro (2025/10/06) | Oct 06, 2025 | $15 / $120
o1 | Sep 12, 2024 | $15 / $60
o1 (2024/12/17) | Sep 12, 2024 | $15 / $60
o1 mini | Sep 12, 2024 | $3 / $12
o1 mini (2024/09/12) | Sep 12, 2024 | $3 / $12
o1 preview | Sep 12, 2024 | $15 / $60
o1 preview (2024/09/12) | Sep 12, 2024 | $15 / $60
o1-pro | Mar 19, 2025 | $150 / $600
o3 (2025/04/16) | Apr 16, 2025 | $10 / $40
o3 | Apr 16, 2025 | $10 / $40
o3 mini | Jan 31, 2025 | $1.1 / $4.4
o3 mini (2025/01/31) | Jan 31, 2025 | $1.1 / $4.4
o3 pro | Jun 10, 2025 | $20 / $80
o4 mini (2025/04/16) | Apr 16, 2025 | $1.1 / $4.4
o4 mini | Apr 16, 2025 | $1.1 / $4.4
o4 mini deep research (2025/06/26) | Jun 26, 2025 | $2 / $8
OpenRouter
DeepSeek Chat 3.1 | Aug 21, 2025 | 671B | $0.2 / $0.8
Ernie 4.5 300B A47B | Jun 30, 2025 | 300B | $0.28 / $1.1
Hunyuan A13B | Jun 27, 2025 | 80B | $0.03
Moonshot AI Kimi K2 | Jul 11, 2025 | 32B | $0.14 / $2.49
OpenAI GPT-OSS 120B | Aug 05, 2025 | 117B | $0.09 / $0.45
OpenAI GPT-OSS 20B | Aug 05, 2025 | 21B | $0.04 / $0.16
Perplexity
Code Llama 34B (Instruct) | Oct 04, 2023 | 34B | $0.35 / $1.4
Code Llama 70B (Instruct) | Oct 04, 2023 | 70B | $0.7 / $2.8
Llama 2 70B | Oct 04, 2023 | 70B | $0.7 / $2.8
Llama 3 70B (Instruct) | May 14, 2024 | Aug 12, 2024 | 70B | $0 / $1
Llama 3 8B (Instruct) | May 14, 2024 | Aug 12, 2024 | 8B | $0 / $0.2
Llama 3 Sonar large 32k (Chat) | May 14, 2024 | Aug 12, 2024 | $0 / $1
Llama 3 Sonar large 32k (Online) | May 14, 2024 | Aug 12, 2024 | $0 / $1
Llama 3 Sonar small 32k (Chat) | May 14, 2024 | Aug 12, 2024 | $0 / $0.2
Llama 3 Sonar small 32k (Online) | May 14, 2024 | Aug 12, 2024 | $0 / $0.2
Llama 3.1 Sonar huge 128k (Online) | Aug 14, 2024 | Feb 22, 2025 | $0 / $5
Llama 3.1 Sonar large 128k (Online) | Jul 31, 2024 | Feb 22, 2025 | $0 / $1
Llama 3.1 Sonar large 128k (Chat) | Jul 31, 2024 | Feb 22, 2025 | $0 / $1
Llama 3.1 Sonar small 128k (Online) | Jul 31, 2024 | Feb 22, 2025 | $0 / $0.2
Llama 3.1 Sonar small 128k (Chat) | Jul 31, 2024 | Feb 22, 2025 | $0 / $0.2
Mistral 7B | Oct 04, 2023 | Aug 12, 2024 | 7B | $0.07 / $0.28
Mixtral 8x7B (Instruct) | Oct 04, 2023 | Aug 12, 2024 | 7B | $0 / $0.6
R1 1776 | Feb 18, 2025 | 671B | $2 / $8
Sonar | Jan 25, 2025 | $1
Sonar Deep Research | Feb 14, 2025 | $2 / $8
Sonar Pro | Jan 25, 2025 | $3 / $15
Sonar Reasoning | Jan 30, 2025 | $1 / $5
Sonar Reasoning Pro | Jan 30, 2025 | $2 / $8
Sonar medium (Online) | Feb 24, 2024 | $0 / $1.8
Sonar medium (Chat) | Feb 24, 2024 | $0.6 / $1.8
Sonar small (Online) | Feb 24, 2024 | $0 / $0.28
Sonar small (Chat) | Feb 24, 2024 | $0.07 / $0.28
pplx 70B (Online) | Nov 29, 2023 | 70B | $0 / $2.8
pplx 70B (Chat) | Oct 27, 2023 | 70B | $0.7 / $2.8
pplx 7B (Chat) | Oct 27, 2023 | 7B | $0.07 / $0.28
pplx 7B (Online) | Nov 29, 2023 | 7B | $0 / $0.28
Replicate
Llama 2 13B | 13B | $0.1 / $0.5
Llama 2 13B chat | 13B | $0.1 / $0.5
Meta Llama 3 8B | 8B | $0.05 / $0.25
Venice
DeepSeek R1 | Jan 20, 2025 | 671B | $3.5 / $14
Qwen 3 (235B) | Apr 28, 2025 | 235B | $1.5 / $6
Venice Uncensored (1.1) | Jul 03, 2025 | $0.5 / $2
xAI
Grok (beta) | Nov 04, 2024 | $5 / $15
Grok 1 | Nov 06, 2020 | 314B | $0
Grok 1.5 | Mar 28, 2024 | $0
Grok 2 (12/12) | Aug 13, 2024 | Sep 15, 2025 | 314B | $2 / $10
Grok 2 mini | Aug 13, 2024 | Sep 15, 2025 | 35B | $0
Grok 3 | Feb 19, 2025 | $3 / $15
Grok 3 fast | Feb 19, 2025 | Sep 15, 2025 | $5 / $25
Grok 3 mini | Feb 19, 2025 | $0.3 / $0.5
Grok 3 mini fast | Feb 19, 2025 | Sep 15, 2025 | $0.6 / $4
Grok 4 | Jul 09, 2025 | $3 / $15
Grok 4 fast non-reasoning | Sep 19, 2025 | $0.2 / $0.5
Grok 4 fast reasoning | Sep 19, 2025 | $0.2 / $0.5

Promptmetheus currently supports 15 providers and 151 models.
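
The price column quotes input and output prices in US dollars per 1M tokens ("$/1Mt"). As a rough illustration of how to read it, the sketch below estimates the cost of a single request from a price cell. The function names `parse_price` and `estimate_cost` are hypothetical helpers, not part of Promptmetheus, and the handling of single-value cells such as "$30" (treated here as one price for both input and output) is an assumption rather than something the list states.

```python
from decimal import Decimal

TOKENS_PER_UNIT = 1_000_000  # prices in the list are quoted per 1M tokens


def parse_price(cell: str) -> tuple[Decimal, Decimal]:
    """Split a price cell such as "$3 / $15" into (input, output) USD per 1M tokens.

    Assumption: a single value such as "$30" applies to both input and output.
    """
    parts = [Decimal(p.strip().lstrip("$")) for p in cell.split("/")]
    if len(parts) == 1:
        return parts[0], parts[0]
    if len(parts) == 2:
        return parts[0], parts[1]
    raise ValueError(f"unexpected price cell: {cell!r}")


def estimate_cost(cell: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request from a price cell and token counts."""
    price_in, price_out = parse_price(cell)
    return (price_in * input_tokens + price_out * output_tokens) / TOKENS_PER_UNIT


# Example: a "$3 / $15" model with 2,000 input tokens and 500 output tokens.
print(estimate_cost("$3 / $15", 2_000, 500))
```

Decimal is used instead of float so that small per-token amounts do not pick up binary rounding noise. Actual billing may differ (cached tokens, minimum charges, tiered pricing), so treat the result as an estimate only.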

Legend

Status: Supported, Deprecated, Not available

Distribution: Proprietary, Open weights, Open source

Links: Info, Announcement, Benchmark

Disclaimer

Although we try to keep this list accurate and up to date, please do not rely on it for critical use cases and always double-check official sources.

Model Parameters

Promptmetheus currently relies on the LiteLLM open-source library to efficiently connect to the different LLM APIs. You can find all relevant details for model parameter support in their documentation.

Shout-out to Krrish and Ishaan who make adding new models easy as pie.

LLM Support

We aim to eventually support all major LLM providers that have a public API, and we add new foundation models as soon as they are released. We also plan to support local inference and hosted products from Amazon, Google, Microsoft, etc. in the future.

If there is a provider or model missing that you would like to use in your Promptmetheus project, please don't hesitate to request it.

LLM Benchmarks

We currently don't do any benchmarking ourselves. Please consult the listed links for benchmarks of each specific model.

Promptmetheus © 2025