| Provider | Model | Status | Distribution | Release | Deprecation | Parameters | Context length | Max tokens | Price in/out [$/1Mt] | Links |
|---|---|---|---|---|---|---|---|---|---|---|---|
| AI21 | Jamba 1.5 Large | Aug 22, 2024 | May 06, 2025 | $2 / $8 | ||||||
Jamba 1.5 Mini | Aug 22, 2024 | May 06, 2025 | $0.2 / $0.4 | |||||||
Jamba 1.6 Large | Mar 06, 2025 | Mar 08, 2025 | $2 / $8 | |||||||
Jamba 1.6 Mini | Mar 06, 2025 | Mar 08, 2025 | $0.2 / $0.4 | |||||||
Jamba 1.7 Large | Jul 03, 2025 | $2 / $8 | ||||||||
Jamba 1.7 Mini | Jul 03, 2025 | $0.2 / $0.4 | ||||||||
Jurassic 2 Light | Mar 09, 2023 | $0.1 / $0.5 | ||||||||
Jurassic 2 Mid | Mar 09, 2023 | $0.25 / $1.25 | ||||||||
Jurassic 2 Ultra | Mar 09, 2023 | $2 / $10 | ||||||||
| Aleph Alpha | Luminous Base | Apr 14, 2023 | $30 | |||||||
Luminous Base Control | Apr 14, 2023 | $37.5 | ||||||||
Luminous Extended | Apr 14, 2023 | $45 | ||||||||
Luminous Extended Control | Apr 14, 2023 | $56.25 | ||||||||
Luminous Supreme | Apr 14, 2023 | $175 | ||||||||
Luminous Supreme Control | Apr 14, 2023 | $218.75 | ||||||||
| Anthropic | Claude 2 | Jul 11, 2023 | $8 / $24 | |||||||
Claude 2.1 | Nov 21, 2023 | $8 / $24 | ||||||||
Claude 3 Haiku | Mar 15, 2024 | $0.25 / $1.25 | ||||||||
Claude 3 Opus | Mar 04, 2024 | Jan 30, 2025 | $15 / $75 | |||||||
Claude 3 Sonnet | Mar 04, 2024 | Jul 21, 2025 | $3 / $15 | |||||||
Claude 3.5 Haiku 2024/10/22 | Oct 22, 2024 | $0.8 / $4 | ||||||||
Claude 3.5 Sonnet 2024/06/20 | Jun 20, 2024 | Oct 22, 2025 | $3 / $15 | |||||||
Claude 3.5 Sonnet 2024/10/22 | Oct 22, 2024 | Oct 22, 2025 | $3 / $15 | |||||||
Claude 3.7 Sonnet latest | Feb 24, 2025 | $3 / $15 | ||||||||
Claude 4 Opus 2025/05/14 | May 22, 2025 | $15 / $75 | ||||||||
Claude 4 Sonnet 2025/05/14 | May 22, 2025 | $3 / $15 | ||||||||
Claude 4.1 Opus 2025/08/05 | Aug 05, 2025 | $15 / $75 | ||||||||
Claude 4.5 Haiku 2025/10/01 | Oct 15, 2025 | $1 / $5 | ||||||||
Claude 4.5 Sonnet 2025/09/29 | Sep 29, 2025 | $3 / $15 | ||||||||
Claude Instant 1 | Mar 14, 2023 | $1.63 / $5.51 | ||||||||
Claude Instant 1.2 | Sep 21, 2023 | $1.63 / $5.51 | ||||||||
| Cohere | Aya Expanse 32B | Oct 24, 2024 | 32B | $0 | ||||||
Aya Expanse 8B | Oct 24, 2024 | 8B | $0 | |||||||
Command | Feb 07, 2024 | Sep 15, 2025 | $15 | |||||||
Command A 03/2025 | Mar 13, 2025 | $2.5 / $10 | ||||||||
Command A Reasoning 08/2025 | Aug 21, 2025 | $2.5 / $10 | ||||||||
Command Light | Feb 07, 2024 | Sep 15, 2025 | $15 | |||||||
Command Nightly | Feb 07, 2024 | Sep 15, 2025 | $15 | |||||||
Command R 08/2024 | Mar 11, 2024 | $0.15 / $0.6 | ||||||||
Command R 7b 12/2024 | Dec 13, 2024 | 7B | $0.04 / $0.15 | |||||||
Command R+ 08/2024 | Apr 04, 2024 | $2.5 / $10 | ||||||||
| DeepInfra | Code Llama 34B Instruct HF | Aug 24, 2023 | 34B | $0.6 | ||||||
DeepSeek R1 | Jan 20, 2025 | 671B | $0.55 / $2.19 | |||||||
DeepSeek R1 Turbo | Mar 26, 2025 | 671B | $1 / $3 | |||||||
DeepSeek V3 | Dec 26, 2024 | 671B | $0.85 / $0.9 | |||||||
DeepSeek V3.1 | Aug 21, 2025 | 671B | $0.3 / $1 | |||||||
DeepSeek V3.2 Exp | Sep 29, 2025 | 671B | $0.27 / $0.4 | |||||||
Gemma 2 27B | Jun 27, 2024 | 27B | $0.27 | |||||||
Gemma 2 9B | Jun 27, 2024 | 9B | $0.06 | |||||||
Gemma 3 12B | Mar 10, 2025 | 12B | $0.05 / $0.1 | |||||||
Gemma 3 27B | Mar 10, 2025 | 27B | $0.1 / $0.2 | |||||||
Gemma 3 4B | Mar 10, 2025 | 4B | $0.02 / $0.04 | |||||||
Kimi K2 Instruct | Jul 11, 2025 | 32B | $0.55 / $2.2 | |||||||
Kimi K2 Thinking | Nov 11, 2025 | 32B | $0.55 / $2.5 | |||||||
Llama 2 13B Chat HF | Jul 18, 2023 | 13B | $0.35 | |||||||
Llama 2 70B Chat HF | Jul 18, 2023 | 70B | $1.88 | |||||||
Llama 2 7B Chat HF | Jul 18, 2023 | 7B | $0.2 | |||||||
Llama 3.3 70B Instruct | Dec 06, 2024 | 70B | $0.23 / $0.4 | |||||||
Llama 3.3 70B Turbo Instruct | Dec 06, 2024 | 70B | $0.12 / $0.3 | |||||||
Llama 4 Maverick 17B 128e Instruct FP8 | Apr 05, 2025 | 17B | $0.2 / $0.6 | |||||||
Llama 4 Scout 17B 16e Instruct | Apr 05, 2025 | 17B | $0.1 / $0.3 | |||||||
Meta Llama 3.1 405B Instruct | Jul 23, 2024 | 405B | $1.79 | |||||||
Meta Llama 3.1 70B Instruct | Jul 23, 2024 | 70B | $0.35 / $0.4 | |||||||
Meta Llama 3.1 8B Instruct | Jul 23, 2024 | 8B | $0.06 | |||||||
Meta Llama 3.2 1B Instruct | Sep 25, 2024 | 1B | $0.01 / $0.02 | |||||||
Meta Llama 3.2 3B Instruct | Sep 25, 2024 | 3B | $0.03 / $0.05 | |||||||
MiniMax M2 | Nov 11, 2025 | 10B | $0.27 / $1.15 | |||||||
Mistral 7B Instruct v0.1 | Sep 27, 2023 | 7B | $0.2 | |||||||
Mistral Nemo Instruct 24/07 | Jul 17, 2024 | 12B | $0.13 | |||||||
Mixtral 8x22B Instruct v0.1 | Sep 19, 2024 | 22B | $0.65 | |||||||
Mixtral 8x7B Instruct v0.1 | Sep 19, 2024 | 56B | $0.24 | |||||||
OpenAI GPT-OSS 120B | Aug 05, 2025 | 117B | $0.09 / $0.45 | |||||||
OpenAI GPT-OSS 20B | Aug 05, 2025 | 21B | $0.04 / $0.16 | |||||||
Phi 4 | Dec 12, 2024 | 14B | $0.07 / $0.14 | |||||||
Qwen 2 72B Instruct | Jul 23, 2024 | 72B | $0.35 / $0.4 | |||||||
Qwen 2.5 72B Instruct | Sep 19, 2024 | 72B | $0.23 / $0.4 | |||||||
Qwen 3 14B | Apr 29, 2025 | 14B | $0.08 / $0.24 | |||||||
Qwen 3 30B A3B | Apr 29, 2025 | 30B | $0.1 / $0.3 | |||||||
Qwen 3 32B | Apr 29, 2025 | 32B | $0.1 / $0.3 | |||||||
Qwen 3 235B A22B | Apr 29, 2025 | 235B | $0.2 / $0.6 | |||||||
WizardLM 2 7B | Apr 16, 2024 | 7B | $0.06 | |||||||
WizardLM 2 8x22B | Apr 16, 2024 | 176B | $0.5 | |||||||
| DeepMind | Chat Bison | May 10, 2023 | $0.5 | |||||||
Code Bison | May 10, 2023 | $0.5 | ||||||||
Gemini 1.0 Pro | Dec 13, 2023 | $0.5 / $1.5 | ||||||||
Gemini 1.5 Flash exp 08/27 | Aug 27, 2024 | $0.07 / $0.3 | ||||||||
Gemini 1.5 Flash | May 14, 2024 | Sep 24, 2025 | $0.07 / $0.3 | |||||||
Gemini 1.5 Flash 8B | Oct 03, 2024 | Sep 24, 2025 | 8B | $0.07 / $0.3 | ||||||
Gemini 1.5 Flash 8B exp 08/27 | Aug 27, 2024 | 8B | $0.04 / $0.15 | |||||||
Gemini 1.5 Pro | Feb 15, 2024 | May 24, 2025 | $1.25 / $5 | |||||||
Gemini 1.5 Pro | Feb 15, 2024 | $1.25 / $5 | ||||||||
Gemini 1.5 Pro exp 08/01 | Aug 27, 2024 | $1.25 / $5 | ||||||||
Gemini 1.5 Pro exp 08/27 | Aug 27, 2024 | $1.25 / $5 | ||||||||
Gemini 2.0 Flash exp | Dec 11, 2024 | $0 | ||||||||
Gemini 2.0 Flash | Dec 11, 2024 | $0.1 / $0.4 | ||||||||
Gemini 2.0 Flash Lite | Feb 05, 2025 | $0.07 / $0.3 | ||||||||
Gemini 2.0 Flash Thinking exp 12/19 | Dec 19, 2024 | $0 | ||||||||
Gemini 2.0 Pro exp 02/05 | Feb 05, 2025 | $0 | ||||||||
Gemini 2.5 Flash preview (05/20) | May 20, 2025 | $0.02 / $0.6 | ||||||||
Gemini 2.5 Flash | May 20, 2025 | $0.3 / $2.5 | ||||||||
Gemini 2.5 Flash Lite | Jul 22, 2025 | $0.1 / $0.4 | ||||||||
Gemini 2.5 Pro preview (03/25) | Mar 25, 2025 | $1.25 / $10 | ||||||||
Gemini 2.5 Pro preview (05/06) | May 06, 2025 | $1.25 / $10 | ||||||||
Gemini 2.5 Pro | May 06, 2025 | $1.25 / $10 | ||||||||
Gemini 3 Pro preview | Nov 18, 2025 | $2 / $12 | ||||||||
Text Bison | May 10, 2023 | $0.5 | ||||||||
| DeepSeek | Chat v3.2 | Sep 29, 2025 | 671B | $0.28 / $0.42 | ||||||
Reasoner v3.2 | Sep 29, 2025 | 671B | $0.28 / $0.42 | |||||||
| FetchAI | ASI:One extended | Apr 22, 2025 | $0 | |||||||
ASI:One fast | Apr 30, 2025 | $0 | ||||||||
ASI:One mini | Feb 25, 2025 | $0 | ||||||||
| Groq | Alibaba Qwen 2.5 32B | Sep 19, 2024 | 32B | $0.79 | ||||||
Alibaba Qwen 3 32B | Apr 29, 2025 | 32B | $0.29 / $0.59 | |||||||
Alibaba Qwen QwQ 32B | Mar 05, 2025 | 32B | $0.29 / $0.39 | |||||||
Compound | Apr 15, 2025 | $0 | ||||||||
Compound Mini | Apr 15, 2025 | $0 | ||||||||
DeepSeek R1 Distill Llama 70B | Jan 20, 2025 | Oct 02, 2025 | 70B | $0.75 / $0.99 | ||||||
DeepSeek R1 Distill Qwen 32B | Jan 20, 2025 | Apr 14, 2025 | 32B | $0.69 | ||||||
Google Gemma 2 9B | Jun 27, 2024 | Oct 08, 2025 | 9B | $0.2 | ||||||
Google Gemma 7B | Jan 15, 2024 | 7B | $0 | |||||||
Meta Llama 2 70B | Jan 15, 2024 | 70B | $0 | |||||||
Meta Llama 3 70B | Apr 18, 2024 | 70B | $0.59 / $0.79 | |||||||
Meta Llama 3 8B | Apr 18, 2024 | 8B | $0.05 / $0.08 | |||||||
Meta Llama 3.1 405B preview | Jul 23, 2024 | 405B | $0.59 / $0.79 | |||||||
Meta Llama 3.1 70B preview | Jul 23, 2024 | 70B | $0.59 / $0.79 | |||||||
Meta Llama 3.1 8B | Jul 23, 2024 | 8B | $0.05 / $0.08 | |||||||
Meta Llama 3.2 1B preview | Sep 25, 2024 | 1B | $0.04 | |||||||
Meta Llama 3.2 3B preview | Sep 25, 2024 | 3B | $0.06 | |||||||
Meta Llama 3.3 70B | Dec 06, 2024 | 70B | $0.59 / $0.79 | |||||||
Meta Llama 4 Maverick 17B 128e | Apr 05, 2025 | 17B | $0.2 / $0.6 | |||||||
Meta Llama 4 Scout 17B 16e | Apr 05, 2025 | 17B | $0.11 / $0.34 | |||||||
Mistral Saba 24B | Feb 17, 2025 | Jul 30, 2025 | 24B | $0.79 | ||||||
Mixtral 8x7B | Jan 15, 2024 | 7B | $0.24 | |||||||
Moonshot AI Kimi K2 09/05 | Sep 05, 2025 | 32B | $1 / $3 | |||||||
Moonshot AI Kimi K2 Instruct | Jul 11, 2025 | Oct 10, 2025 | 32B | $1 / $3 | ||||||
OpenAI GPT-OSS 120B | Aug 05, 2025 | 117B | $0.15 / $0.6 | |||||||
OpenAI GPT-OSS 20B | Aug 05, 2025 | 21B | $0.07 / $0.3 | |||||||
| Mistral | Large 24.02 | Feb 26, 2024 | Jun 16, 2025 | $4 / $12 | ||||||
Large 2.1 24.11 | Nov 18, 2024 | $2 / $6 | ||||||||
Magistral Medium 25.06 | Jun 10, 2025 | $2 / $5 | ||||||||
Magistral Small 25.06 | Jun 10, 2025 | $0.5 / $1.5 | ||||||||
Medium latest | May 07, 2025 | $0.4 / $2 | ||||||||
Medium 23.12 | Dec 11, 2023 | Jun 16, 2025 | $2.7 / $8.1 | |||||||
Medium 3 25.05 | May 07, 2025 | $0.4 / $2 | ||||||||
Ministral 3B latest | Oct 16, 2024 | 3B | $0.04 | |||||||
Ministral 8B latest | Oct 16, 2024 | 8B | $0.1 | |||||||
Nemo 24.07 | Jul 18, 2024 | 12B | $0.3 | |||||||
Open Mixtral 8x22B | Apr 17, 2024 | Mar 30, 2025 | 176B | $2 / $6 | ||||||
Saba 25.02 | Feb 17, 2025 | Sep 30, 2025 | 24B | $0.2 / $0.6 | ||||||
Small Mixtral 8x7B | Dec 11, 2023 | 56B | $0.7 | |||||||
Small 24.02 | Nov 30, 2024 | Jun 16, 2025 | 24B | $1 / $3 | ||||||
Small latest | Dec 11, 2023 | $1 / $3 | ||||||||
Small 3.2 25.06 | Jun 10, 2025 | 24B | $1 / $3 | |||||||
Tiny Mistral 7B | Dec 11, 2023 | Mar 30, 2025 | 7B | $0.25 | ||||||
| Moonshot AI | Kimi K2 09/05 preview | Sep 05, 2025 | 32B | $0.6 / $2.5 | ||||||
Kimi K2 07/11 preview | Jul 11, 2025 | 32B | $0.6 / $2.5 | |||||||
Kimi K2 thinking | Nov 06, 2025 | 32B | $0.6 / $2.5 | |||||||
Kimi K2 thinking turbo | Nov 06, 2025 | 32B | $1.15 / $8 | |||||||
Kimi K2 turbo preview | Aug 03, 2025 | 32B | $1.15 / $8 | |||||||
| NLP Cloud | Chat Dolphin | $0.5 | ||||||||
Dolphin | $0.5 | |||||||||
| OpenAI | Ada 001 | Jun 01, 2020 | Jan 04, 2024 | $0.4 | ||||||
Babbage 001 | Jun 01, 2020 | Jan 04, 2024 | $0.5 | |||||||
Babbage 002 | Jan 04, 2024 | Jan 27, 2025 | $1.6 | |||||||
DaVinci 002 | Jan 04, 2024 | Jan 27, 2025 | $12 | |||||||
DaVinci 003 | Nov 28, 2022 | Jan 04, 2024 | $20 | |||||||
GPT-3.5 Turbo 06/13 | Sep 13, 2024 | $1.5 / $2 | ||||||||
GPT-3.5 Turbo 03/01 | Jun 13, 2024 | $1.5 / $2 | ||||||||
GPT-3.5 Turbo Instruct | Nov 30, 2022 | $1.5 / $2 | ||||||||
GPT-3.5 Turbo | Nov 30, 2022 | $1.5 / $2 | ||||||||
GPT-3.5 Turbo 01/25 | Jan 25, 2024 | $0.5 / $1.5 | ||||||||
GPT-3.5 Turbo 11/06 | Nov 06, 2023 | $1 / $2 | ||||||||
GPT-3.5 Turbo 16k 03/01 | Jun 13, 2024 | $3 / $4 | ||||||||
GPT-3.5 Turbo 16k 06/13 | Jun 13, 2024 | $3 / $4 | ||||||||
GPT-3.5 Turbo 16k | Nov 30, 2022 | $3 / $4 | ||||||||
GPT-4 | Mar 14, 2023 | $30 / $60 | ||||||||
GPT-4 03/14 | Jun 13, 2024 | $30 / $60 | ||||||||
GPT-4 06/13 | Jul 19, 2023 | $30 / $60 | ||||||||
GPT-4 0125 Turbo | Jan 25, 2024 | $10 / $30 | ||||||||
GPT-4 32k 06/13 | Jun 06, 2025 | $60 / $120 | ||||||||
GPT-4 32k | Mar 14, 2023 | Jun 06, 2025 | $60 / $120 | |||||||
GPT-4 32k 03/14 | Jun 06, 2025 | $60 / $120 | ||||||||
GPT-4 Turbo 2024/04/09 | Apr 09, 2024 | $10 / $30 | ||||||||
GPT-4 Turbo 11/06 | Nov 06, 2023 | $10 / $30 | ||||||||
GPT-4 Turbo | Jan 25, 2024 | $10 / $30 | ||||||||
GPT-4.1 | Apr 14, 2025 | $2 / $8 | ||||||||
GPT-4.1 2025/04/14 | Apr 14, 2025 | $2 / $8 | ||||||||
GPT-4.1 mini | Apr 14, 2025 | $0.4 / $1.6 | ||||||||
GPT-4.1 mini 2025/04/14 | Apr 14, 2025 | $0.4 / $1.6 | ||||||||
GPT-4.1 nano 2025/04/14 | Apr 14, 2025 | $0.1 / $0.4 | ||||||||
GPT-4.1 nano | Apr 14, 2025 | $0.1 / $0.4 | ||||||||
GPT-4.5 preview | Feb 27, 2025 | Jul 14, 2025 | $75 / $150 | |||||||
GPT-4o 2024/11/20 | Nov 20, 2024 | $5 / $15 | ||||||||
GPT-4o | May 13, 2024 | $5 / $15 | ||||||||
GPT-4o 2024/05/13 | May 13, 2024 | $5 / $15 | ||||||||
GPT-4o 2024/08/06 | Aug 06, 2024 | $5 / $15 | ||||||||
GPT-4o Search preview | Jul 19, 2025 | $2.5 / $10 | ||||||||
GPT-4o mini 2024/07/18 | Jul 18, 2024 | $0.15 / $0.6 | ||||||||
GPT-4o mini | Jul 18, 2024 | $0.15 / $0.6 | ||||||||
GPT-4o mini Search preview | Mar 11, 2025 | $0.15 / $0.6 | ||||||||
GPT-5 2025/08/07 | Aug 07, 2025 | $1.25 / $10 | ||||||||
GPT-5 mini 2025/08/07 | Aug 07, 2025 | $0.25 / $2 | ||||||||
GPT-5 nano 2025/08/07 | Aug 07, 2025 | $0.05 / $0.4 | ||||||||
GPT-5 pro 2025/10/06 | Oct 06, 2025 | $15 / $120 | ||||||||
GPT-5.1 2025/11/13 | Nov 13, 2025 | $1.25 / $10 | ||||||||
o1 2024/12/17 | Sep 12, 2024 | $15 / $60 | ||||||||
o1 | Sep 12, 2024 | $15 / $60 | ||||||||
o1 mini | Sep 12, 2024 | $3 / $12 | ||||||||
o1 mini 2024/09/12 | Sep 12, 2024 | $3 / $12 | ||||||||
o1 preview | Sep 12, 2024 | $15 / $60 | ||||||||
o1 preview 2024/09/12 | Sep 12, 2024 | $15 / $60 | ||||||||
o1-pro | Mar 19, 2025 | $150 / $600 | ||||||||
o3 2025/04/16 | Apr 16, 2025 | $10 / $40 | ||||||||
o3 | Apr 16, 2025 | $10 / $40 | ||||||||
o3 mini | Jan 31, 2025 | $1.1 / $4.4 | ||||||||
o3 mini 2025/01/31 | Jan 31, 2025 | $1.1 / $4.4 | ||||||||
o3 pro | Jun 10, 2025 | $20 / $80 | ||||||||
o4 mini 2025/04/16 | Apr 16, 2025 | $1.1 / $4.4 | ||||||||
o4 mini | Apr 16, 2025 | $1.1 / $4.4 | ||||||||
o4 mini deep research 2025/06/26 | Jun 26, 2025 | $2 / $8 | ||||||||
| OpenRouter | Claude 4.5 Sonnet | Sep 29, 2025 | $3 / $15 | |||||||
DeepSeek V3.1 | Aug 21, 2025 | 671B | $0.2 / $0.8 | |||||||
DeepSeek V3.2 Exp | Sep 29, 2025 | 671B | $0.27 / $0.4 | |||||||
Ernie 4.5 300B A47B | Jun 30, 2025 | 300B | $0.28 / $1.1 | |||||||
Gemini 3 Pro preview | Nov 18, 2025 | $2 / $12 | ||||||||
Grok 4.1 Fast | Nov 19, 2025 | $0 | ||||||||
Hunyuan A13B | Jun 27, 2025 | 80B | $0.03 | |||||||
Moonshot AI Kimi K2 | Jul 11, 2025 | 32B | $0.14 / $2.49 | |||||||
OpenAI GPT-OSS 120B | Aug 05, 2025 | 117B | $0.09 / $0.45 | |||||||
OpenAI GPT-OSS 20B | Aug 05, 2025 | 21B | $0.04 / $0.16 | |||||||
| Perplexity | Code Llama 34B Instruct | Oct 04, 2023 | 34B | $0.35 / $1.4 | ||||||
Code Llama 70B Instruct | Oct 04, 2023 | 70B | $0.7 / $2.8 | |||||||
Llama 2 70B | Oct 04, 2023 | 70B | $0.7 / $2.8 | |||||||
Llama 3 70B Instruct | May 14, 2024 | Aug 12, 2024 | 70B | $0 / $1 | ||||||
Llama 3 8B Instruct | May 14, 2024 | Aug 12, 2024 | 8B | $0 / $0.2 | ||||||
Llama 3 Sonar large 32k Online | May 14, 2024 | Aug 12, 2024 | $0 / $1 | |||||||
Llama 3 Sonar large 32k Chat | May 14, 2024 | Aug 12, 2024 | $0 / $1 | |||||||
Llama 3 Sonar small 32k Online | May 14, 2024 | Aug 12, 2024 | $0 / $0.2 | |||||||
Llama 3 Sonar small 32k Chat | May 14, 2024 | Aug 12, 2024 | $0 / $0.2 | |||||||
Llama 3.1 Sonar huge 128k Online | Aug 14, 2024 | Feb 22, 2025 | $0 / $5 | |||||||
Llama 3.1 Sonar large 128k Online | Jul 31, 2024 | Feb 22, 2025 | $0 / $1 | |||||||
Llama 3.1 Sonar large 128k Chat | Jul 31, 2024 | Feb 22, 2025 | $0 / $1 | |||||||
Llama 3.1 Sonar small 128k Online | Jul 31, 2024 | Feb 22, 2025 | $0 / $0.2 | |||||||
Llama 3.1 Sonar small 128k Chat | Jul 31, 2024 | Feb 22, 2025 | $0 / $0.2 | |||||||
Mistral 7B | Oct 04, 2023 | Aug 12, 2024 | 7B | $0.07 / $0.28 | ||||||
Mixtral 8x7B Instruct | Oct 04, 2023 | Aug 12, 2024 | 7B | $0 / $0.6 | ||||||
R1 1776 | Feb 18, 2025 | Aug 01, 2025 | 671B | $2 / $8 | ||||||
Sonar | Jan 25, 2025 | $1 | ||||||||
Sonar Deep Research | Feb 14, 2025 | $2 / $8 | ||||||||
Sonar Pro | Jan 25, 2025 | $3 / $15 | ||||||||
Sonar Reasoning | Jan 30, 2025 | $1 / $5 | ||||||||
Sonar Reasoning Pro | Jan 30, 2025 | $2 / $8 | ||||||||
Sonar medium Chat | Feb 24, 2024 | $0.6 / $1.8 | ||||||||
Sonar medium Online | Feb 24, 2024 | $0 / $1.8 | ||||||||
Sonar small Chat | Feb 24, 2024 | $0.07 / $0.28 | ||||||||
Sonar small Online | Feb 24, 2024 | $0 / $0.28 | ||||||||
pplx 70B Online | Nov 29, 2023 | 70B | $0 / $2.8 | |||||||
pplx 70B Chat | Oct 27, 2023 | 70B | $0.7 / $2.8 | |||||||
pplx 7B Online | Nov 29, 2023 | 7B | $0 / $0.28 | |||||||
pplx 7B Chat | Oct 27, 2023 | 7B | $0.07 / $0.28 | |||||||
| Replicate | Llama 2 13B | 13B | $0.1 / $0.5 | |||||||
Llama 2 13B chat | 13B | $0.1 / $0.5 | ||||||||
Meta Llama 3 8B | 8B | $0.05 / $0.25 | ||||||||
| Venice | DeepSeek R1 | Jan 20, 2025 | 671B | $3.5 / $14 | ||||||
GLM 4.6 | Sep 30, 2025 | 357B | $0.85 / $2.75 | |||||||
Qwen 3 235B A22B Instruct 25/07 | Jul 21, 2025 | 235B | $0.15 / $0.75 | |||||||
Qwen 3 235B A22B Thinking 25/07 | Jul 21, 2025 | 235B | $0.45 / $3.5 | |||||||
Venice Large (Qwen 3 235B) 1.1 (D) | Apr 28, 2025 | 235B | $0.45 / $3.5 | |||||||
Venice Medium (Mistral 3.1 24B) 3.1 | Apr 28, 2025 | 24B | $0.5 / $2 | |||||||
Venice Small (Qwen 3 4B) | May 12, 2025 | 4B | $0.05 / $0.15 | |||||||
Venice Uncensored 1.1 | Jul 03, 2025 | $0.2 / $0.9 | ||||||||
| xAI | Grok beta | Nov 04, 2024 | $5 / $15 | |||||||
Grok 1 | Nov 06, 2020 | 314B | $0 | |||||||
Grok 1.5 | Mar 28, 2024 | $0 | ||||||||
Grok 2 12/12 | Aug 13, 2024 | Sep 15, 2025 | 314B | $2 / $10 | ||||||
Grok 2 Mini | Aug 13, 2024 | Sep 15, 2025 | 35B | $0 | ||||||
Grok 3 | Feb 19, 2025 | $3 / $15 | ||||||||
Grok 3 Fast | Feb 19, 2025 | Sep 15, 2025 | $5 / $25 | |||||||
Grok 3 Mini | Feb 19, 2025 | $0.3 / $0.5 | ||||||||
Grok 3 Mini Fast | Feb 19, 2025 | Sep 15, 2025 | $0.6 / $4 | |||||||
Grok 4 07/09 | Jul 09, 2025 | $3 / $15 | ||||||||
Grok 4 Fast (non-reasoning) | Sep 19, 2025 | $0.2 / $0.5 | ||||||||
Grok 4 Fast (reasoning) | Sep 19, 2025 | $0.2 / $0.5 | ||||||||
Grok 4.1 Fast (non-reasoning) | Nov 19, 2025 | $0.2 / $0.5 | ||||||||
Grok 4.1 Fast (reasoning) | Nov 19, 2025 | $0.2 / $0.5 | ||||||||
Grok Code Fast 1 | Aug 28, 2025 | $0.2 / $1.5 |
Promptmetheus currently supports 15 providers and 156 models.
Legend
Status: Supported, Deprecated, Not available
Distribution: Proprietary, Open weights, Open source
Links: Info, Announcement, Benchmark
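The "Price in/out [$/1Mt]" column gives US dollars per one million tokens, so the cost of a single request scales with its input and output token counts. The following is a minimal sketch of that arithmetic, not part of Promptmetheus: the helper names `parse_price` and `estimate_cost` are hypothetical, and the code assumes that a cell with a single value (for example `$30`) applies to both input and output tokens, which may not hold for every provider.

```python
from typing import Tuple


def parse_price(cell: str) -> Tuple[float, float]:
    """Parse a price cell such as "$2 / $8" or "$30".

    Returns (input_price, output_price) in USD per 1M tokens.
    Assumption: a single value is used for both input and output.
    """
    values = [float(part.strip().lstrip("$")) for part in cell.split("/")]
    if len(values) == 1:
        return values[0], values[0]
    if len(values) == 2:
        return values[0], values[1]
    raise ValueError(f"unexpected price cell: {cell!r}")


def estimate_cost(cell: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request from a table price cell."""
    price_in, price_out = parse_price(cell)
    # Prices are quoted per one million tokens.
    return (input_tokens * price_in + output_tokens * price_out) / 1_000_000


if __name__ == "__main__":
    # 2,000 input tokens and 500 output tokens at "$3 / $15".
    print(f"{estimate_cost('$3 / $15', 2_000, 500):.4f} USD")
```

Treat the result as a rough estimate only: actual billing may depend on details such as caching discounts, reasoning tokens, or tiered pricing that the table does not capture.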
Disclaimer
We try to keep this list accurate and up to date, but please do not rely on it for critical use cases and always double-check official sources.
Model Parameters
Promptmetheus uses LiteLLM to connect to the different LLM APIs. You can find all relevant details for model parameter support in their documentation.
LLM Support
We aim to support all major providers that have a public inference API and to add new models as soon as they become available. If a provider or model that you need is missing, please don't hesitate to request it.
LLM Benchmarks
We currently don't do any model benchmarking ourselves. Please consult the listed links for official benchmarks of each specific model and the LLM Benchmarks page for a list of trusted sources and leaderboards.