Overview and comparison of available LLM API providers and supported language models
Provider | Model | Mode | Status | Release date | Deprecation date | Max. tokens | Token price in / out ($/1k) | Links |
---|---|---|---|---|---|---|---|---|
AI21 | Jurassic 2 Light | completion | available | Mar 9, 2023 | 8,192 | 0.003 | ||
Jurassic 2 Mid | completion | available | Mar 9, 2023 | 8,192 | 0.01 | |||
Jurassic 2 Ultra | completion | available | Mar 9, 2023 | 8,192 | 0.015 | |||
Aleph Alpha | Luminous Base | completion | available | Apr 14, 2023 | 2,048 | 0.03 | ||
Luminous Base Control | chat | available | Apr 14, 2023 | 2,048 | 0.0375 | |||
Luminous Extended | completion | available | Apr 14, 2023 | 2,048 | 0.045 | |||
Luminous Extended Control | chat | available | Apr 14, 2023 | 2,048 | 0.05625 | |||
Luminous Supreme | completion | available | Apr 14, 2023 | 2,048 | 0.175 | |||
Luminous Supreme Control | chat | available | Apr 14, 2023 | 2,048 | 0.21875 | |||
Anthropic | Claude 2 | chat | available | Jul 11, 2023 | 100,000 | 0.008 / 0.024 | ||
Claude 2.1 | chat | available | Nov 21, 2023 | 200,000 | 0.008 / 0.024 | |||
Claude 3 Haiku | chat | available | Mar 15, 2024 | 200,000 | 0.00025 / 0.00125 | |||
Claude 3 Opus | chat | available | Mar 4, 2024 | 200,000 | 0.015 / 0.075 | |||
Claude 3 Sonnet | chat | available | Mar 4, 2024 | 200,000 | 0.003 / 0.015 | |||
Claude Instant 1 | chat | available | Mar 14, 2023 | 100,000 | 0.00163 / 0.00551 | |||
Claude Instant 1.2 | chat | available | Sep 21, 2023 | 100,000 | 0.00163 / 0.00551 | |||
Cohere | Command | completion | available | 4,096 | 0.015 | |||
Command Light | completion | available | 4,096 | 0.015 | ||||
Command Nightly | completion | available | 4,096 | 0.015 | ||||
Command R | chat | available | Mar 11, 2024 | 128,000 | 0.0005 / 0.0015 | |||
Command R+ | chat | available | Apr 4, 2024 | 128,000 | 0.003 / 0.015 | |||
Deep Infra | Code Llama 34B Instruct HF | chat | available | Aug 24, 2023 | 4,096 | 0.0006 | ||
Llama 2 13B Chat HF | chat | available | Jul 18, 2023 | 4,096 | 0.00035 | |||
Llama 2 70B Chat HF | chat | available | Jul 18, 2023 | 6,144 | 0.001875 | |||
Llama 2 7B Chat HF | chat | available | Jul 18, 2023 | 4,096 | 0.0002 | |||
Mistral 7B Instruct v0.1 | chat | available | Sep 27, 2023 | 4,096 | 0.0002 | |||
Google AI | Chat Bison | chat | available | May 10, 2023 | 4,096 | 0.0005 | ||
Code Bison | completion | May 10, 2023 | 6,144 | 0.0005 | ||||
Gemini 1.0 Pro | chat | available | Dec 13, 2023 | 30,720 | 0 | |||
Gemini 1.5 Pro | chat | available | Feb 15, 2024 | 1,000,000 | 0 | |||
Text Bison | completion | available | May 10, 2023 | 8,192 | 0.0005 | |||
Groq | Gemma 7B | chat | available | Jan 15, 2024 | 8,192 | 0.0001 | ||
Llama 2 70B | chat | available | Jan 15, 2024 | 4,096 | 0.0007 / 0.0008 | |||
Llama 3 70B | chat | available | Apr 18, 2024 | 8,192 | 0.00064 / 0.0008 | |||
Llama 3 8B | chat | available | Apr 18, 2024 | 8,192 | 0.0001 | |||
Mixtral 8x7B | chat | available | Jan 15, 2024 | 32,768 | 0.00027 | |||
Mistral | Large (latest) | chat | available | Feb 26, 2024 | 32,000 | 0.008 / 0.024 | ||
Medium (latest) | chat | available | Dec 11, 2023 | 8,192 | 0.0025 / 0.0075 | |||
Small (Mixtral 8x7B) | chat | available | Dec 11, 2023 | 8,192 | 0.0006 / 0.0018 | |||
Tiny (Mistral-7B-v0.2) | chat | available | Dec 11, 2023 | 8,192 | 0.00014 / 0.00042 | |||
NLP Cloud | Chat Dolphin | chat | available | 16,384 | 0.0005 | |||
Dolphin | completion | available | 16,384 | 0.0005 | ||||
OpenAI | Babbage 002 | completion | available | 16,384 | 0.0016 | |||
DaVinci 002 | completion | available | 16,384 | 0.012 | ||||
GPT-3.5 Turbo | chat | available | Nov 30, 2022 | 4,096 | 0.0015 / 0.002 | |||
GPT-3.5 Turbo 0125 | chat | available | Jan 25, 2024 | 16,385 | 0.0005 / 0.0015 | |||
GPT-3.5 Turbo 0301 | chat | available | Jun 13, 2024 | 4,096 | 0.0015 / 0.002 | |||
GPT-3.5 Turbo 0613 | chat | available | 4,096 | 0.0015 / 0.002 | ||||
GPT-3.5 Turbo 1106 | chat | available | Nov 6, 2023 | 16,385 | 0.001 / 0.002 | |||
GPT-3.5 Turbo 16k | chat | available | Nov 30, 2022 | 16,384 | 0.003 / 0.004 | |||
GPT-3.5 Turbo 16k 0301 | chat | available | Jun 13, 2024 | 16,384 | 0.003 / 0.004 | |||
GPT-3.5 Turbo 16k 0613 | chat | available | Jun 13, 2024 | 16,384 | 0.003 / 0.004 | |||
GPT-3.5 Turbo Instruct | completion | available | 4,096 | 0.0015 / 0.002 | ||||
GPT-4 | chat | available | Mar 14, 2023 | 8,192 | 0.03 / 0.06 | |||
GPT-4 0125 Turbo | chat | available | Jan 25, 2024 | 128,000 | 0.01 / 0.03 | |||
GPT-4 0314 | chat | available | 8,192 | 0.03 / 0.06 | ||||
GPT-4 0613 | chat | available | 8,192 | 0.03 / 0.06 | ||||
GPT-4 1106 Turbo | chat | available | Nov 6, 2023 | 128,000 | 0.01 / 0.03 | |||
GPT-4 32k | chat | available | Mar 14, 2023 | 32,768 | 0.06 / 0.12 | |||
GPT-4 32k 0314 | chat | available | 32,768 | 0.06 / 0.12 | ||||
GPT-4 32k 0613 | chat | available | 32,768 | 0.06 / 0.12 | ||||
GPT-4 Turbo | chat | available | Jan 25, 2024 | 128,000 | 0.01 / 0.03 | |||
GPT-4 Turbo 2024-04-09 | chat | available | Apr 9, 2024 | 128,000 | 0.01 / 0.03 | |||
Perplexity | Code Llama 34B Instruct | chat | available | Oct 4, 2023 | 16,384 | 0.00035 / 0.0014 | ||
Code Llama 70B Instruct | chat | available | Oct 4, 2023 | 16,384 | 0.0007 / 0.0028 | |||
Llama 2 70B | chat | available | Oct 4, 2023 | 4,096 | 0.0007 / 0.0028 | |||
Mistral 7B | chat | available | Oct 4, 2023 | 4,096 | 0.00007 / 0.00028 | |||
Mixtral 8x7B Instruct | chat | available | Oct 4, 2023 | 4,096 | 0.00007 / 0.00028 | |||
Sonar medium (chat) | chat | available | Feb 24, 2024 | 16,384 | 0.0006 / 0.0018 | |||
Sonar medium (online) | chat | available | Feb 24, 2024 | 12,000 | 0 / 0.0018 | |||
Sonar small (chat) | chat | available | Feb 24, 2024 | 16,384 | 0.00007 / 0.00028 | |||
Sonar small (online) | chat | available | Feb 24, 2024 | 12,000 | 0 / 0.00028 | |||
pplx 70B chat | chat | available | Oct 27, 2023 | 4,096 | 0.0007 / 0.0028 | |||
pplx 70B online | chat | available | Oct 27, 2023 | 4,096 | 0 / 0.0028 | |||
pplx 7B chat | chat | available | Oct 27, 2023 | 8,192 | 0.00007 / 0.00028 | |||
pplx 7B online | chat | available | Oct 27, 2023 | 4,096 | 0 / 0.00028 | |||
xAI | Grok 1 | chat | Nov 6, 2020 | 25,000 | ||||
Grok 1.5 | chat | Mar 28, 2024 | 128,000 |
Disclaimer: Even though we try to keep this list accurate and up-to-date, please do not rely on the presented information for critical use cases and always double check official sources.
PROMPTMETHEUS currently relies on the LiteLLM open-source library to connect to the different LLM APIs. You can find all the details for model parameter support in their documentation.
If you are missing a provider or model that you would like to use for your PROMPTMETHEUS project, please don't hesitate to request it. The goal is to support all relevant foundation models and eventually even custom models.
Shout-out to Krrish and Ishaan from LiteLLM who make adding new models easy as pie.