An AI Programming Interface (AIPI) is similar to a conventional API, but instead of executing static code on a server, its endpoints mediate interactions with Large Language Models (LLMs) via hosted prompts.

Prompt Engineering IDE

Forge better prompts for your LLM-powered applications, agents, and workflows.

Compose advanced prompts

Test prompt reliability

Optimize prompt performance

Collaborate with your team

Get Started

Supports 100+ LLMs and 15 inference APIs

Compose Advanced Prompts

Promptmetheus breaks prompts down into LEGO-like blocks for better composability, e.g. Context ⇢ Task ⇢ Instructions ⇢ Samples (shots) ⇢ Primer. You can play with different variations for each section and systematically fine-tune your prompts for minimal cost and maximum performance.
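As a rough illustration of the block idea described above, the following Python sketch assembles prompts from one variant per section and enumerates every combination. It is not Promptmetheus code: the section names only mirror the order given above, and the block texts, `compose`, and `all_variations` are hypothetical names invented for this example.

```python
from itertools import product

# Hypothetical block variants; the section order follows
# Context -> Task -> Instructions -> Samples -> Primer.
ORDER = ["context", "task", "instructions", "samples", "primer"]

blocks = {
    "context": ["You are a support assistant for an online bookstore."],
    "task": ["Classify the customer message by intent."],
    "instructions": [
        "Answer with a single label.",
        "Answer with a single label and a one-sentence justification.",
    ],
    "samples": ["Message: Where is my order?\nLabel: shipping"],
    "primer": ["Label:"],
}


def compose(selection):
    """Join one chosen variant per section into a single prompt string."""
    return "\n\n".join(selection[name] for name in ORDER)


def all_variations(blocks):
    """Yield every prompt that picks exactly one variant from each section."""
    for combo in product(*(blocks[name] for name in ORDER)):
        yield compose(dict(zip(ORDER, combo)))


prompts = list(all_variations(blocks))
```

Here only the instructions section has two variants, so `prompts` holds two candidates that differ in that one block, which makes it easy to attribute a change in output quality to a specific section.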

Test Prompt Reliability

The Prompt IDE includes a range of tools to evaluate your prompts under various conditions. For instance, Datasets enable rapid iteration with different inputs, while completion Ratings and the respective visual statistics help gauge output quality.

Optimize Prompt Performance

End-to-end performance and reliability of prompt chains (agents) depend heavily on the accuracy of each prompt in the sequence. Errors can compound and compromise the final output. Promptmetheus can help you optimize each prompt in the chain to consistently generate great completions.
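To make the compounding effect concrete, here is a small Python sketch under a simplifying assumption that is ours, not the source's: each prompt in the chain succeeds independently, so the chain's success rate is the product of the per-step accuracies. The function name and the accuracy figures are illustrative only.

```python
import math


def chain_success_rate(step_accuracies):
    """Probability that every step succeeds, assuming independent steps."""
    return math.prod(step_accuracies)


# Five prompts in sequence, each correct 95% of the time (illustrative numbers).
five_steps_95 = chain_success_rate([0.95] * 5)

# The same chain after tuning each prompt to 99% (illustrative numbers).
five_steps_99 = chain_success_rate([0.99] * 5)
```

Under this assumption, five steps at 95% each yield an end-to-end rate of roughly 77%, while raising each step to 99% brings the chain to roughly 95%. Real chains may behave differently when errors are correlated or later steps can recover from earlier mistakes.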

Traceability

Track and reconstruct the complete prompt design process.

Evaluators

Define evaluators and automatically validate completions.

Cost Estimation

Estimate inference costs for different configurations and scenarios.

Data Export

Export prompts and completions in .txt, .csv, .xlsx, or .json format.

Analytics

Analyze prompt performance with statistics, charts, and insights.
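Two of the features listed above, evaluators and cost estimation, can be pictured with a short Python sketch. This is a generic illustration, not the Promptmetheus implementation: the evaluator functions, `evaluate`, and `estimate_cost` are hypothetical names, and the token prices are placeholders rather than any provider's actual rates.

```python
import json


def is_valid_json(completion):
    """Evaluator: True if the completion parses as JSON."""
    try:
        json.loads(completion)
        return True
    except json.JSONDecodeError:
        return False


def max_words(limit):
    """Build an evaluator that passes completions of at most `limit` words."""
    def check(completion):
        return len(completion.split()) <= limit
    return check


def evaluate(completions, evaluators):
    """Apply every named evaluator to every completion."""
    return [
        {name: fn(text) for name, fn in evaluators.items()}
        for text in completions
    ]


def estimate_cost(input_tokens, output_tokens, price_in_per_m, price_out_per_m, runs=1):
    """Estimate total cost in USD from token counts and per-million-token prices."""
    per_run = (input_tokens * price_in_per_m + output_tokens * price_out_per_m) / 1_000_000
    return per_run * runs


results = evaluate(
    ['{"label": "shipping"}', "not json at all"],
    {"json": is_valid_json, "short": max_words(3)},
)

# Placeholder prices (USD per million tokens), not real provider rates.
cost = estimate_cost(1200, 300, price_in_per_m=2.0, price_out_per_m=8.0, runs=1000)
```

With these placeholder numbers, 1,000 runs of a prompt with 1,200 input tokens and 300 output tokens come to about $4.80. The first sample completion passes both checks and the second fails both.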

Models

We support all the latest LLMs and APIs

Anthropic

Claude 4.1 Opus

Claude 4 Opus

Claude 4 Sonnet

Claude 3.7 Sonnet

Claude 3.5 Sonnet

Claude 3.5 Haiku

Claude 3 Opus

Claude 3 Sonnet

Claude 3 Haiku

DeepMind

Gemini 2.5 Pro

Gemini 2.5 Flash

Gemini 2.0 Flash-Lite

Gemini 2.0 Flash

Gemini 1.5 Pro

Gemini 1.5 Flash

OpenAI

o4 mini

o3 pro

o3 mini

o1 pro

o1 mini

GPT-4.5

GPT-4.1

GPT-4.1 mini

GPT-4.1 nano

GPT-4o

GPT-4o mini

GPT-4

GPT-4 Turbo

GPT-3.5 Turbo

xAI

Grok 4

Grok 3

Grok 3 fast

Grok 3 mini

Grok 3 mini fast

Grok 2

Mistral

Magistral Medium

Magistral Small

Mistral Large 2.1

Mistral Medium 3

Mistral Small 3.2

Mistral Nemo

Ministral 8b

Ministral 3b

DeepSeek

DeepSeek R1

DeepSeek V3

Moonshot AI

Kimi K2

Cohere

Command A

Command R+

Command R

Command R 7b

Command

Command Light

Perplexity

Sonar Deep Research

Sonar Reasoning Pro

Sonar Reasoning

Sonar Pro

Sonar

R1 1776

AI21 Labs

Jamba 1.7 Large

Jamba 1.7 Mini

FetchAI

asi1-mini

Venice

Venice Uncensored

Alibaba Qwen 3

DeepSeek R1

Groq

Compound Beta

Compound Beta Mini

OpenAI GPT-OSS 120B

OpenAI GPT-OSS 20B

Moonshot AI Kimi K2

DeepSeek R1 Distill Llama 70B

DeepSeek R1 Distill Qwen 32B

Alibaba Qwen 3 32B

Meta Llama 4 Maverick 17B 128e

Meta Llama 4 Scout 17B 16e

Meta Llama 3.3

Meta Llama 3.1

Mistral Saba 24B

Google Gemma 2

Deep Infra

OpenAI GPT-OSS 120B

OpenAI GPT-OSS 20B

Moonshot AI Kimi K2

DeepSeek R1

DeepSeek V3

Alibaba Qwen 2.5

Alibaba Qwen 2

Meta Llama 4 Maverick 17B 128e

Meta Llama 4 Scout 17B 16e

Meta Llama 3.3

Meta Llama 3.2

Meta Llama 3.1

Google Gemma 3

Microsoft WizardLM

Microsoft Phi 4

OpenRouter

OpenAI GPT-OSS 120B

OpenAI GPT-OSS 20B

Moonshot AI Kimi K2

Tencent Hunyuan A13B

Baidu Ernie 4.5 300B A47B

“The hottest new programming language is English.”
— Andrej Karpathy

Pricing

Great plans for individuals and teams

Playground

FREE

Forge
1 user
Local data storage
OpenAI models only
Stats & Insights
Data import / export
Community support

Single

$29 per month

7-day free trial

Prompt IDE
1 user
Cloud sync between devices
12 providers and 100+ models
Multiple projects
Automatic evaluators
Prompt history and full traceability
Stats & Insights
Data export
Dedicated support

Team

Starting at $99 per month

Prompt IDE
3 users included
$19/month per additional user
All Single features, plus
User management
Shared workspace with real-time collaboration
Business support

You can cancel your subscription at any time.

Subscriptions do not include an inference budget; you need to provide your own API keys.

For Enterprise plans and special requests, please get in touch.

FAQ

What is Prompt Engineering?

What is a Prompt IDE?

How is Promptmetheus different from the OpenAI and other playgrounds?

How is Promptmetheus different from other prompt engineering tools?

Is there an API or SDK?

Can I use Promptmetheus together with LangChain, LangFlow, and other AI agent builders?

What is the difference between Forge and Archery?

What is an AIPI?

Does Promptmetheus integrate with automation tools like Make, Zapier, IFTTT, and n8n?

If you have any open questions, please get in touch.

Email Discord
