Live pricing table for every frontier LLM — GPT, Claude, Gemini, Llama, DeepSeek, Mistral

| Model | Modalities | Vendor | Input ($ / 1M tokens) | Output ($ / 1M tokens) | Context (tokens) | Released | Scenario cost ($) |
|---|---|---|---|---|---|---|---|
| GPT-5 nano | text | OpenAI | $0.05 | $0.40 | 272,000 | 2025-08-07 | $0.150 |
| Gemini 2.5 Flash-Lite | text · vision | Google | $0.10 | $0.40 | 1,000,000 | 2025-07-22 | $0.200 |
| Llama 4 Scout | text · vision | Meta | $0.11 | $0.34 | 10,000,000 | 2025-04-05 | $0.195 |
| GPT-5 mini | text · vision | OpenAI | $0.25 | $2.00 | 272,000 | 2025-08-07 | $0.750 |
| Llama 4 Maverick | text · vision | Meta | $0.27 | $0.85 | 10,000,000 | 2025-04-05 | $0.483 |
| DeepSeek V3.1 | text | DeepSeek | $0.27 | $1.10 | 128,000 | 2025-08-21 | $0.545 |
| Gemini 2.5 Flash | text · vision · audio · video | Google | $0.30 | $2.50 | 1,048,576 | 2025-06-17 | $0.925 |
| Mistral Medium 3 | text | Mistral | $0.40 | $2.00 | 128,000 | 2025-05-07 | $0.900 |
| DeepSeek R1 (reasoning) | text | DeepSeek | $0.55 | $2.19 | 64,000 | 2025-01-20 | $1.098 |
| Claude Haiku 4.5 | text · vision | Anthropic | $1.00 | $5.00 | 200,000 | 2025-10-01 | $2.250 |
| GPT-5 | text · vision | OpenAI | $1.25 | $10.00 | 272,000 | 2025-08-07 | $3.750 |
| Gemini 2.5 Pro | text · vision · audio · video | Google | $1.25 | $10.00 | 2,097,152 | 2025-06-17 | $3.750 |
| GPT-4.1 | text · vision | OpenAI | $2.00 | $8.00 | 1,047,576 | 2025-04-14 | $4.000 |
| o3 (reasoning) | text · vision | OpenAI | $2.00 | $8.00 | 200,000 | 2025-04-16 | $4.000 |
| Mistral Large 2 | text | Mistral | $2.00 | $6.00 | 128,000 | 2024-11-18 | $3.500 |
| Claude Sonnet 4.6 | text · vision | Anthropic | $3.00 | $15.00 | 1,000,000 | 2026-01-22 | $6.750 |
| Grok 4 | text · vision | xAI | $3.00 | $15.00 | 256,000 | 2025-07-09 | $6.750 |
| Claude Opus 4.7 | text · vision | Anthropic | $15.00 | $75.00 | 1,000,000 | 2026-03-10 | $33.750 |

Choosing an LLM vendor has become a 7-figure decision for many companies, yet prices, context windows, and capabilities change quarterly. This comparison table gives you a single-pane view across every frontier model, with a scenario calculator so you can instantly see what your workload would cost on each. Prices are verified monthly against vendor pricing pages.
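
The scenario figures in the table are consistent with a workload of 1M input tokens plus 250k output tokens. That mix is inferred from the listed numbers, not stated on the page, so treat it as an assumption. A minimal Python sketch of the calculation under that assumption (the function name `scenario_cost` and its default token counts are illustrative, not part of any vendor API):

```python
def scenario_cost(input_price_per_m: float,
                  output_price_per_m: float,
                  input_tokens: int = 1_000_000,
                  output_tokens: int = 250_000) -> float:
    """Return the USD cost of a workload given per-1M-token prices.

    The default token counts are an assumption inferred from the table,
    not a documented setting of the page's calculator.
    """
    input_cost = input_price_per_m * input_tokens / 1_000_000
    output_cost = output_price_per_m * output_tokens / 1_000_000
    return input_cost + output_cost


# Prices copied from the table above: (input $/1M, output $/1M).
prices = {
    "GPT-5 nano": (0.05, 0.40),
    "Claude Haiku 4.5": (1.00, 5.00),
    "GPT-5": (1.25, 10.00),
}

for model, (inp, out) in prices.items():
    print(f"{model}: ${scenario_cost(inp, out):.3f}")
```

To estimate your own workload, pass your monthly token counts as `input_tokens` and `output_tokens` instead of relying on the defaults.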
Prices are updated monthly: we verify them against each vendor's official pricing page at the start of each month.
Volume discounts are not reflected: all prices shown are for the standard pay-as-you-go API tier. Volume discounts (OpenAI above $100k/month, Anthropic custom contracts, AWS Bedrock committed use) can cut 20-40% off these rates.
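
As a rough illustration of what a 20-40% discount means for a given list-price bill, the sketch below returns the resulting cost range. The function name `discounted_range` is hypothetical, and the default bounds simply restate the 20-40% range quoted above; actual negotiated rates will differ.

```python
def discounted_range(list_cost: float,
                     min_discount: float = 0.20,
                     max_discount: float = 0.40) -> tuple[float, float]:
    """Return (lowest, highest) cost after a discount within the given range.

    The default bounds are the 20-40% range quoted in the text, used here
    as an assumption rather than any vendor's published schedule.
    """
    lowest = list_cost * (1 - max_discount)
    highest = list_cost * (1 - min_discount)
    return lowest, highest


# Example: a workload that costs $3.75 at list price.
low, high = discounted_range(3.75)
print(f"${low:.2f} to ${high:.2f}")
```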
Cached input tokens (repeated system prompts or context) are priced at roughly 10-25% of the standard input rate. Many workloads save 50% or more with caching, so it is a lever worth profiling.
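
To see how the cache hit rate drives savings, the sketch below blends the cached and uncached input rates. It is a simplified model under stated assumptions: the function name `effective_input_price` is hypothetical, the cached-price ratio is a parameter you would take from your vendor's pricing page, and any cache-write surcharge a vendor may apply is ignored.

```python
def effective_input_price(standard_price: float,
                          cached_fraction: float,
                          cached_price_ratio: float) -> float:
    """Return the blended input price per 1M tokens.

    standard_price:     list input price in $ per 1M tokens
    cached_fraction:    share of input tokens served from cache (0 to 1)
    cached_price_ratio: cached price as a fraction of the standard price
                        (the text quotes roughly 0.10 to 0.25)
    """
    if not 0.0 <= cached_fraction <= 1.0:
        raise ValueError("cached_fraction must be between 0 and 1")
    cached = cached_fraction * standard_price * cached_price_ratio
    uncached = (1.0 - cached_fraction) * standard_price
    return cached + uncached


# Example: $3.00 per 1M input tokens, 80% of input cached,
# cached tokens billed at 10% of the standard rate.
blended = effective_input_price(3.00, 0.80, 0.10)
savings = 1 - blended / 3.00
print(f"blended input price: ${blended:.2f} per 1M tokens")
print(f"input-side savings: {savings:.0%}")
```

The savings apply only to the input side of the bill; output tokens are billed at the normal rate, so the overall saving depends on the input/output mix of the workload.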
For open-weight models such as Llama and DeepSeek, the weights are free to run yourself. The prices shown are for managed endpoints (Together, Fireworks, Groq, etc.) and are representative of the market.
Image and audio inputs are not included: they have separate per-token or per-request pricing that varies by model. The prices here cover text only.