- TOTAL MODELS TRACKED: 51 (10 providers)
- CHEAPEST (BLENDED): $0.067/M (Nova Micro)
- PRICIEST (BLENDED): $97.50/M (GPT-4.5)
- AVG BLENDED COST: $6.22/M (per 1M tokens)
- LOWEST INPUT: $0.020/M (Amazon Nova Micro)
- LARGEST CONTEXT: 2M (Gemini 1.5 Pro)
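
The page does not say how the blended figure is computed. The two blended values in the summary are, however, consistent with a 70/30 weighting of input and output price: 0.7 × $75.00 + 0.3 × $150.00 = $97.50 for GPT-4.5, and 0.7 × $0.0350 + 0.3 × $0.1400 = $0.0665 for Nova Micro, which rounds to the $0.067 shown. The sketch below encodes that inferred weighting. The weights and the function name `blended_cost` are assumptions made for illustration, not something the source confirms.

```python
# Assumed weights, inferred from the summary figures; the source does not state them.
INPUT_WEIGHT = 0.7
OUTPUT_WEIGHT = 0.3


def blended_cost(input_per_m: float, output_per_m: float) -> float:
    """Return a blended $ per 1M tokens under the assumed 70/30 input/output mix."""
    return INPUT_WEIGHT * input_per_m + OUTPUT_WEIGHT * output_per_m


if __name__ == "__main__":
    # Input and output prices ($ per 1M tokens) taken from the table rows.
    print(f"GPT-4.5:    ${blended_cost(75.00, 150.00):.2f}/M")
    print(f"Nova Micro: ${blended_cost(0.0350, 0.1400):.4f}/M")
```

A workload with a different input/output ratio would need different weights, so treat the 70/30 split as a description of this dashboard's numbers rather than a general rule.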
| PROVIDER | MODEL | TAG | CATEGORY | INPUT ($/1M) | OUTPUT ($/1M) | CONTEXT | NOTES |
|---|---|---|---|---|---|---|---|
| Amazon | Nova Micro | Micro | EFFICIENT | $0.0350 | $0.1400 | 128K | Text-only micro |
| Cohere | Command R7B | Compact | EFFICIENT | $0.0375 | $0.1500 | 128K | 7B model |
| Amazon | Nova Lite | Efficient | EFFICIENT | $0.0600 | $0.2400 | 300K | Ultra low-cost |
| Mistral | Mistral Nemo | Open | EFFICIENT | $0.1500 | $0.1500 | 128K | 12B open model |
| Mistral | Mistral Small 3.2 | Efficient | EFFICIENT | $0.1000 | $0.3000 | 128K | Cost leader |
| Meta | Llama 4 Scout | Efficient | EFFICIENT | $0.1000 | $0.3500 | 512K | Long-context MoE |
| Google | Gemini 2.0 Flash | Efficient | EFFICIENT | $0.1000 | $0.4000 | 1M | Next-gen Flash |
| Meta | Llama 3.3 70B | Open | EFFICIENT | $0.2300 | $0.4000 | 128K | Open weights |
| OpenAI | GPT-4o mini | Efficient | EFFICIENT | $0.1500 | $0.6000 | 128K | Cost-optimized |
| Google | Gemini 2.5 Flash | Balanced | MULTIMODAL | $0.1500 | $0.6000 | 1M | Speed-optimized |
| Cohere | Command R | Balanced | EFFICIENT | $0.1500 | $0.6000 | 128K | RAG-optimized |
| Meta | Llama 4 Maverick | Frontier | FRONTIER | $0.2000 | $0.6000 | 128K | MoE architecture |
| Mistral | Codestral | Code | EFFICIENT | $0.2000 | $0.6000 | 256K | Code specialist |
| xAI | Grok 3 mini | Efficient | EFFICIENT | $0.3000 | $0.5000 | 131K | Cost-efficient |
| DeepSeek | DeepSeek-V3 | Frontier | FRONTIER | $0.2700 | $1.10 | 64K | Top open source |
| DeepSeek | DeepSeek-V3-0324 | Frontier | FRONTIER | $0.2700 | $1.10 | 128K | Extended context |
| Meta | Llama 3.1 405B | Large | FRONTIER | $0.8000 | $0.8000 | 128K | Largest open model |
| Amazon | Titan Text Premier | Amazon | EFFICIENT | $0.5000 | $1.50 | 32K | Amazon-built |
| Mistral | Mixtral 8x22B | Open MoE | EFFICIENT | $0.9000 | $0.9000 | 64K | Open MoE |
| Perplexity | Sonar | Efficient | EFFICIENT | $1.00 | $1.00 | 127K | Grounded search |
| DeepSeek | DeepSeek-R1 | Reasoning | REASONING | $0.5500 | $2.19 | 64K | Chain-of-thought |
| DeepSeek | DeepSeek-R1-0528 | Reasoning | REASONING | $0.5500 | $2.19 | 64K | Latest R1 release |
| Amazon | Nova Pro | Frontier | MULTIMODAL | $0.8000 | $3.20 | 300K | Video understanding |
| Anthropic | Claude Haiku 4.5 | Efficient | EFFICIENT | $0.8000 | $4.00 | 200K | Fast & affordable |
| OpenAI | o3-mini | Reasoning | REASONING | $1.10 | $4.40 | 200K | Efficient reasoning |
| OpenAI | o4-mini | Reasoning | REASONING | $1.10 | $4.40 | 200K | Latest mini reasoner |
| Google | Gemini 1.5 Pro | Previous Gen | FRONTIER | $1.25 | $5.00 | 2M | 2M context |
| Mistral | Mistral Large 2 | Frontier | FRONTIER | $2.00 | $6.00 | 128K | Top European model |
| Perplexity | Sonar Reasoning Pro | Reasoning | REASONING | $2.00 | $8.00 | 127K | R1 + web search |
| Perplexity | Sonar Deep Research | Research | REASONING | $2.00 | $8.00 | 127K | Agentic research |
| Google | Gemini 2.5 Pro | Frontier | FRONTIER | $1.25 | $10.00 | 1M | 1M ctx, thinking |
| xAI | Grok 2 Vision | Previous Gen | MULTIMODAL | $2.00 | $10.00 | 32K | Vision model |
| OpenAI | GPT-4o | Flagship | MULTIMODAL | $2.50 | $10.00 | 128K | Omni multimodal |
| Cohere | Command R+ | Frontier | FRONTIER | $2.50 | $10.00 | 128K | Enterprise RAG |
| Amazon | Nova Premier | Top-tier | FRONTIER | $2.50 | $12.50 | 300K | Highest intelligence |
| Anthropic | Claude Sonnet 4.5 | Balanced | MULTIMODAL | $3.00 | $15.00 | 200K | Vision + extended thinking |
| Anthropic | Claude Sonnet 3.7 | Previous Gen | REASONING | $3.00 | $15.00 | 200K | Hybrid reasoning |
| xAI | Grok 3 | Frontier | FRONTIER | $3.00 | $15.00 | 131K | Real-time X data |
| Perplexity | Sonar Pro | Frontier | FRONTIER | $3.00 | $15.00 | 200K | With web search |
| xAI | Grok 3 Fast | Balanced | MULTIMODAL | $5.00 | $25.00 | 131K | High throughput |
| OpenAI | o3 | Reasoning | REASONING | $10.00 | $40.00 | 200K | Advanced reasoning |
| Anthropic | Claude Opus 4.5 | Frontier | FRONTIER | $15.00 | $75.00 | 200K | State-of-the-art |
| Anthropic | Claude Opus 3 | Previous Gen | FRONTIER | $15.00 | $75.00 | 200K | Legacy frontier |
| OpenAI | GPT-4.5 | Frontier | FRONTIER | $75.00 | $150.00 | 128K | Most capable GPT |
| OpenAI | text-embedding-3-large | Embedding | n/a | $0.1300 | n/a | 8K | 3072 dimensions |
| OpenAI | text-embedding-3-small | Embedding | n/a | $0.0200 | n/a | 8K | 1536 dimensions |
| Google | text-embedding-004 | Embedding | n/a | $0.0250 | n/a | 2K | 768 dimensions |
| Mistral | mistral-embed | Embedding | n/a | $0.1000 | n/a | 8K | 1024 dimensions |
| Cohere | Embed v3 English | Embedding | n/a | $0.1000 | n/a | 512 | 1024 dimensions |
| Cohere | Embed v3 Multilingual | Embedding | n/a | $0.1000 | n/a | 512 | 100+ languages |
| Cohere | Rerank v3.5 | Reranker | n/a | $2.00 | n/a | n/a | Per search unit |
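
Because the table lists prices per 1M tokens, the cost of a single request follows from scaling each price by the token count. The sketch below does that for three rows copied from the table. The helper name `request_cost` and the 10,000-input / 2,000-output request size are hypothetical, chosen only to illustrate the arithmetic.

```python
# (input, output) prices in $ per 1M tokens, copied from three rows of the table.
PRICES = {
    "GPT-4o mini": (0.15, 0.60),
    "Claude Haiku 4.5": (0.80, 4.00),
    "Gemini 2.5 Pro": (1.25, 10.00),
}

TOKENS_PER_MILLION = 1_000_000


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the dollar cost of one request from per-1M-token list prices."""
    input_per_m, output_per_m = PRICES[model]
    total = input_tokens * input_per_m + output_tokens * output_per_m
    return total / TOKENS_PER_MILLION


if __name__ == "__main__":
    # Hypothetical request size: 10,000 input tokens and 2,000 output tokens.
    for model in PRICES:
        print(f"{model}: ${request_cost(model, 10_000, 2_000):.4f}")
```

Dividing a per-1M price by 1,000 gives the equivalent per-1K price, so the same function works for either unit if the divisor is changed to match.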