All AI Models & Pricing
25 models from 9 providers. Click any model for a full breakdown.
Anthropic
5 modelsClaude Opus 4
frontierAnthropic's most powerful model — best for complex reasoning, research, and advanced coding tasks.
Input /1M
$15.00
Output /1M
$75.00
Context
200K
Claude Sonnet 4
efficientBest balance of intelligence and speed. Ideal for agentic workflows, coding, and data analysis.
Input /1M
$3.00
Output /1M
$15.00
Context
200K
Claude Haiku 4
economyAnthropic's fastest and most compact model — optimal for high-volume, latency-sensitive tasks.
Input /1M
$0.800
Output /1M
$4.00
Context
200K
Claude 3.5 Sonnet
efficientTop coding and reasoning performance in the Claude 3.x family. Still widely used in production.
Input /1M
$3.00
Output /1M
$15.00
Context
200K
Claude 3.5 Haiku
economyFast and affordable Claude model with vision support. Great for real-time applications.
Input /1M
$0.800
Output /1M
$4.00
Context
200K
OpenAI
5 modelsGPT-4o
efficientOpenAI's flagship multimodal model — combines speed, intelligence, and vision capabilities.
Input /1M
$2.50
Output /1M
$10.00
Context
128K
GPT-4o mini
economySmall and affordable — best value for lightweight tasks, classification, and extraction.
Input /1M
$0.150
Output /1M
$0.600
Context
128K
o1
frontierOpenAI's reasoning model — designed for complex STEM, coding, and multi-step logic.
Input /1M
$15.00
Output /1M
$60.00
Context
200K
o3-mini
efficientCost-efficient reasoning model. Excels at science, math, and coding at fraction of o1 cost.
Input /1M
$1.10
Output /1M
$4.40
Context
200K
GPT-4 Turbo
frontierPrevious GPT-4 flagship with vision. Being superseded by GPT-4o but still available.
Input /1M
$10.00
Output /1M
$30.00
Context
128K
Gemini 2.5 Pro
frontierGoogle's most capable model with 1M token context — leads benchmarks in coding and reasoning.
Input /1M
$1.25
Output /1M
$10.00
Context
1M
Gemini 2.0 Flash
economyFast and affordable multimodal model with massive context window. Google's workhorse for production.
Input /1M
$0.100
Output /1M
$0.400
Context
1M
Gemini 1.5 Pro
efficientIndustry-leading 2M context window. Ideal for processing entire codebases or long documents.
Input /1M
$1.25
Output /1M
$5.00
Context
2M
Gemini 1.5 Flash
economyGoogle's most cost-efficient model with 1M context. Best value for high-volume applications.
Input /1M
$0.075
Output /1M
$0.300
Context
1M
Meta / Groq
2 modelsLlama 3.3 70B (Groq)
efficientMeta's latest 70B open-weights model on Groq's ultra-fast inference hardware (LPU). Incredible speed.
Input /1M
$0.590
Output /1M
$0.790
Context
128K
Llama 3.1 8B (Groq)
economyThe cheapest capable model for simple tasks — ideal for summarization, extraction, and classification.
Input /1M
$0.050
Output /1M
$0.080
Context
128K
Meta / Together AI
1 modelsMistral AI
2 modelsMistral Large 2
efficientMistral's flagship model — top multilingual performance and strong coding across 80+ languages.
Input /1M
$3.00
Output /1M
$9.00
Context
128K
Mistral Small 3.1
economySmall but mighty — vision-capable, Apache 2.0 licensed, and extremely cost-efficient.
Input /1M
$0.100
Output /1M
$0.300
Context
128K
xAI
2 modelsGrok 3
frontierxAI's latest model with extended thinking — excels at deep research and complex problem solving.
Input /1M
$3.00
Output /1M
$15.00
Context
131.072K
Grok 3 Mini
economyCompact reasoning model from xAI — great value for logic, math, and structured tasks.
Input /1M
$0.300
Output /1M
$0.500
Context
131.072K
Cohere
2 modelsCommand R+
efficientEnterprise-grade model optimized for RAG and tool use. Excels at retrieval-augmented generation.
Input /1M
$2.50
Output /1M
$10.00
Context
128K
Command R
economyAffordable RAG-optimized model. Best for high-volume document search and summarization pipelines.
Input /1M
$0.150
Output /1M
$0.600
Context
128K