All AI Models & Pricing

Anthropic's most powerful model — best for complex reasoning, research, and advanced coding tasks.

Input /1M

$15.00

Output /1M

$75.00

Context

200K

Claude Sonnet 4

Best balance of intelligence and speed. Ideal for agentic workflows, coding, and data analysis.

Input /1M

$3.00

Output /1M

$15.00

Context

200K

Claude Haiku 4

Anthropic's fastest and most compact model — optimal for high-volume, latency-sensitive tasks.

Input /1M

$0.800

Output /1M

$4.00

Context

200K

Claude 3.5 Sonnet

Top coding and reasoning performance in the Claude 3.x family. Still widely used in production.

Input /1M

$3.00

Output /1M

$15.00

Context

200K

Claude 3.5 Haiku

Fast and affordable Claude model with vision support. Great for real-time applications.

Input /1M

$0.800

Output /1M

$4.00

Context

200K
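
The "Input /1M" and "Output /1M" figures are prices in USD per one million tokens, so the cost of a request scales with its input and output token counts separately. A minimal sketch of that arithmetic follows, using the Claude Sonnet 4 rates listed above. The function name `estimate_cost` and the token counts are illustrative assumptions, and actual billing may differ (for example through caching or batch discounts, which are not covered here).

```python
def estimate_cost(input_tokens: int, output_tokens: int,
                  input_price_per_m: float, output_price_per_m: float) -> float:
    """Estimate the USD cost of one request from per-1M-token prices.

    Hypothetical helper for illustration; not part of any provider SDK.
    """
    input_cost = input_tokens / 1_000_000 * input_price_per_m
    output_cost = output_tokens / 1_000_000 * output_price_per_m
    return input_cost + output_cost


# Claude Sonnet 4 rates from the listing: $3.00 input, $15.00 output per 1M tokens.
# The token counts below are made-up example values.
cost = estimate_cost(input_tokens=10_000, output_tokens=2_000,
                     input_price_per_m=3.00, output_price_per_m=15.00)
print(f"${cost:.4f}")  # prints $0.0600
```

Output tokens cost several times more than input tokens for most models in this list, so long responses tend to dominate the bill even when prompts are large.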

OpenAI

5 models

GPT-4o

OpenAI's flagship multimodal model — combines speed, intelligence, and vision capabilities.

Input /1M

$2.50

Output /1M

$10.00

Context

128K

GPT-4o mini

Small and affordable — best value for lightweight tasks, classification, and extraction.

Input /1M

$0.150

Output /1M

$0.600

Context

128K

o1

OpenAI's reasoning model — designed for complex STEM, coding, and multi-step logic.

Input /1M

$15.00

Output /1M

$60.00

Context

200K

o3-mini

Cost-efficient reasoning model. Excels at science, math, and coding at a fraction of o1's cost.

Input /1M

$1.10

Output /1M

$4.40

Context

200K

GPT-4 Turbo

Previous GPT-4 flagship with vision. Being superseded by GPT-4o but still available.

Input /1M

$10.00

Output /1M

$30.00

Context

128K
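
Price and context window usually have to be weighed together: a cheaper model is only an option if the request fits in its context. The sketch below picks the cheapest of the five OpenAI models listed above for a given request size. It assumes that "K" means 1,000 tokens and that prompt and output tokens share one context window. The names `Model`, `request_cost`, and `cheapest_fit` are hypothetical, introduced only for this example.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Model:
    name: str
    input_per_m: float    # USD per 1M input tokens
    output_per_m: float   # USD per 1M output tokens
    context_tokens: int   # assumes "128K" means 128,000 tokens


# Figures copied from the OpenAI cards in this listing.
MODELS = [
    Model("GPT-4o", 2.50, 10.00, 128_000),
    Model("GPT-4o mini", 0.150, 0.600, 128_000),
    Model("o1", 15.00, 60.00, 200_000),
    Model("o3-mini", 1.10, 4.40, 200_000),
    Model("GPT-4 Turbo", 10.00, 30.00, 128_000),
]


def request_cost(model: Model, prompt_tokens: int, output_tokens: int) -> float:
    """USD cost of one request at the model's listed per-1M prices."""
    return (prompt_tokens * model.input_per_m
            + output_tokens * model.output_per_m) / 1_000_000


def cheapest_fit(models: list[Model], prompt_tokens: int,
                 output_tokens: int) -> Optional[Model]:
    """Return the cheapest model whose context holds prompt plus output.

    Assumes prompt and output share a single context window; returns None
    when no model is large enough.
    """
    needed = prompt_tokens + output_tokens
    candidates = [m for m in models if m.context_tokens >= needed]
    if not candidates:
        return None
    return min(candidates, key=lambda m: request_cost(m, prompt_tokens, output_tokens))


small = cheapest_fit(MODELS, prompt_tokens=50_000, output_tokens=1_000)
large = cheapest_fit(MODELS, prompt_tokens=150_000, output_tokens=4_000)
print(small.name if small else None)  # GPT-4o mini
print(large.name if large else None)  # o3-mini
```

A 150,000-token prompt rules out every 128K model, which is why the second call falls back to o3-mini even though GPT-4o mini has lower per-token prices.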

Google

4 models

Gemini 2.5 Pro

Google's most capable model with 1M token context — leads benchmarks in coding and reasoning.

Input /1M

$1.25

Output /1M

$10.00

Context

1M

Gemini 2.0 Flash

Fast and affordable multimodal model with massive context window. Google's workhorse for production.

Input /1M

$0.100

Output /1M

$0.400

Context

Gemini 1.5 Pro

Industry-leading 2M context window. Ideal for processing entire codebases or long documents.

Input /1M

$1.25

Output /1M

$5.00

Context

2M

Gemini 1.5 Flash

Google's most cost-efficient model with 1M context. Best value for high-volume applications.

Input /1M

$0.075

Output /1M

$0.300

Context

1M

Meta / Groq

2 models

Llama 3.3 70B (Groq)

Meta's latest 70B open-weights model on Groq's ultra-fast inference hardware (LPU). Incredible speed.

Input /1M

$0.590

Output /1M

$0.790

Context

128K

Llama 3.1 8B (Groq)

The cheapest capable model for simple tasks — ideal for summarization, extraction, and classification.

Input /1M

$0.050

Output /1M

$0.080

Context

128K

Meta / Together AI

1 model

Llama 3.1 405B (Together AI)

Meta's largest open-source model — frontier performance with the freedom of open weights.

Input /1M

$3.50

Output /1M

$3.50

Context

128K

Mistral AI

2 models

Mistral Large 2

Mistral's flagship model — top multilingual performance and strong coding across 80+ languages.

Input /1M

$3.00

Output /1M

$9.00

Context

128K

Mistral Small 3.1

Small but mighty — vision-capable, Apache 2.0 licensed, and extremely cost-efficient.

Input /1M

$0.100

Output /1M

$0.300

Context

128K

xAI

2 models

Grok 3

xAI's latest model with extended thinking — excels at deep research and complex problem solving.

Input /1M

$3.00

Output /1M

$15.00

Context

131K

Grok 3 Mini

Compact reasoning model from xAI — great value for logic, math, and structured tasks.

Input /1M

$0.300

Output /1M

$0.500

Context

131K

Cohere

2 models

Command R+

Enterprise-grade model optimized for RAG and tool use. Excels at retrieval-augmented generation.

Input /1M

$2.50

Output /1M

$10.00

Context

128K

Command R

Affordable RAG-optimized model. Best for high-volume document search and summarization pipelines.

Input /1M

$0.150

Output /1M

$0.600

Context

128K

DeepSeek

2 models

DeepSeek R1

Open-source reasoning model rivaling o1 at a fraction of the cost. MIT licensed.

Input /1M

$0.550

Output /1M

$2.19

Context

64K

DeepSeek V3