Comparison hub

LLM API pricing comparison: every model, ranked

25+ models across OpenAI, Anthropic, and Google. Sorted by real cost, not marketing.

Section 1

Rankings by category

Four top-10 leaderboards covering the workloads that drive most API bills. Every number is computed from the pricing dataset at render time: change the rates in lib/pricing.ts and this page updates on the next build.

Cheapest per request

1,000 input / 500 output tokens

| Rank | Model | Cost / request |
| --- | --- | --- |
| 1 | GPT-5 nano | $0.000250 |
| 2 | Gemini 2.5 Flash-Lite | $0.000300 |
| 3 | GPT-4o mini | $0.000450 |
| 4 | GPT-5.4 nano | $0.000825 |
| 5 | Gemini 3.1 Flash-Lite (preview) | $0.001000 |
| 6 | GPT-5.5 nano | $0.001000 |
| 7 | GPT-4.1 mini | $0.001200 |
| 8 | GPT-5 mini | $0.001250 |
| 9 | Gemini 2.5 Flash | $0.001550 |
| 10 | Gemini 3 Flash (preview) | $0.002000 |
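The per-request figures in this table can be reproduced from the per-million-token rates listed in the full comparison matrix. The sketch below is a minimal illustration under assumptions: the `ModelPricing` shape and the `costPerRequest` name are hypothetical and are not taken from lib/pricing.ts, whose actual exports are not shown on this page.

```typescript
// Hypothetical shape; the real lib/pricing.ts may be structured differently.
interface ModelPricing {
  id: string;
  name: string;
  inputPerMTok: number;  // USD per 1M input tokens
  outputPerMTok: number; // USD per 1M output tokens
}

// Cost of one request in USD: tokens times rate, divided by one million.
function costPerRequest(
  model: ModelPricing,
  inputTokens: number,
  outputTokens: number
): number {
  return (
    (inputTokens * model.inputPerMTok + outputTokens * model.outputPerMTok) / 1e6
  );
}

// Rates copied from the GPT-5 nano row of the matrix.
const gpt5Nano: ModelPricing = {
  id: "gpt-5-nano",
  name: "GPT-5 nano",
  inputPerMTok: 0.05,
  outputPerMTok: 0.4,
};

// Standard workload used by this leaderboard: 1,000 input / 500 output tokens.
const standardCost = costPerRequest(gpt5Nano, 1000, 500);
console.log("$" + standardCost.toFixed(6)); // $0.000250
```

This plain formula matches most rows on the page. The Claude Opus 4.7 row is listed higher than its rates alone would give ($0.0201 rather than $0.0175), so the dataset presumably applies an adjustment that this sketch does not model.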

Cheapest for long context

100,000 input / 10,000 output tokens

| Rank | Model | Cost / request |
| --- | --- | --- |
| 1 | GPT-5 nano | $0.009000 |
| 2 | Gemini 2.5 Flash-Lite | $0.0140 |
| 3 | GPT-4o mini | $0.0210 |
| 4 | GPT-5.4 nano | $0.0325 |
| 5 | Gemini 3.1 Flash-Lite (preview) | $0.0400 |
| 6 | GPT-5.5 nano | $0.0400 |
| 7 | GPT-5 mini | $0.0450 |
| 8 | Gemini 2.5 Flash | $0.0550 |
| 9 | GPT-4.1 mini | $0.0560 |
| 10 | Gemini 3 Flash (preview) | $0.0800 |

Cheapest at high volume

1,000,000 requests / month at 1k input + 500 output each

| Rank | Model | Monthly total |
| --- | --- | --- |
| 1 | GPT-5 nano | $250.00 |
| 2 | Gemini 2.5 Flash-Lite | $300.00 |
| 3 | GPT-4o mini | $450.00 |
| 4 | GPT-5.4 nano | $825.00 |
| 5 | Gemini 3.1 Flash-Lite (preview) | $1,000 |
| 6 | GPT-5.5 nano | $1,000 |
| 7 | GPT-4.1 mini | $1,200 |
| 8 | GPT-5 mini | $1,250 |
| 9 | Gemini 2.5 Flash | $1,550 |
| 10 | Gemini 3 Flash (preview) | $2,000 |
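A leaderboard like the one above amounts to a per-request cost multiplied by monthly volume, sorted ascending and cut to the top N. The following is a rough sketch under assumptions: `Rate`, `monthlyTotal`, and `leaderboard` are illustrative names, not functions known to exist in the site's code, and only three rows of the dataset are included.

```typescript
// Illustrative rate record; field names are assumptions.
interface Rate {
  name: string;
  inPerM: number;  // USD per 1M input tokens
  outPerM: number; // USD per 1M output tokens
}

// Three rows copied from the comparison matrix, deliberately out of order.
const sampleRates: Rate[] = [
  { name: "GPT-4o mini", inPerM: 0.15, outPerM: 0.6 },
  { name: "GPT-5 nano", inPerM: 0.05, outPerM: 0.4 },
  { name: "Gemini 2.5 Flash-Lite", inPerM: 0.1, outPerM: 0.4 },
];

// Monthly total in USD for a fixed request shape and volume.
function monthlyTotal(
  rate: Rate,
  inputTokens: number,
  outputTokens: number,
  requestsPerMonth: number
): number {
  const perRequest =
    (inputTokens * rate.inPerM + outputTokens * rate.outPerM) / 1e6;
  return perRequest * requestsPerMonth;
}

// Cheapest-first ranking, truncated to the top N entries.
function leaderboard(
  rates: Rate[],
  inputTokens: number,
  outputTokens: number,
  requestsPerMonth: number,
  topN: number
): { name: string; monthly: number }[] {
  return rates
    .map((r) => ({
      name: r.name,
      monthly: monthlyTotal(r, inputTokens, outputTokens, requestsPerMonth),
    }))
    .sort((a, b) => a.monthly - b.monthly)
    .slice(0, topN);
}

// 1,000,000 requests / month at 1k input + 500 output each.
const board = leaderboard(sampleRates, 1000, 500, 1000000, 10);
board.forEach((row, i) =>
  console.log(i + 1 + ". " + row.name + " $" + row.monthly.toFixed(2))
);
```

With these three rows the order comes out as GPT-5 nano, Gemini 2.5 Flash-Lite, GPT-4o mini, matching ranks 1 to 3 above.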

Largest context window

For RAG, long-document, and large-codebase workloads

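| Rank | Model | Context |
| --- | --- | --- |
| 1 | Gemini 2.5 Pro | 2M |
| 2 | GPT-5.5 | 1.1M |
| 3 | GPT-5.4 | 1.1M |
| 4 | GPT-4.1 | 1.0M |
| 5 | GPT-4.1 mini | 1.0M |
| 6 | Claude Opus 4.7 | 1M |
| 7 | Claude Opus 4.6 | 1M |
| 8 | Claude Sonnet 4.6 | 1M |
| 9 | Gemini 3.1 Pro (preview) | 1M |
| 10 | Gemini 3 Flash (preview) | 1M |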

Section 2

Full comparison matrix

Every tracked model, sortable by any column. Filter by provider, minimum context window, or maximum cost per request; search by model name or id. Costs are computed at render time from lib/pricing.ts.
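The filter, search, and sort behaviour described here could be sketched roughly as follows. Everything in this block is an assumption made for illustration: the `MatrixRow` and `MatrixFilter` shapes, the function names, and the choice to treat "400k", "1M", and "200k" as round token counts are not taken from the page's actual implementation.

```typescript
type Provider = "OpenAI" | "Anthropic" | "Google";

// Hypothetical row shape for the matrix.
interface MatrixRow {
  id: string;
  name: string;
  provider: Provider;
  contextTokens: number;  // approximate, from the rounded "Context" column
  costPerRequest: number; // USD at 1k input / 500 output
}

// Every filter is optional; an omitted field matches all rows.
interface MatrixFilter {
  provider?: Provider;
  minContext?: number;
  maxCost?: number;
  query?: string; // matched case-insensitively against name and id
}

function filterRows(rows: MatrixRow[], f: MatrixFilter): MatrixRow[] {
  const q = f.query === undefined ? "" : f.query.trim().toLowerCase();
  return rows.filter(
    (r) =>
      (f.provider === undefined || r.provider === f.provider) &&
      (f.minContext === undefined || r.contextTokens >= f.minContext) &&
      (f.maxCost === undefined || r.costPerRequest <= f.maxCost) &&
      (q === "" ||
        r.name.toLowerCase().indexOf(q) !== -1 ||
        r.id.toLowerCase().indexOf(q) !== -1)
  );
}

// Returns a sorted copy; the input array is left untouched.
function sortBy<K extends keyof MatrixRow>(
  rows: MatrixRow[],
  key: K,
  descending: boolean = false
): MatrixRow[] {
  const sorted = rows
    .slice()
    .sort((a, b) => (a[key] < b[key] ? -1 : a[key] > b[key] ? 1 : 0));
  return descending ? sorted.reverse() : sorted;
}

// Three rows copied from the matrix for demonstration.
const sampleRows: MatrixRow[] = [
  { id: "gpt-5-nano", name: "GPT-5 nano", provider: "OpenAI", contextTokens: 400000, costPerRequest: 0.00025 },
  { id: "gemini-2.5-flash-lite", name: "Gemini 2.5 Flash-Lite", provider: "Google", contextTokens: 1000000, costPerRequest: 0.0003 },
  { id: "claude-haiku-4-5", name: "Claude Haiku 4.5", provider: "Anthropic", contextTokens: 200000, costPerRequest: 0.0035 },
];

// Models with at least 400k context costing at most $0.001 per request,
// largest context first.
const shortlist = sortBy(
  filterRows(sampleRows, { minContext: 400000, maxCost: 0.001 }),
  "contextTokens",
  true
);
console.log(shortlist.map((r) => r.name).join(", "));
```

Keeping the filter as a plain object of optional fields means each UI control maps to one field, and clearing a control simply removes that field.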

Showing 28 of 28 models
| Model | Model id | Provider | Input / 1M tokens | Output / 1M tokens | Cost @ 1k/500 | Cost @ 10k/5k | Context | Max output |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| GPT-5 nano | gpt-5-nano | OpenAI | $0.05 | $0.40 | $0.000250 | $0.002500 | 400k | 128k |
| Gemini 2.5 Flash-Lite | gemini-2.5-flash-lite | Google | $0.10 | $0.40 | $0.000300 | $0.003000 | 1M | - |
| GPT-4o mini | gpt-4o-mini | OpenAI | $0.15 | $0.60 | $0.000450 | $0.004500 | 128k | 16k |
| GPT-5.4 nano | gpt-5.4-nano | OpenAI | $0.20 | $1.25 | $0.000825 | $0.008250 | 400k | 128k |
| Gemini 3.1 Flash-Lite (preview) | gemini-3.1-flash-lite-preview | Google | $0.25 | $1.50 | $0.001000 | $0.0100 | 1M | - |
| GPT-5.5 nano | gpt-5.5-nano | OpenAI | $0.25 | $1.50 | $0.001000 | $0.0100 | 400k | 128k |
| GPT-4.1 mini | gpt-4.1-mini | OpenAI | $0.40 | $1.60 | $0.001200 | $0.0120 | 1.0M | 33k |
| GPT-5 mini | gpt-5-mini | OpenAI | $0.25 | $2.00 | $0.001250 | $0.0125 | 400k | 128k |
| Gemini 2.5 Flash | gemini-2.5-flash | Google | $0.30 | $2.50 | $0.001550 | $0.0155 | 1M | - |
| Gemini 3 Flash (preview) | gemini-3-flash-preview | Google | $0.50 | $3.00 | $0.002000 | $0.0200 | 1M | - |
| GPT-5.4 mini | gpt-5.4-mini | OpenAI | $0.75 | $4.50 | $0.003000 | $0.0300 | 400k | 128k |
| o4-mini | o4-mini | OpenAI | $1.10 | $4.40 | $0.003300 | $0.0330 | 200k | 100k |
| Claude Haiku 4.5 | claude-haiku-4-5 | Anthropic | $1.00 | $5.00 | $0.003500 | $0.0350 | 200k | 64k |
| GPT-5.5 mini | gpt-5.5-mini | OpenAI | $0.90 | $5.50 | $0.003650 | $0.0365 | 400k | 128k |
| GPT-4.1 | gpt-4.1 | OpenAI | $2.00 | $8.00 | $0.006000 | $0.0600 | 1.0M | 33k |
| o3 | o3 | OpenAI | $2.00 | $8.00 | $0.006000 | $0.0600 | 200k | 100k |
| Gemini 2.5 Pro | gemini-2.5-pro | Google | $1.25 | $10.00 | $0.006250 | $0.0625 | 2M | - |
| GPT-5 | gpt-5 | OpenAI | $1.25 | $10.00 | $0.006250 | $0.0625 | 400k | 128k |
| GPT-4o | gpt-4o | OpenAI | $2.50 | $10.00 | $0.007500 | $0.0750 | 128k | 16k |
| Gemini 3.1 Pro (preview) | gemini-3.1-pro-preview | Google | $2.00 | $12.00 | $0.008000 | $0.0800 | 1M | - |
| GPT-5.4 | gpt-5.4 | OpenAI | $2.50 | $15.00 | $0.0100 | $0.1000 | 1.1M | 128k |
| Claude Sonnet 4.6 | claude-sonnet-4-6 | Anthropic | $3.00 | $15.00 | $0.0105 | $0.1050 | 1M | 64k |
| Claude Sonnet 4.5 | claude-sonnet-4-5 | Anthropic | $3.00 | $15.00 | $0.0105 | $0.1050 | 200k | 64k |
| GPT-5.5 | gpt-5.5 | OpenAI | $3.00 | $20.00 | $0.0130 | $0.1300 | 1.1M | 128k |
| Claude Opus 4.6 | claude-opus-4-6 | Anthropic | $5.00 | $25.00 | $0.0175 | $0.1750 | 1M | 128k |
| Claude Opus 4.5 | claude-opus-4-5 | Anthropic | $5.00 | $25.00 | $0.0175 | $0.1750 | 200k | 64k |
| Claude Opus 4.7 | claude-opus-4-7 | Anthropic | $5.00 | $25.00 | $0.0201 | $0.2012 | 1M | 128k |
| Claude Opus 4.1 | claude-opus-4-1 | Anthropic | $15.00 | $75.00 | $0.0525 | $0.5250 | 200k | 32k |

Section 3

Cost scenarios

Same workload, every model. Each bar is scaled to the most expensive model in its scenario, so you can eyeball the spread between cheapest and most expensive (more than 200× in every scenario below) without doing arithmetic.
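One way to derive such a bar is to express each monthly cost as a percentage of the largest monthly cost in the scenario. The sketch below assumes this simple linear scaling; the names `ScenarioRow`, `BarRow`, and `withBars` are hypothetical, and the page's real rendering logic may differ.

```typescript
interface ScenarioRow {
  name: string;
  monthly: number; // USD per month
}

interface BarRow extends ScenarioRow {
  widthPct: number; // bar width as a percentage of the most expensive row
}

// Scales every row against the scenario maximum (linear scale assumed).
function withBars(rows: ScenarioRow[]): BarRow[] {
  let max = 0;
  for (const r of rows) {
    if (r.monthly > max) max = r.monthly;
  }
  return rows.map((r) => ({
    name: r.name,
    monthly: r.monthly,
    widthPct: max > 0 ? (r.monthly / max) * 100 : 0,
  }));
}

// Three monthly totals copied from the "Simple chatbot" scenario.
const chatbotRows: ScenarioRow[] = [
  { name: "GPT-5 nano", monthly: 1.05 },
  { name: "Claude Haiku 4.5", monthly: 15.0 },
  { name: "Claude Opus 4.1", monthly: 225.0 },
];

const chatbotBars = withBars(chatbotRows);
// Ratio between the most and least expensive rows in this sample.
const chatbotSpread =
  chatbotRows[chatbotRows.length - 1].monthly / chatbotRows[0].monthly;

for (const b of chatbotBars) {
  console.log(b.name + ": " + b.widthPct.toFixed(2) + "%");
}
console.log("spread: " + chatbotSpread.toFixed(1) + "x");
```

On a linear scale the cheapest models end up with bars well under 1% of the full width, which is the point the section makes about the size of the spread.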

Simple chatbot

Lightweight conversational UX: 500 in / 200 out tokens, 10,000 requests/month.

| Rank | Model | Provider | Cost / req | Monthly |
| --- | --- | --- | --- | --- |
| 1 | GPT-5 nano | OpenAI | $0.000105 | $1.05 |
| 2 | Gemini 2.5 Flash-Lite | Google | $0.000130 | $1.30 |
| 3 | GPT-4o mini | OpenAI | $0.000195 | $1.95 |
| 4 | GPT-5.4 nano | OpenAI | $0.000350 | $3.50 |
| 5 | Gemini 3.1 Flash-Lite (preview) | Google | $0.000425 | $4.25 |
| 6 | GPT-5.5 nano | OpenAI | $0.000425 | $4.25 |
| 7 | GPT-4.1 mini | OpenAI | $0.000520 | $5.20 |
| 8 | GPT-5 mini | OpenAI | $0.000525 | $5.25 |
| 9 | Gemini 2.5 Flash | Google | $0.000650 | $6.50 |
| 10 | Gemini 3 Flash (preview) | Google | $0.000850 | $8.50 |
| 11 | GPT-5.4 mini | OpenAI | $0.001275 | $12.75 |
| 12 | o4-mini | OpenAI | $0.001430 | $14.30 |
| 13 | Claude Haiku 4.5 | Anthropic | $0.001500 | $15.00 |
| 14 | GPT-5.5 mini | OpenAI | $0.001550 | $15.50 |
| 15 | GPT-4.1 | OpenAI | $0.002600 | $26.00 |
| 16 | o3 | OpenAI | $0.002600 | $26.00 |
| 17 | Gemini 2.5 Pro | Google | $0.002625 | $26.25 |
| 18 | GPT-5 | OpenAI | $0.002625 | $26.25 |
| 19 | GPT-4o | OpenAI | $0.003250 | $32.50 |
| 20 | Gemini 3.1 Pro (preview) | Google | $0.003400 | $34.00 |
| 21 | GPT-5.4 | OpenAI | $0.004250 | $42.50 |
| 22 | Claude Sonnet 4.6 | Anthropic | $0.004500 | $45.00 |
| 23 | Claude Sonnet 4.5 | Anthropic | $0.004500 | $45.00 |
| 24 | GPT-5.5 | OpenAI | $0.005500 | $55.00 |
| 25 | Claude Opus 4.6 | Anthropic | $0.007500 | $75.00 |
| 26 | Claude Opus 4.5 | Anthropic | $0.007500 | $75.00 |
| 27 | Claude Opus 4.7 | Anthropic | $0.008625 | $86.25 |
| 28 | Claude Opus 4.1 | Anthropic | $0.0225 | $225.00 |

Coding agent

Iterative code edits: 5,000 in / 2,000 out tokens, 1,000 requests/month.

| Rank | Model | Provider | Cost / req | Monthly |
| --- | --- | --- | --- | --- |
| 1 | GPT-5 nano | OpenAI | $0.001050 | $1.05 |
| 2 | Gemini 2.5 Flash-Lite | Google | $0.001300 | $1.30 |
| 3 | GPT-4o mini | OpenAI | $0.001950 | $1.95 |
| 4 | GPT-5.4 nano | OpenAI | $0.003500 | $3.50 |
| 5 | Gemini 3.1 Flash-Lite (preview) | Google | $0.004250 | $4.25 |
| 6 | GPT-5.5 nano | OpenAI | $0.004250 | $4.25 |
| 7 | GPT-4.1 mini | OpenAI | $0.005200 | $5.20 |
| 8 | GPT-5 mini | OpenAI | $0.005250 | $5.25 |
| 9 | Gemini 2.5 Flash | Google | $0.006500 | $6.50 |
| 10 | Gemini 3 Flash (preview) | Google | $0.008500 | $8.50 |
| 11 | GPT-5.4 mini | OpenAI | $0.0128 | $12.75 |
| 12 | o4-mini | OpenAI | $0.0143 | $14.30 |
| 13 | Claude Haiku 4.5 | Anthropic | $0.0150 | $15.00 |
| 14 | GPT-5.5 mini | OpenAI | $0.0155 | $15.50 |
| 15 | GPT-4.1 | OpenAI | $0.0260 | $26.00 |
| 16 | o3 | OpenAI | $0.0260 | $26.00 |
| 17 | Gemini 2.5 Pro | Google | $0.0263 | $26.25 |
| 18 | GPT-5 | OpenAI | $0.0263 | $26.25 |
| 19 | GPT-4o | OpenAI | $0.0325 | $32.50 |
| 20 | Gemini 3.1 Pro (preview) | Google | $0.0340 | $34.00 |
| 21 | GPT-5.4 | OpenAI | $0.0425 | $42.50 |
| 22 | Claude Sonnet 4.6 | Anthropic | $0.0450 | $45.00 |
| 23 | Claude Sonnet 4.5 | Anthropic | $0.0450 | $45.00 |
| 24 | GPT-5.5 | OpenAI | $0.0550 | $55.00 |
| 25 | Claude Opus 4.6 | Anthropic | $0.0750 | $75.00 |
| 26 | Claude Opus 4.5 | Anthropic | $0.0750 | $75.00 |
| 27 | Claude Opus 4.7 | Anthropic | $0.0862 | $86.25 |
| 28 | Claude Opus 4.1 | Anthropic | $0.2250 | $225.00 |

RAG pipeline

Retrieval-augmented answers: 50,000 in / 1,000 out tokens, 5,000 requests/month.

| Rank | Model | Provider | Cost / req | Monthly |
| --- | --- | --- | --- | --- |
| 1 | GPT-5 nano | OpenAI | $0.002900 | $14.50 |
| 2 | Gemini 2.5 Flash-Lite | Google | $0.005400 | $27.00 |
| 3 | GPT-4o mini | OpenAI | $0.008100 | $40.50 |
| 4 | GPT-5.4 nano | OpenAI | $0.0113 | $56.25 |
| 5 | Gemini 3.1 Flash-Lite (preview) | Google | $0.0140 | $70.00 |
| 6 | GPT-5.5 nano | OpenAI | $0.0140 | $70.00 |
| 7 | GPT-5 mini | OpenAI | $0.0145 | $72.50 |
| 8 | Gemini 2.5 Flash | Google | $0.0175 | $87.50 |
| 9 | GPT-4.1 mini | OpenAI | $0.0216 | $108.00 |
| 10 | Gemini 3 Flash (preview) | Google | $0.0280 | $140.00 |
| 11 | GPT-5.4 mini | OpenAI | $0.0420 | $210.00 |
| 12 | GPT-5.5 mini | OpenAI | $0.0505 | $252.50 |
| 13 | Claude Haiku 4.5 | Anthropic | $0.0550 | $275.00 |
| 14 | o4-mini | OpenAI | $0.0594 | $297.00 |
| 15 | Gemini 2.5 Pro | Google | $0.0725 | $362.50 |
| 16 | GPT-5 | OpenAI | $0.0725 | $362.50 |
| 17 | GPT-4.1 | OpenAI | $0.1080 | $540.00 |
| 18 | o3 | OpenAI | $0.1080 | $540.00 |
| 19 | Gemini 3.1 Pro (preview) | Google | $0.1120 | $560.00 |
| 20 | GPT-4o | OpenAI | $0.1350 | $675.00 |
| 21 | GPT-5.4 | OpenAI | $0.1400 | $700.00 |
| 22 | Claude Sonnet 4.6 | Anthropic | $0.1650 | $825.00 |
| 23 | Claude Sonnet 4.5 | Anthropic | $0.1650 | $825.00 |
| 24 | GPT-5.5 | OpenAI | $0.1700 | $850.00 |
| 25 | Claude Opus 4.6 | Anthropic | $0.2750 | $1,375 |
| 26 | Claude Opus 4.5 | Anthropic | $0.2750 | $1,375 |
| 27 | Claude Opus 4.7 | Anthropic | $0.3162 | $1,581 |
| 28 | Claude Opus 4.1 | Anthropic | $0.8250 | $4,125 |

Document processing

Long-document summarisation: 100,000 in / 5,000 out tokens, 500 requests/month.

| Rank | Model | Provider | Cost / req | Monthly |
| --- | --- | --- | --- | --- |
| 1 | GPT-5 nano | OpenAI | $0.007000 | $3.50 |
| 2 | Gemini 2.5 Flash-Lite | Google | $0.0120 | $6.00 |
| 3 | GPT-4o mini | OpenAI | $0.0180 | $9.00 |
| 4 | GPT-5.4 nano | OpenAI | $0.0263 | $13.13 |
| 5 | Gemini 3.1 Flash-Lite (preview) | Google | $0.0325 | $16.25 |
| 6 | GPT-5.5 nano | OpenAI | $0.0325 | $16.25 |
| 7 | GPT-5 mini | OpenAI | $0.0350 | $17.50 |
| 8 | Gemini 2.5 Flash | Google | $0.0425 | $21.25 |
| 9 | GPT-4.1 mini | OpenAI | $0.0480 | $24.00 |
| 10 | Gemini 3 Flash (preview) | Google | $0.0650 | $32.50 |
| 11 | GPT-5.4 mini | OpenAI | $0.0975 | $48.75 |
| 12 | GPT-5.5 mini | OpenAI | $0.1175 | $58.75 |
| 13 | Claude Haiku 4.5 | Anthropic | $0.1250 | $62.50 |
| 14 | o4-mini | OpenAI | $0.1320 | $66.00 |
| 15 | Gemini 2.5 Pro | Google | $0.1750 | $87.50 |
| 16 | GPT-5 | OpenAI | $0.1750 | $87.50 |
| 17 | GPT-4.1 | OpenAI | $0.2400 | $120.00 |
| 18 | o3 | OpenAI | $0.2400 | $120.00 |
| 19 | Gemini 3.1 Pro (preview) | Google | $0.2600 | $130.00 |
| 20 | GPT-4o | OpenAI | $0.3000 | $150.00 |
| 21 | GPT-5.4 | OpenAI | $0.3250 | $162.50 |
| 22 | Claude Sonnet 4.6 | Anthropic | $0.3750 | $187.50 |
| 23 | Claude Sonnet 4.5 | Anthropic | $0.3750 | $187.50 |
| 24 | GPT-5.5 | OpenAI | $0.4000 | $200.00 |
| 25 | Claude Opus 4.6 | Anthropic | $0.6250 | $312.50 |
| 26 | Claude Opus 4.5 | Anthropic | $0.6250 | $312.50 |
| 27 | Claude Opus 4.7 | Anthropic | $0.7188 | $359.38 |
| 28 | Claude Opus 4.1 | Anthropic | $1.88 | $937.50 |

Section 4

Every pair, one click away

378 head-to-head pages, generated programmatically from the pricing dataset. Every link resolves to a dedicated comparison page with side-by-side rates, scenario cost ladders, and volume projections.

Anthropic vs OpenAI

105 pairs

OpenAI vs Google

90 pairs

Anthropic vs Google

42 pairs

Within OpenAI

105 pairs

Groupings reflect lib/pricing.ts providers: Anthropic · OpenAI · Google.
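The pair counts above follow from simple combinatorics: 28 models give 28 × 27 / 2 = 378 unordered pairs. A minimal sketch of generating and grouping them is shown below. The per-provider counts (15 OpenAI, 7 Anthropic, 6 Google) are tallied from the comparison matrix; the model ids are placeholders, and `allPairs` and `groupKey` are hypothetical names rather than the site's actual generator.

```typescript
type Provider = "OpenAI" | "Anthropic" | "Google";

interface ModelRef {
  id: string;
  provider: Provider;
}

// Every unordered pair (i < j), so n items yield n * (n - 1) / 2 pairs.
function allPairs<T>(items: T[]): Array<[T, T]> {
  const pairs: Array<[T, T]> = [];
  for (let i = 0; i < items.length; i++) {
    for (let j = i + 1; j < items.length; j++) {
      pairs.push([items[i], items[j]]);
    }
  }
  return pairs;
}

// Group label; cross-provider keys are alphabetical, so the page's
// "OpenAI vs Google" group appears here as "Google vs OpenAI".
function groupKey(a: Provider, b: Provider): string {
  return a === b ? "Within " + a : [a, b].sort().join(" vs ");
}

// Placeholder ids: only the counts come from the matrix.
function placeholderModels(provider: Provider, count: number): ModelRef[] {
  const out: ModelRef[] = [];
  for (let i = 0; i < count; i++) {
    out.push({ id: provider.toLowerCase() + "-" + i, provider: provider });
  }
  return out;
}

const trackedModels = placeholderModels("OpenAI", 15).concat(
  placeholderModels("Anthropic", 7),
  placeholderModels("Google", 6)
);

const modelPairs = allPairs(trackedModels);
const pairCounts: Record<string, number> = {};
for (let i = 0; i < modelPairs.length; i++) {
  const key = groupKey(modelPairs[i][0].provider, modelPairs[i][1].provider);
  pairCounts[key] = (pairCounts[key] || 0) + 1;
}

console.log(modelPairs.length); // 378
console.log(pairCounts);
```

Under this counting the four groups listed above sum to 342 pairs; the remaining 36 are the within-Anthropic (21) and within-Google (15) pairs.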

Section 5

Frequently asked questions

Every answer is composed from the current pricing dataset at render time, and the same text feeds both the visible answers and the FAQPage JSON-LD embedded for search engines.

Which LLM API is the cheapest?
Based on current pricing, GPT-5 nano is the cheapest model at $0.000250 per request for a standard workload of 1,000 input / 500 output tokens. It also has the lowest input-token rate, $0.05 per million input tokens, which matters most for input-heavy workloads at scale.
How much does it cost to run an LLM chatbot?
A chatbot handling 10,000 messages per month (500 input / 200 output tokens each) costs between $1.05 on GPT-5 nano and $225.00 on Claude Opus 4.1. Use the comparison table above to pick the cheapest model that meets your quality bar.
Is GPT-5.4 cheaper than Claude Opus 4.7?
At a standard workload (1,000 input / 500 output tokens), GPT-5.4 costs $0.0100 per request while Claude Opus 4.7 costs $0.0201. GPT-5.4 is cheaper by $0.0101 per request: roughly 50% less.
What is the cheapest LLM for coding?
For coding tasks at 5,000 input / 2,000 output tokens, GPT-5 nano offers the lowest cost at $0.001050 per request. The next-cheapest option is Gemini 2.5 Flash-Lite at $0.001300 per request. Models in this price tier are typically fine for boilerplate edits and riskier for architectural reasoning, so check quality before committing.
How do LLM API prices compare in 2026?
LLM API prices dropped roughly 80% from 2024 to 2026. The models tracked here now cost between $0.05 and $15.00 per million input tokens, with budget models at the $0.05 end of that range. See the full comparison table above for every currently tracked model.
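The FAQPage JSON-LD mentioned at the top of this section follows the schema.org structure of a `FAQPage` whose `mainEntity` is a list of `Question` items, each with an `acceptedAnswer`. A minimal sketch of building it from question/answer pairs is shown below; the `Faq` type and the `faqJsonLd` function are hypothetical names, and how the page actually composes and embeds the markup is not shown in the source.

```typescript
// Hypothetical input shape: one rendered question and its answer text.
interface Faq {
  question: string;
  answer: string;
}

// Serialises FAQs as schema.org FAQPage JSON-LD, suitable for embedding in a
// <script type="application/ld+json"> tag.
function faqJsonLd(faqs: Faq[]): string {
  return JSON.stringify({
    "@context": "https://schema.org",
    "@type": "FAQPage",
    mainEntity: faqs.map((f) => ({
      "@type": "Question",
      name: f.question,
      acceptedAnswer: {
        "@type": "Answer",
        text: f.answer,
      },
    })),
  });
}

// Example input using one question from this section; the answer text is
// abbreviated for the demonstration.
const exampleJsonLd = faqJsonLd([
  {
    question: "Which LLM API is the cheapest?",
    answer:
      "GPT-5 nano is the cheapest model at $0.000250 per request for 1,000 input / 500 output tokens.",
  },
]);
console.log(exampleJsonLd);
```

Generating the visible answer and the JSON-LD from the same `Faq` objects keeps the two from drifting apart when rates change.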

Estimate the cost of your actual prompt

Paste a prompt, pick a model, and see the exact cost before you send anything.