AI Model Pricing Comparison 2026

Q: How is AI model pricing calculated?

AI models charge per token, where a token is roughly 3/4 of a word. Prices are quoted per million tokens. Most models charge separately for input (prompt) tokens and output (completion) tokens, with output typically costing 2-4x more than input.

Q: Why do output tokens cost more than input tokens?

Output tokens require the model to generate new text, which is computationally more expensive than reading input. The model must perform inference for each output token sequentially, while input tokens can be processed in parallel.

Q: What is the cheapest good AI model?

Among paid models with benchmark data, LiquidAI: LFM2-2.6B offers some of the lowest pricing at $0.01/1M input tokens. There are also many free models available.

Q: How much does it cost to use AI models per month?

For a typical individual workload of 100 requests per day at 2,000 tokens each, monthly costs range from $0 (free models) to $50+ (premium models). Enterprise usage at higher volumes can cost significantly more. Use the cost examples on this page to estimate your specific usage.

Q: Are free AI models any good?

Yes, there are 27 free models available, and some rank well on intelligence benchmarks. Free models are a great way to start, though premium models generally offer better quality for demanding tasks.

Compare pricing for 349+ AI models side by side. From free open-source models to premium enterprise options, find the right price-performance balance for your use case.

Price Tiers

Free

27 models

Top by intelligence

Google: Gemma 3n 2B (free)4.8

Budget

<$1/1M

231 models

Top by intelligence

MiniMax: MiniMax M249.6 OpenAI: GPT-5.4 Nano44.6 OpenAI: GPT-5.4 Mini44.6

Mid-Range

$1-$5/1M

74 models

Top by intelligence

Google: Gemini 3.1 Pro Preview Custom Tools57.2 Google: Gemini 3.1 Pro Preview57.2 Google: Nano Banana Pro (Gemini 3 Pro Image Preview)48.4

Premium

$5-$15/1M

11 models

Top by intelligence

Anthropic: Claude Opus 446.5 OpenAI: o3 Deep Research38.4 OpenAI: o123.7

Enterprise

$15+/1M

6 models

Top by intelligence

OpenAI: o3 Pro38.4 OpenAI: o1-pro30.8 OpenAI: GPT-418.6

Understanding AI Pricing

AI model pricing is based on tokens, which are the fundamental units of text that language models process. One token is roughly 3/4 of an English word, so 1 million tokens equals approximately 750,000 words or about 1,500 pages of text.

Most providers charge separately for input tokens (the text you send to the model) and output tokens (the text the model generates). Output tokens typically cost 2-4x more than input tokens because generating new text requires more computational work than reading existing text. Each output token must be produced sequentially through a full forward pass of the neural network.

To estimate your monthly cost, multiply your average daily requests by 30, then by your average tokens per request. For example, 100 requests per day at 2,000 tokens each equals 6 million tokens per month. At $3/1M tokens, that would be $18/month. Some providers also offer cached token pricing at a discount for repeated prompts, and batch processing at lower rates for non-real-time workloads.

Free models are a great starting point for experimentation and low-volume use cases. As your needs grow, budget and mid-range models offer an excellent balance of quality and cost. Premium and enterprise models deliver the highest benchmark scores for tasks demanding maximum accuracy, like complex coding or research analysis.

Monthly Cost Example

100 requests/day at 2,000 tokens each (input + output)

Model	Provider	Input/1M	Output/1M	Est. Monthly
LiquidAI: LFM2-2.6B	Liquid AI	$0.01	$0.02	$0.18/mo
LiquidAI: LFM2-8B-A1B	Liquid AI	$0.01	$0.02	$0.18/mo
IBM: Granite 4.0 Micro	Ibm Granite	$0.02	$0.11	$0.76/mo
Mistral: Mistral Nemo	Mistral AI	$0.02	$0.04	$0.36/mo
Llama Guard 3 8B	Meta	$0.02	$0.06	$0.48/mo

All Model Pricing

Model	Provider	Input/1M	Output/1M	Context	Intelligence
Body Builder (beta)	OpenRouter	<$0.01	<$0.01	128K	--
Auto Router	OpenRouter	<$0.01	<$0.01	2M	--
StepFun: Step 3.5 Flash (free)Free	StepFun	Free	Free	256K	--
Google: Gemma 3 4B (free)Free	Google	Free	Free	33K	--
Google: Gemma 3 27B (free)Free	Google	Free	Free	131K	--
NVIDIA: Nemotron 3 Super (free)Free	NVIDIA	Free	Free	262K	--
NVIDIA: Nemotron Nano 12B 2 VL (free)Free	NVIDIA	Free	Free	128K	--
Google: Gemma 3n 2B (free)Free	Google	Free	Free	8K	4.8
Venice: Uncensored (free)Free	Cognitive Computations	Free	Free	33K	--
Free Models RouterFree	OpenRouter	Free	Free	200K	--
Z.ai: GLM 4.5 Air (free)Free	Z Ai	Free	Free	131K	--
Arcee AI: Trinity Large Preview (free)Free	Arcee AI	Free	Free	131K	--
Qwen: Qwen3 Coder 480B A35B (free)Free	Qwen	Free	Free	262K	--
NVIDIA: Nemotron 3 Nano 30B A3B (free)Free	NVIDIA	Free	Free	256K	--
Qwen: Qwen3 4B (free)Free	Qwen	Free	Free	41K	--
Qwen: Qwen3 Next 80B A3B Instruct (free)Free	Qwen	Free	Free	262K	--
NVIDIA: Nemotron Nano 9B V2 (free)Free	NVIDIA	Free	Free	128K	--
Arcee AI: Trinity Mini (free)Free	Arcee AI	Free	Free	131K	--
Google: Gemma 3n 4B (free)Free	Google	Free	Free	8K	--
Mistral: Mistral Small 3.1 24B (free)Free	Mistral AI	Free	Free	128K	--
Nous: Hermes 3 405B Instruct (free)Free	Nousresearch	Free	Free	131K	--
Meta: Llama 3.3 70B Instruct (free)Free	Meta	Free	Free	66K	--
LiquidAI: LFM2.5-1.2B-Instruct (free)Free	Liquid AI	Free	Free	33K	--
LiquidAI: LFM2.5-1.2B-Thinking (free)Free	Liquid AI	Free	Free	33K	--
OpenAI: gpt-oss-120b (free)Free	OpenAI	Free	Free	131K	--
OpenAI: gpt-oss-20b (free)Free	OpenAI	Free	Free	131K	--
Google: Gemma 3 12B (free)Free	Google	Free	Free	33K	--
Meta: Llama 3.2 3B Instruct (free)Free	Meta	Free	Free	131K	--
MiniMax: MiniMax M2.5 (free)Free	MiniMax	Free	Free	197K	--
LiquidAI: LFM2-2.6B	Liquid AI	$0.01	$0.02	33K	--
LiquidAI: LFM2-8B-A1B	Liquid AI	$0.01	$0.02	33K	--
IBM: Granite 4.0 Micro	Ibm Granite	$0.02	$0.11	131K	--
Mistral: Mistral Nemo	Mistral AI	$0.02	$0.04	131K	--
Llama Guard 3 8B	Meta	$0.02	$0.06	131K	--
Google: Gemma 3n 4B	Google	$0.02	$0.04	33K	6.4
Meta: Llama 3.1 8B Instruct	Meta	$0.02	$0.05	16K	11.8
Meta: Llama 3.2 1B Instruct	Meta	$0.03	$0.20	60K	8.7
Qwen: Qwen2.5 Coder 7B Instruct	Qwen	$0.03	$0.09	33K	--
LiquidAI: LFM2-24B-A2B	Liquid AI	$0.03	$0.12	33K	--
Google: Gemma 2 9B	Google	$0.03	$0.09	8K	--
Mistral: Mistral Small 3.1 24B	Mistral AI	$0.03	$0.11	131K	12.7
Meta: Llama 3 8B Instruct	Meta	$0.03	$0.04	8K	11.8
OpenAI: gpt-oss-20b	OpenAI	$0.03	$0.11	131K	24.5
Qwen: Qwen-Turbo	Qwen	$0.03	$0.13	131K	--
Amazon: Nova Micro 1.0	Amazon	$0.04	$0.14	128K	10.3
Cohere: Command R7B (12-2024)	Cohere	$0.04	$0.15	128K	--
OpenAI: gpt-oss-120b	OpenAI	$0.04	$0.19	131K	33.3
Sao10K: Llama 3 8B Lunaris	Sao10k	$0.04	$0.05	8K	--
NVIDIA: Nemotron Nano 9B V2	NVIDIA	$0.04	$0.16	131K	14.8
Google: Gemma 3 12B	Google	$0.04	$0.13	131K	8.8
Google: Gemma 3 4B	Google	$0.04	$0.08	131K	6.3
Qwen: Qwen2.5 7B Instruct	Qwen	$0.04	$0.10	33K	--
Arcee AI: Trinity Mini	Arcee AI	$0.04	$0.15	131K	--
Meta: Llama 3.2 11B Vision Instruct	Meta	$0.05	$0.05	131K	8.7
OpenAI: GPT-5 Nano	OpenAI	$0.05	$0.40	400K	13.8
AllenAI: Olmo 2 32B Instruct	Allen AI	$0.05	$0.20	128K	--
Mistral: Mistral Small 3	Mistral AI	$0.05	$0.08	33K	15.1
Qwen: Qwen3.5-9B	Qwen	$0.05	$0.15	256K	--
NVIDIA: Nemotron 3 Nano 30B A3B	NVIDIA	$0.05	$0.20	262K	24.3
Qwen: Qwen3 8B	Qwen	$0.05	$0.40	41K	--
Meta: Llama 3.2 3B Instruct	Meta	$0.05	$0.34	80K	9.7
Qwen: Qwen3 14B	Qwen	$0.06	$0.24	41K	--
Z.ai: GLM 4.7 Flash	Z Ai	$0.06	$0.40	203K	--
Amazon: Nova Lite 1.0	Amazon	$0.06	$0.24	300K	12.7
MythoMax 13B	Gryphe	$0.06	$0.06	4K	--
Microsoft: Phi 4	Microsoft	$0.07	$0.14	16K	--
Qwen: Qwen3.5-Flash	Qwen	$0.07	$0.26	1M	--
Qwen: Qwen3 Coder 30B A3B Instruct	Qwen	$0.07	$0.27	160K	--
Baidu: ERNIE 4.5 21B A3B Thinking	Baidu	$0.07	$0.28	131K	--
Baidu: ERNIE 4.5 21B A3B	Baidu	$0.07	$0.28	120K	--
Qwen: Qwen3 235B A22B Instruct 2507	Qwen	$0.07	$0.10	262K	--
Mistral: Mistral Small 3.2 24B	Mistral AI	$0.07	$0.20	128K	12.7
OpenAI: gpt-oss-safeguard-20b	OpenAI	$0.07	$0.30	131K	--
Google: Gemini 2.0 Flash Lite	Google	$0.07	$0.30	1.0M	16.8
ByteDance Seed: Seed 1.6 Flash	ByteDance Seed	$0.07	$0.30	262K	--
Meta: Llama 4 Scout	Meta	$0.08	$0.30	328K	13.5
Qwen: Qwen3 32B	Qwen	$0.08	$0.24	41K	--
Qwen: Qwen3 VL 8B Instruct	Qwen	$0.08	$0.50	131K	--
Google: Gemma 3 27B	Google	$0.08	$0.16	131K	10.3
Qwen: Qwen3 30B A3B	Qwen	$0.08	$0.28	41K	--
Qwen: Qwen3 30B A3B Thinking 2507	Qwen	$0.08	$0.40	131K	--
Xiaomi: MiMo-V2-Flash	Xiaomi	$0.09	$0.29	262K	--
Qwen: Qwen3 30B A3B Instruct 2507	Qwen	$0.09	$0.30	262K	--
Qwen: Qwen3 Next 80B A3B Instruct	Qwen	$0.09	$1.10	262K	--
Tongyi DeepResearch 30B A3B	Alibaba	$0.09	$0.45	131K	15.3
Qwen: Qwen3 Next 80B A3B Thinking	Qwen	$0.10	$0.78	131K	--
Mistral: Devstral Small 1.1	Mistral AI	$0.10	$0.30	131K	19.5
Mistral: Ministral 3 3B 2512	Mistral AI	$0.10	$0.10	131K	11.2
Meta: Llama 3.3 70B Instruct	Meta	$0.10	$0.32	131K	14.5
Mistral: Voxtral Small 24B 2507	Mistral AI	$0.10	$0.30	32K	--
NVIDIA: Nemotron 3 Super	NVIDIA	$0.10	$0.50	262K	36.0
Mistral: Mistral Small Creative	Mistral AI	$0.10	$0.30	33K	10.2
Google: Gemini 2.5 Flash Lite Preview 09-2025	Google	$0.10	$0.40	1.0M	33.5
Z.ai: GLM 4 32B	Z Ai	$0.10	$0.10	128K	--
ByteDance: UI-TARS 7B	ByteDance	$0.10	$0.20	128K	--
Google: Gemini 2.5 Flash Lite	Google	$0.10	$0.40	1.0M	--
Mistral: Pixtral 12B	Mistral AI	$0.10	$0.10	33K	--
StepFun: Step 3.5 Flash	StepFun	$0.10	$0.30	256K	37.8
ByteDance Seed: Seed-2.0-Mini	ByteDance Seed	$0.10	$0.40	262K	--
NVIDIA: Llama 3.3 Nemotron Super 49B V1.5	NVIDIA	$0.10	$0.40	131K	18.5
Google: Gemini 2.0 Flash	Google	$0.10	$0.40	1.0M	14.5
OpenAI: GPT-4.1 Nano	OpenAI	$0.10	$0.40	1.0M	--
Qwen: Qwen3 VL 32B Instruct	Qwen	$0.10	$0.42	131K	--
Mistral: Mistral 7B Instruct v0.1	Mistral AI	$0.11	$0.19	3K	7.4
Qwen: Qwen3 VL 8B Thinking	Qwen	$0.12	$1.36	131K	--
Qwen: Qwen3 Coder Next	Qwen	$0.12	$0.75	262K	--
Qwen2.5 72B Instruct	Qwen	$0.12	$0.39	33K	--
Z.ai: GLM 4.5 Air	Z Ai	$0.13	$0.85	131K	--
Qwen: Qwen3 VL 30B A3B Instruct	Qwen	$0.13	$0.52	131K	--
Nous: Hermes 4 70B	Nousresearch	$0.13	$0.40	131K	--
Qwen: Qwen3 VL 30B A3B Thinking	Qwen	$0.13	$1.56	131K	--
Qwen: Qwen VL Plus	Qwen	$0.14	$0.41	131K	--
Baidu: ERNIE 4.5 VL 28B A3B	Baidu	$0.14	$0.56	30K	--
Tencent: Hunyuan A13B Instruct	Tencent	$0.14	$0.57	131K	--
NousResearch: Hermes 2 Pro - Llama-3 8B	Nousresearch	$0.14	$0.14	8K	--
Qwen: Qwen3 235B A22B Thinking 2507	Qwen	$0.15	$1.50	131K	--
OpenAI: GPT-4o-mini Search Preview	OpenAI	$0.15	$0.60	128K	12.6
AllenAI: Olmo 3.1 32B Think	Allen AI	$0.15	$0.50	66K	--
Cohere: Command R (08-2024)	Cohere	$0.15	$0.60	128K	--
AllenAI: Olmo 3 32B Think	Allen AI	$0.15	$0.50	66K	--
Upstage: Solar Pro 3	Upstage	$0.15	$0.60	128K	--
Qwen: QwQ 32B	Qwen	$0.15	$0.58	131K	--
EssentialAI: Rnj 1 Instruct	Essential AI	$0.15	$0.15	33K	--
OpenAI: GPT-4o-mini	OpenAI	$0.15	$0.60	128K	--
Mistral: Ministral 3 8B 2512	Mistral AI	$0.15	$0.15	262K	14.8
Mistral: Mistral Small 4	Mistral AI	$0.15	$0.60	262K	26.9
OpenAI: GPT-4o-mini (2024-07-18)	OpenAI	$0.15	$0.60	128K	--
Meta: Llama 4 Maverick	Meta	$0.15	$0.60	1.0M	18.4
DeepSeek: DeepSeek V3.1	DeepSeek	$0.15	$0.75	33K	--
Qwen: Qwen3.5-35B-A3B	Qwen	$0.16	$1.30	262K	--
TheDrummer: Rocinante 12B	Thedrummer	$0.17	$0.43	33K	--
Arcee AI: Spotlight	Arcee AI	$0.18	$0.18	131K	--
Meta: Llama Guard 4 12B	Meta	$0.18	$0.18	164K	--
Qwen: Qwen3 Coder Flash	Qwen	$0.20	$0.97	1M	--
Qwen: Qwen3.5-27B	Qwen	$0.20	$1.56	262K	--
Mistral: Saba	Mistral AI	$0.20	$0.60	33K	12.1
Qwen: Qwen3 VL 235B A22B Instruct	Qwen	$0.20	$0.88	262K	--
Meituan: LongCat Flash Chat	Meituan	$0.20	$0.80	131K	--
MiniMax: MiniMax M2.5	MiniMax	$0.20	$1.17	197K	36.1
xAI: Grok 4.1 Fast	xAI	$0.20	$0.50	2M	--
AllenAI: Olmo 3.1 32B Instruct	Allen AI	$0.20	$0.60	66K	--
Qwen: Qwen2.5-VL 7B Instruct	Qwen	$0.20	$0.20	33K	--
Prime Intellect: INTELLECT-3	Prime Intellect	$0.20	$1.10	131K	--
OpenAI: GPT-5.4 Nano	OpenAI	$0.20	$1.25	400K	44.6
xAI: Grok 4 Fast	xAI	$0.20	$0.50	2M	35.1
Mistral: Ministral 3 14B 2512	Mistral AI	$0.20	$0.20	262K	16.0
Qwen: Qwen2.5 VL 32B Instruct	Qwen	$0.20	$0.60	128K	--
MiniMax: MiniMax-01	MiniMax	$0.20	$1.10	1.0M	--
NVIDIA: Nemotron Nano 12B 2 VL	NVIDIA	$0.20	$0.60	131K	10.1
DeepSeek: DeepSeek V3 0324	DeepSeek	$0.20	$0.77	164K	22.3
xAI: Grok Code Fast 1	xAI	$0.20	$1.50	256K	28.7
Kwaipilot: KAT-Coder-Pro V1	Kwaipilot	$0.21	$0.83	256K	--
DeepSeek: DeepSeek V3.1 Terminus	DeepSeek	$0.21	$0.79	164K	28.1
Qwen: Qwen3 Coder 480B A35B	Qwen	$0.22	$1.00	262K	--
ByteDance Seed: Seed-2.0-Lite	ByteDance Seed	$0.25	$2.00	262K	--
OpenAI: GPT-5.1-Codex-Mini	OpenAI	$0.25	$2.00	400K	--
Inception: Mercury Coder	Inception	$0.25	$0.75	128K	--
OpenAI: GPT-5 Mini	OpenAI	$0.25	$2.00	400K	20.7
Google: Gemini 3.1 Flash Lite Preview	Google	$0.25	$1.50	1.0M	33.5
ByteDance Seed: Seed 1.6	ByteDance Seed	$0.25	$2.00	262K	--
Anthropic: Claude 3 Haiku	Anthropic	$0.25	$1.25	200K	37.1
Inception: Mercury	Inception	$0.25	$0.75	128K	--
Inception: Mercury 2	Inception	$0.25	$0.75	128K	--
MiniMax: MiniMax M2	MiniMax	$0.26	$1.00	197K	49.6
Qwen: Qwen3.5-122B-A10B	Qwen	$0.26	$2.08	262K	--
Qwen: Qwen Plus 0728	Qwen	$0.26	$0.78	1M	--
Qwen: Qwen Plus 0728 (thinking)	Qwen	$0.26	$0.78	1M	--
Qwen: Qwen3 VL 235B A22B Thinking	Qwen	$0.26	$2.60	131K	--
Qwen: Qwen3.5 Plus 2026-02-15	Qwen	$0.26	$1.56	1M	--
DeepSeek: DeepSeek V3.2	DeepSeek	$0.26	$0.38	164K	41.7
Qwen: Qwen-Plus	Qwen	$0.26	$0.78	1M	--
DeepSeek: DeepSeek V3.2 Exp	DeepSeek	$0.27	$0.41	164K	--
MiniMax: MiniMax M2.1	MiniMax	$0.27	$0.95	197K	--
Nex AGI: DeepSeek V3.1 Nex N1	Nex Agi	$0.27	$1.00	131K	--
Baidu: ERNIE 4.5 300B A47B	Baidu	$0.28	$1.10	123K	--
DeepSeek: R1 Distill Qwen 32B	DeepSeek	$0.29	$0.29	33K	17.2
Nous: Hermes 3 70B Instruct	Nousresearch	$0.30	$0.30	131K	--
MiniMax: MiniMax M2.7	MiniMax	$0.30	$1.20	205K	36.1
xAI: Grok 3 Mini Beta	xAI	$0.30	$0.50	131K	--
Z.ai: GLM 4.6V	Z Ai	$0.30	$0.90	131K	--
Amazon: Nova 2 Lite	Amazon	$0.30	$2.50	1M	18.0
TNG: DeepSeek R1T2 Chimera	Tngtech	$0.30	$1.10	164K	--
xAI: Grok 3 Mini	xAI	$0.30	$0.50	131K	32.1
Google: Gemini 2.5 Flash	Google	$0.30	$2.50	1.0M	19.4
Google: Nano Banana (Gemini 2.5 Flash Image)	Google	$0.30	$2.50	33K	--
Mistral: Codestral 2508	Mistral AI	$0.30	$0.90	256K	--
MiniMax: MiniMax M2-her	MiniMax	$0.30	$1.20	66K	--
TheDrummer: Cydonia 24B V4.1	Thedrummer	$0.30	$0.50	131K	--
DeepSeek: DeepSeek V3	DeepSeek	$0.32	$0.89	164K	--
Z.ai: GLM 4.6	Z Ai	$0.39	$1.90	205K	--
Z.ai: GLM 4.7	Z Ai	$0.39	$1.75	203K	--
Qwen: Qwen3.5 397B A17B	Qwen	$0.39	$2.34	262K	--
OpenAI: GPT-4.1 Mini	OpenAI	$0.40	$1.60	1.0M	--
TheDrummer: UnslopNemo 12B	Thedrummer	$0.40	$0.40	33K	--
Meta: Llama 3.1 70B Instruct	Meta	$0.40	$0.40	131K	14.5
Mistral: Devstral 2 2512	Mistral AI	$0.40	$2.00	262K	22.0
Mistral: Mistral Medium 3.1	Mistral AI	$0.40	$2.00	131K	9.0
MoonshotAI: Kimi K2 0905	Moonshotai	$0.40	$2.00	131K	--
Mistral: Devstral Medium	Mistral AI	$0.40	$2.00	131K	18.7
Mistral: Mistral Medium 3	Mistral AI	$0.40	$2.00	131K	21.3
DeepSeek: DeepSeek V3.2 Speciale	DeepSeek	$0.40	$1.20	164K	41.7
Xiaomi: MiMo-V2-Omni	Xiaomi	$0.40	$2.00	262K	--
MiniMax: MiniMax M1	MiniMax	$0.40	$2.20	1M	20.9
Baidu: ERNIE 4.5 VL 424B A47B	Baidu	$0.42	$1.25	123K	--
ReMM SLERP 13B	Undi95	$0.45	$0.65	6K	--
DeepSeek: R1 0528	DeepSeek	$0.45	$2.15	164K	27.1
MoonshotAI: Kimi K2.5	Moonshotai	$0.45	$2.20	262K	--
Qwen: Qwen3 235B A22B	Qwen	$0.45	$1.82	131K	--
MoonshotAI: Kimi K2 Thinking	Moonshotai	$0.47	$2.00	131K	--
Google: Gemini 3 Flash Preview	Google	$0.50	$3.00	1.0M	35.0
Mistral: Mistral Large 3 2512	Mistral AI	$0.50	$1.50	262K	15.1
Arcee AI: Coder Large	Arcee AI	$0.50	$0.80	33K	--
OpenAI: GPT-3.5 Turbo	OpenAI	$0.50	$1.50	16K	--
Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview)	Google	$0.50	$3.00	66K	--
Meta: Llama 3 70B Instruct	Meta	$0.51	$0.74	8K	--
Qwen: Qwen VL Max	Qwen	$0.52	$2.08	131K	--
Mistral: Mixtral 8x7B Instruct	Mistral AI	$0.54	$0.54	33K	7.7
TheDrummer: Skyfall 36B V2	Thedrummer	$0.55	$0.80	33K	--
MoonshotAI: Kimi K2 0711	Moonshotai	$0.55	$2.20	131K	--
Writer: Palmyra X5	Writer	$0.60	$6.00	1.0M	--
NVIDIA: Llama 3.1 Nemotron Ultra 253B v1	NVIDIA	$0.60	$1.80	131K	14.4
Z.ai: GLM 4.5V	Z Ai	$0.60	$1.80	66K	--
Z.ai: GLM 4.5	Z Ai	$0.60	$2.20	131K	--
OpenAI: GPT Audio Mini	OpenAI	$0.60	$2.40	128K	--
WizardLM-2 8x22B	Microsoft	$0.62	$0.62	66K	--
Sao10K: Llama 3.3 Euryale 70B	Sao10k	$0.65	$0.75	131K	--
Google: Gemma 2 27B	Google	$0.65	$0.65	8K	--
Qwen: Qwen3 Coder Plus	Qwen	$0.65	$3.25	1M	--
Qwen2.5 Coder 32B Instruct	Qwen	$0.66	$1.00	33K	--
AionLabs: Aion-1.0-Mini	Aion Labs	$0.70	$1.40	131K	--
DeepSeek: R1 Distill Llama 70B	DeepSeek	$0.70	$0.80	131K	16.0
DeepSeek: R1	DeepSeek	$0.70	$2.50	64K	16.0
Z.ai: GLM 5	Z Ai	$0.72	$2.30	80K	--
Mancer: Weaver (alpha)	Mancer	$0.75	$1.00	8K	--
OpenAI: GPT-5.4 Mini	OpenAI	$0.75	$4.50	400K	44.6
Arcee AI: Virtuoso Large	Arcee AI	$0.75	$1.20	131K	--
Qwen: Qwen3 Max Thinking	Qwen	$0.78	$3.90	262K	--
Qwen: Qwen3 Max	Qwen	$0.78	$3.90	262K	--
AionLabs: Aion-RP 1.0 (8B)	Aion Labs	$0.80	$1.60	33K	--
Morph: Morph V3 Fast	Morph	$0.80	$1.20	82K	--
Amazon: Nova Pro 1.0	Amazon	$0.80	$3.20	300K	35.7
AlfredPros: CodeLLaMa 7B Instruct Solidity	AlfredPros	$0.80	$1.20	4K	--
AionLabs: Aion-2.0	Aion Labs	$0.80	$1.60	131K	--
Anthropic: Claude 3.5 Haiku	Anthropic	$0.80	$4.00	200K	18.7
EleutherAI: Llemma 7b	EleutherAI	$0.80	$1.20	4K	--
Qwen: Qwen2.5 VL 72B Instruct	Qwen	$0.80	$0.80	33K	--
Sao10K: Llama 3.1 Euryale 70B v2.2	Sao10k	$0.85	$0.85	131K	--
Relace: Relace Apply 3	Relace	$0.85	$1.25	256K	--
Switchpoint Router	Switchpoint	$0.85	$3.40	131K	--
Morph: Morph V3 Large	Morph	$0.90	$1.90	262K	--
Arcee AI: Maestro Reasoning	Arcee AI	$0.90	$3.30	131K	--
Anthropic: Claude Haiku 4.5	Anthropic	$1.00	$5.00	200K	37.1
Nous: Hermes 3 405B Instruct	Nousresearch	$1.00	$1.00	131K	--
Relace: Relace Search	Relace	$1.00	$3.00	256K	--
Nous: Hermes 4 405B	Nousresearch	$1.00	$3.00	131K	--
Xiaomi: MiMo-V2-Pro	Xiaomi	$1.00	$3.00	1.0M	--
Perplexity: Sonar	Perplexity	$1.00	$1.00	127K	--
OpenAI: GPT-3.5 Turbo (older v0613)	OpenAI	$1.00	$2.00	4K	9.0
Qwen: Qwen-Max	Qwen	$1.04	$4.16	33K	--
OpenAI: o4 Mini	OpenAI	$1.10	$4.40	200K	--
OpenAI: o3 Mini High	OpenAI	$1.10	$4.40	200K	--
OpenAI: o4 Mini High	OpenAI	$1.10	$4.40	200K	--
OpenAI: o3 Mini	OpenAI	$1.10	$4.40	200K	--
Z.ai: GLM 5 Turbo	Z Ai	$1.20	$4.00	203K	--
NVIDIA: Llama 3.1 Nemotron 70B Instruct	NVIDIA	$1.20	$1.20	131K	13.4
Google: Gemini 2.5 Pro Preview 06-05	Google	$1.25	$10	1.0M	34.6
OpenAI: GPT-5.1	OpenAI	$1.25	$10	400K	--
Google: Gemini 2.5 Pro Preview 05-06	Google	$1.25	$10	1.0M	--
Google: Gemini 2.5 Pro	Google	$1.25	$10	1.0M	34.6
OpenAI: GPT-5 Codex	OpenAI	$1.25	$10	400K	44.6
OpenAI: GPT-5 Chat	OpenAI	$1.25	$10	128K	21.8
OpenAI: GPT-5	OpenAI	$1.25	$10	400K	44.4
OpenAI: GPT-5.1-Codex-Max	OpenAI	$1.25	$10	400K	--
OpenAI: GPT-5.1-Codex	OpenAI	$1.25	$10	400K	--
OpenAI: GPT-5.1 Chat	OpenAI	$1.25	$10	128K	--
Deep Cogito: Cogito v2.1 671B	DeepCogito	$1.25	$1.25	128K	--
Sao10k: Llama 3 Euryale 70B v2.1	Sao10k	$1.48	$1.48	8K	--
OpenAI: GPT-3.5 Turbo Instruct	OpenAI	$1.50	$2.00	4K	9.0
OpenAI: GPT-5.3 Chat	OpenAI	$1.75	$14	128K	--
OpenAI: GPT-5.2-Codex	OpenAI	$1.75	$14	400K	--
OpenAI: GPT-5.2	OpenAI	$1.75	$14	400K	--
OpenAI: GPT-5.3-Codex	OpenAI	$1.75	$14	400K	--
OpenAI: GPT-5.2 Chat	OpenAI	$1.75	$14	128K	--
OpenAI: o4 Mini Deep Research	OpenAI	$2.00	$8.00	200K	33.1
Perplexity: Sonar Reasoning Pro	Perplexity	$2.00	$8.00	128K	17.9
Perplexity: Sonar Deep Research	Perplexity	$2.00	$8.00	128K	--
OpenAI: GPT-4.1	OpenAI	$2.00	$8.00	1.0M	--
OpenAI: o3	OpenAI	$2.00	$8.00	200K	25.9
Mistral: Pixtral Large 2411	Mistral AI	$2.00	$6.00	131K	14.0
Mistral Large 2411	Mistral AI	$2.00	$6.00	131K	--
AI21: Jamba Large 1.7	AI21 Labs	$2.00	$8.00	256K	10.9
Mistral: Mixtral 8x22B Instruct	Mistral AI	$2.00	$6.00	66K	9.8
xAI: Grok 4.20 Multi-Agent Beta	xAI	$2.00	$6.00	2M	--
Google: Gemini 3.1 Pro Preview Custom Tools	Google	$2.00	$12	1.0M	57.2
Google: Gemini 3 Pro Preview	Google	$2.00	$12	1.0M	41.3
Mistral Large	Mistral AI	$2.00	$6.00	128K	22.8
Google: Nano Banana Pro (Gemini 3 Pro Image Preview)	Google	$2.00	$12	66K	48.4
Mistral Large 2407	Mistral AI	$2.00	$6.00	131K	13.0
Google: Gemini 3.1 Pro Preview	Google	$2.00	$12	1.0M	57.2
xAI: Grok 4.20 Beta	xAI	$2.00	$6.00	2M	11.7
Amazon: Nova Premier 1.0	Amazon	$2.50	$13	1M	19.0
OpenAI: GPT-4o Audio	OpenAI	$2.50	$10	128K	17.3
Cohere: Command A	Cohere	$2.50	$10	256K	13.5
OpenAI: GPT-5.4	OpenAI	$2.50	$15	1.1M	--
OpenAI: GPT-5 Image Mini	OpenAI	$2.50	$2.00	400K	--
OpenAI: GPT-4o (2024-11-20)	OpenAI	$2.50	$10	128K	--
OpenAI: GPT-4o	OpenAI	$2.50	$10	128K	--
OpenAI: GPT Audio	OpenAI	$2.50	$10	128K	--
OpenAI: GPT-4o Search Preview	OpenAI	$2.50	$10	128K	--
Inflection: Inflection 3 Pi	Inflection	$2.50	$10	8K	--
Inflection: Inflection 3 Productivity	Inflection	$2.50	$10	8K	--
Cohere: Command R+ (08-2024)	Cohere	$2.50	$10	128K	8.3
OpenAI: GPT-4o (2024-08-06)	OpenAI	$2.50	$10	128K	--
Anthropic: Claude 3.7 Sonnet (thinking)	Anthropic	$3.00	$15	200K	34.7
Anthropic: Claude Sonnet 4.5	Anthropic	$3.00	$15	1M	--
OpenAI: GPT-3.5 Turbo 16k	OpenAI	$3.00	$4.00	16K	9.0
xAI: Grok 4	xAI	$3.00	$15	256K	29.7
Magnum v4 72B	Anthracite	$3.00	$5.00	16K	--
Perplexity: Sonar Pro	Perplexity	$3.00	$15	200K	--
Anthropic: Claude 3.7 Sonnet	Anthropic	$3.00	$15	200K	34.7
Anthropic: Claude Sonnet 4	Anthropic	$3.00	$15	200K	44.4
xAI: Grok 3	xAI	$3.00	$15	131K	21.6
Anthropic: Claude Sonnet 4.6	Anthropic	$3.00	$15	1M	10.3
Sao10K: Llama 3.1 70B Hanami x1	Sao10k	$3.00	$3.00	16K	--
xAI: Grok 3 Beta	xAI	$3.00	$15	131K	--
Perplexity: Sonar Pro Search	Perplexity	$3.00	$15	200K	15.5
Goliath 120B	Alpindale	$3.75	$7.50	6K	--
Meta: Llama 3.1 405B (base)	Meta	$4.00	$4.00	33K	--
AionLabs: Aion-1.0	Aion Labs	$4.00	$8.00	131K	--
Anthropic: Claude Opus 4.5	Anthropic	$5.00	$25	200K	18.0
OpenAI: GPT-4o (2024-05-13)	OpenAI	$5.00	$15	128K	--
Anthropic: Claude Opus 4.6	Anthropic	$5.00	$25	1M	18.0
OpenAI: GPT-4o (extended)	OpenAI	$6.00	$18	128K	--
Anthropic: Claude 3.5 Sonnet	Anthropic	$6.00	$30	200K	15.9
OpenAI: GPT-5 Image	OpenAI	$10	$10	400K	--
OpenAI: GPT-4 Turbo (older v1106)	OpenAI	$10	$30	128K	--
OpenAI: GPT-4 Turbo Preview	OpenAI	$10	$30	128K	--
OpenAI: GPT-4 Turbo	OpenAI	$10	$30	128K	--
OpenAI: o3 Deep Research	OpenAI	$10	$40	200K	38.4
OpenAI: GPT-5 Pro	OpenAI	$15	$120	400K	--
Anthropic: Claude Opus 4	Anthropic	$15	$75	200K	46.5
Anthropic: Claude Opus 4.1	Anthropic	$15	$75	200K	18.0
OpenAI: o1	OpenAI	$15	$60	200K	23.7
OpenAI: o3 Pro	OpenAI	$20	$80	200K	38.4
OpenAI: GPT-5.2 Pro	OpenAI	$21	$168	400K	--
OpenAI: GPT-4 (older v0314)	OpenAI	$30	$60	8K	--
OpenAI: GPT-4	OpenAI	$30	$60	8K	18.6
OpenAI: GPT-5.4 Pro	OpenAI	$30	$180	1.1M	--
OpenAI: o1-pro	OpenAI	$150	$600	200K	30.8

Frequently Asked Questions

How is AI model pricing calculated?

AI models charge per token, where a token is roughly 3/4 of a word. Prices are quoted per million tokens. Most models charge separately for input (prompt) tokens and output (completion) tokens, with output typically costing 2-4x more than input.

Why do output tokens cost more than input tokens?

Output tokens require the model to generate new text, which is computationally more expensive than reading input. The model must perform inference for each output token sequentially, while input tokens can be processed in parallel.

What is the cheapest good AI model?

Among paid models with benchmark data, LiquidAI: LFM2-2.6B offers some of the lowest pricing at $0.01/1M input tokens. There are also many free models available.

How much does it cost to use AI models per month?

For a typical individual workload of 100 requests per day at 2,000 tokens each, monthly costs range from $0 (free models) to $50+ (premium models). Enterprise usage at higher volumes can cost significantly more. Use the cost examples on this page to estimate your specific usage.

Are free AI models any good?

Yes, there are 27 free models available, and some rank well on intelligence benchmarks. Free models are a great way to start, though premium models generally offer better quality for demanding tasks.

Browse by use case:Best for Coding·Best for Writing·Cheapest Models·Free Models·Fastest Models