AI Model Pricing Comparison 2026

Compare pricing for 349+ AI models side by side. From free open-source models to premium enterprise options, find the right price-performance balance for your use case.

Price Tiers

Understanding AI Pricing

AI model pricing is based on tokens, which are the fundamental units of text that language models process. One token is roughly 3/4 of an English word, so 1 million tokens equals approximately 750,000 words or about 1,500 pages of text.

Most providers charge separately for input tokens (the text you send to the model) and output tokens (the text the model generates). Output tokens typically cost 2-4x more than input tokens because generating new text requires more computational work than reading existing text. Each output token must be produced sequentially through a full forward pass of the neural network.

To estimate your monthly cost, multiply your average daily requests by 30, then by your average tokens per request. For example, 100 requests per day at 2,000 tokens each equals 6 million tokens per month. At $3/1M tokens, that would be $18/month. Some providers also offer cached token pricing at a discount for repeated prompts, and batch processing at lower rates for non-real-time workloads.

Free models are a great starting point for experimentation and low-volume use cases. As your needs grow, budget and mid-range models offer an excellent balance of quality and cost. Premium and enterprise models deliver the highest benchmark scores for tasks demanding maximum accuracy, like complex coding or research analysis.

Monthly Cost Example

100 requests/day at 2,000 tokens each (input + output)

ModelProviderInput/1MOutput/1MEst. Monthly
LiquidAI: LFM2-2.6BLiquid AI$0.01$0.02$0.18/mo
LiquidAI: LFM2-8B-A1BLiquid AI$0.01$0.02$0.18/mo
IBM: Granite 4.0 MicroIbm Granite$0.02$0.11$0.76/mo
Mistral: Mistral NemoMistral AI$0.02$0.04$0.36/mo
Llama Guard 3 8BMeta$0.02$0.06$0.48/mo

All Model Pricing

ModelProviderInput/1MOutput/1MContextIntelligence
Body Builder (beta)OpenRouter<$0.01<$0.01128K--
Auto RouterOpenRouter<$0.01<$0.012M--
StepFun: Step 3.5 Flash (free)FreeStepFunFreeFree256K--
Google: Gemma 3 4B (free)FreeGoogleFreeFree33K--
Google: Gemma 3 27B (free)FreeGoogleFreeFree131K--
NVIDIA: Nemotron 3 Super (free)FreeNVIDIAFreeFree262K--
NVIDIA: Nemotron Nano 12B 2 VL (free)FreeNVIDIAFreeFree128K--
Google: Gemma 3n 2B (free)FreeGoogleFreeFree8K4.8
Venice: Uncensored (free)FreeCognitive ComputationsFreeFree33K--
Free Models RouterFreeOpenRouterFreeFree200K--
Z.ai: GLM 4.5 Air (free)FreeZ AiFreeFree131K--
Arcee AI: Trinity Large Preview (free)FreeArcee AIFreeFree131K--
Qwen: Qwen3 Coder 480B A35B (free)FreeQwenFreeFree262K--
NVIDIA: Nemotron 3 Nano 30B A3B (free)FreeNVIDIAFreeFree256K--
Qwen: Qwen3 4B (free)FreeQwenFreeFree41K--
Qwen: Qwen3 Next 80B A3B Instruct (free)FreeQwenFreeFree262K--
NVIDIA: Nemotron Nano 9B V2 (free)FreeNVIDIAFreeFree128K--
Arcee AI: Trinity Mini (free)FreeArcee AIFreeFree131K--
Google: Gemma 3n 4B (free)FreeGoogleFreeFree8K--
Mistral: Mistral Small 3.1 24B (free)FreeMistral AIFreeFree128K--
Nous: Hermes 3 405B Instruct (free)FreeNousresearchFreeFree131K--
Meta: Llama 3.3 70B Instruct (free)FreeMetaFreeFree66K--
LiquidAI: LFM2.5-1.2B-Instruct (free)FreeLiquid AIFreeFree33K--
LiquidAI: LFM2.5-1.2B-Thinking (free)FreeLiquid AIFreeFree33K--
OpenAI: gpt-oss-120b (free)FreeOpenAIFreeFree131K--
OpenAI: gpt-oss-20b (free)FreeOpenAIFreeFree131K--
Google: Gemma 3 12B (free)FreeGoogleFreeFree33K--
Meta: Llama 3.2 3B Instruct (free)FreeMetaFreeFree131K--
MiniMax: MiniMax M2.5 (free)FreeMiniMaxFreeFree197K--
LiquidAI: LFM2-2.6BLiquid AI$0.01$0.0233K--
LiquidAI: LFM2-8B-A1BLiquid AI$0.01$0.0233K--
IBM: Granite 4.0 MicroIbm Granite$0.02$0.11131K--
Mistral: Mistral NemoMistral AI$0.02$0.04131K--
Llama Guard 3 8BMeta$0.02$0.06131K--
Google: Gemma 3n 4BGoogle$0.02$0.0433K6.4
Meta: Llama 3.1 8B InstructMeta$0.02$0.0516K11.8
Meta: Llama 3.2 1B InstructMeta$0.03$0.2060K8.7
Qwen: Qwen2.5 Coder 7B InstructQwen$0.03$0.0933K--
LiquidAI: LFM2-24B-A2BLiquid AI$0.03$0.1233K--
Google: Gemma 2 9BGoogle$0.03$0.098K--
Mistral: Mistral Small 3.1 24BMistral AI$0.03$0.11131K12.7
Meta: Llama 3 8B InstructMeta$0.03$0.048K11.8
OpenAI: gpt-oss-20bOpenAI$0.03$0.11131K24.5
Qwen: Qwen-TurboQwen$0.03$0.13131K--
Amazon: Nova Micro 1.0Amazon$0.04$0.14128K10.3
Cohere: Command R7B (12-2024)Cohere$0.04$0.15128K--
OpenAI: gpt-oss-120bOpenAI$0.04$0.19131K33.3
Sao10K: Llama 3 8B LunarisSao10k$0.04$0.058K--
NVIDIA: Nemotron Nano 9B V2NVIDIA$0.04$0.16131K14.8
Google: Gemma 3 12BGoogle$0.04$0.13131K8.8
Google: Gemma 3 4BGoogle$0.04$0.08131K6.3
Qwen: Qwen2.5 7B InstructQwen$0.04$0.1033K--
Arcee AI: Trinity MiniArcee AI$0.04$0.15131K--
Meta: Llama 3.2 11B Vision InstructMeta$0.05$0.05131K8.7
OpenAI: GPT-5 NanoOpenAI$0.05$0.40400K13.8
AllenAI: Olmo 2 32B InstructAllen AI$0.05$0.20128K--
Mistral: Mistral Small 3Mistral AI$0.05$0.0833K15.1
Qwen: Qwen3.5-9BQwen$0.05$0.15256K--
NVIDIA: Nemotron 3 Nano 30B A3BNVIDIA$0.05$0.20262K24.3
Qwen: Qwen3 8BQwen$0.05$0.4041K--
Meta: Llama 3.2 3B InstructMeta$0.05$0.3480K9.7
Qwen: Qwen3 14BQwen$0.06$0.2441K--
Z.ai: GLM 4.7 FlashZ Ai$0.06$0.40203K--
Amazon: Nova Lite 1.0Amazon$0.06$0.24300K12.7
MythoMax 13BGryphe$0.06$0.064K--
Microsoft: Phi 4Microsoft$0.07$0.1416K--
Qwen: Qwen3.5-FlashQwen$0.07$0.261M--
Qwen: Qwen3 Coder 30B A3B InstructQwen$0.07$0.27160K--
Baidu: ERNIE 4.5 21B A3B ThinkingBaidu$0.07$0.28131K--
Baidu: ERNIE 4.5 21B A3BBaidu$0.07$0.28120K--
Qwen: Qwen3 235B A22B Instruct 2507Qwen$0.07$0.10262K--
Mistral: Mistral Small 3.2 24BMistral AI$0.07$0.20128K12.7
OpenAI: gpt-oss-safeguard-20bOpenAI$0.07$0.30131K--
Google: Gemini 2.0 Flash LiteGoogle$0.07$0.301.0M16.8
ByteDance Seed: Seed 1.6 FlashByteDance Seed$0.07$0.30262K--
Meta: Llama 4 ScoutMeta$0.08$0.30328K13.5
Qwen: Qwen3 32BQwen$0.08$0.2441K--
Qwen: Qwen3 VL 8B InstructQwen$0.08$0.50131K--
Google: Gemma 3 27BGoogle$0.08$0.16131K10.3
Qwen: Qwen3 30B A3BQwen$0.08$0.2841K--
Qwen: Qwen3 30B A3B Thinking 2507Qwen$0.08$0.40131K--
Xiaomi: MiMo-V2-FlashXiaomi$0.09$0.29262K--
Qwen: Qwen3 30B A3B Instruct 2507Qwen$0.09$0.30262K--
Qwen: Qwen3 Next 80B A3B InstructQwen$0.09$1.10262K--
Tongyi DeepResearch 30B A3BAlibaba$0.09$0.45131K15.3
Qwen: Qwen3 Next 80B A3B ThinkingQwen$0.10$0.78131K--
Mistral: Devstral Small 1.1Mistral AI$0.10$0.30131K19.5
Mistral: Ministral 3 3B 2512Mistral AI$0.10$0.10131K11.2
Meta: Llama 3.3 70B InstructMeta$0.10$0.32131K14.5
Mistral: Voxtral Small 24B 2507Mistral AI$0.10$0.3032K--
NVIDIA: Nemotron 3 SuperNVIDIA$0.10$0.50262K36.0
Mistral: Mistral Small CreativeMistral AI$0.10$0.3033K10.2
Google: Gemini 2.5 Flash Lite Preview 09-2025Google$0.10$0.401.0M33.5
Z.ai: GLM 4 32B Z Ai$0.10$0.10128K--
ByteDance: UI-TARS 7B ByteDance$0.10$0.20128K--
Google: Gemini 2.5 Flash LiteGoogle$0.10$0.401.0M--
Mistral: Pixtral 12BMistral AI$0.10$0.1033K--
StepFun: Step 3.5 FlashStepFun$0.10$0.30256K37.8
ByteDance Seed: Seed-2.0-MiniByteDance Seed$0.10$0.40262K--
NVIDIA: Llama 3.3 Nemotron Super 49B V1.5NVIDIA$0.10$0.40131K18.5
Google: Gemini 2.0 FlashGoogle$0.10$0.401.0M14.5
OpenAI: GPT-4.1 NanoOpenAI$0.10$0.401.0M--
Qwen: Qwen3 VL 32B InstructQwen$0.10$0.42131K--
Mistral: Mistral 7B Instruct v0.1Mistral AI$0.11$0.193K7.4
Qwen: Qwen3 VL 8B ThinkingQwen$0.12$1.36131K--
Qwen: Qwen3 Coder NextQwen$0.12$0.75262K--
Qwen2.5 72B InstructQwen$0.12$0.3933K--
Z.ai: GLM 4.5 AirZ Ai$0.13$0.85131K--
Qwen: Qwen3 VL 30B A3B InstructQwen$0.13$0.52131K--
Nous: Hermes 4 70BNousresearch$0.13$0.40131K--
Qwen: Qwen3 VL 30B A3B ThinkingQwen$0.13$1.56131K--
Qwen: Qwen VL PlusQwen$0.14$0.41131K--
Baidu: ERNIE 4.5 VL 28B A3BBaidu$0.14$0.5630K--
Tencent: Hunyuan A13B InstructTencent$0.14$0.57131K--
NousResearch: Hermes 2 Pro - Llama-3 8BNousresearch$0.14$0.148K--
Qwen: Qwen3 235B A22B Thinking 2507Qwen$0.15$1.50131K--
OpenAI: GPT-4o-mini Search PreviewOpenAI$0.15$0.60128K12.6
AllenAI: Olmo 3.1 32B ThinkAllen AI$0.15$0.5066K--
Cohere: Command R (08-2024)Cohere$0.15$0.60128K--
AllenAI: Olmo 3 32B ThinkAllen AI$0.15$0.5066K--
Upstage: Solar Pro 3Upstage$0.15$0.60128K--
Qwen: QwQ 32BQwen$0.15$0.58131K--
EssentialAI: Rnj 1 InstructEssential AI$0.15$0.1533K--
OpenAI: GPT-4o-miniOpenAI$0.15$0.60128K--
Mistral: Ministral 3 8B 2512Mistral AI$0.15$0.15262K14.8
Mistral: Mistral Small 4Mistral AI$0.15$0.60262K26.9
OpenAI: GPT-4o-mini (2024-07-18)OpenAI$0.15$0.60128K--
Meta: Llama 4 MaverickMeta$0.15$0.601.0M18.4
DeepSeek: DeepSeek V3.1DeepSeek$0.15$0.7533K--
Qwen: Qwen3.5-35B-A3BQwen$0.16$1.30262K--
TheDrummer: Rocinante 12BThedrummer$0.17$0.4333K--
Arcee AI: SpotlightArcee AI$0.18$0.18131K--
Meta: Llama Guard 4 12BMeta$0.18$0.18164K--
Qwen: Qwen3 Coder FlashQwen$0.20$0.971M--
Qwen: Qwen3.5-27BQwen$0.20$1.56262K--
Mistral: SabaMistral AI$0.20$0.6033K12.1
Qwen: Qwen3 VL 235B A22B InstructQwen$0.20$0.88262K--
Meituan: LongCat Flash ChatMeituan$0.20$0.80131K--
MiniMax: MiniMax M2.5MiniMax$0.20$1.17197K36.1
xAI: Grok 4.1 FastxAI$0.20$0.502M--
AllenAI: Olmo 3.1 32B InstructAllen AI$0.20$0.6066K--
Qwen: Qwen2.5-VL 7B InstructQwen$0.20$0.2033K--
Prime Intellect: INTELLECT-3Prime Intellect$0.20$1.10131K--
OpenAI: GPT-5.4 NanoOpenAI$0.20$1.25400K44.6
xAI: Grok 4 FastxAI$0.20$0.502M35.1
Mistral: Ministral 3 14B 2512Mistral AI$0.20$0.20262K16.0
Qwen: Qwen2.5 VL 32B InstructQwen$0.20$0.60128K--
MiniMax: MiniMax-01MiniMax$0.20$1.101.0M--
NVIDIA: Nemotron Nano 12B 2 VLNVIDIA$0.20$0.60131K10.1
DeepSeek: DeepSeek V3 0324DeepSeek$0.20$0.77164K22.3
xAI: Grok Code Fast 1xAI$0.20$1.50256K28.7
Kwaipilot: KAT-Coder-Pro V1Kwaipilot$0.21$0.83256K--
DeepSeek: DeepSeek V3.1 TerminusDeepSeek$0.21$0.79164K28.1
Qwen: Qwen3 Coder 480B A35BQwen$0.22$1.00262K--
ByteDance Seed: Seed-2.0-LiteByteDance Seed$0.25$2.00262K--
OpenAI: GPT-5.1-Codex-MiniOpenAI$0.25$2.00400K--
Inception: Mercury CoderInception$0.25$0.75128K--
OpenAI: GPT-5 MiniOpenAI$0.25$2.00400K20.7
Google: Gemini 3.1 Flash Lite PreviewGoogle$0.25$1.501.0M33.5
ByteDance Seed: Seed 1.6ByteDance Seed$0.25$2.00262K--
Anthropic: Claude 3 HaikuAnthropic$0.25$1.25200K37.1
Inception: MercuryInception$0.25$0.75128K--
Inception: Mercury 2Inception$0.25$0.75128K--
MiniMax: MiniMax M2MiniMax$0.26$1.00197K49.6
Qwen: Qwen3.5-122B-A10BQwen$0.26$2.08262K--
Qwen: Qwen Plus 0728Qwen$0.26$0.781M--
Qwen: Qwen Plus 0728 (thinking)Qwen$0.26$0.781M--
Qwen: Qwen3 VL 235B A22B ThinkingQwen$0.26$2.60131K--
Qwen: Qwen3.5 Plus 2026-02-15Qwen$0.26$1.561M--
DeepSeek: DeepSeek V3.2DeepSeek$0.26$0.38164K41.7
Qwen: Qwen-PlusQwen$0.26$0.781M--
DeepSeek: DeepSeek V3.2 ExpDeepSeek$0.27$0.41164K--
MiniMax: MiniMax M2.1MiniMax$0.27$0.95197K--
Nex AGI: DeepSeek V3.1 Nex N1Nex Agi$0.27$1.00131K--
Baidu: ERNIE 4.5 300B A47B Baidu$0.28$1.10123K--
DeepSeek: R1 Distill Qwen 32BDeepSeek$0.29$0.2933K17.2
Nous: Hermes 3 70B InstructNousresearch$0.30$0.30131K--
MiniMax: MiniMax M2.7MiniMax$0.30$1.20205K36.1
xAI: Grok 3 Mini BetaxAI$0.30$0.50131K--
Z.ai: GLM 4.6VZ Ai$0.30$0.90131K--
Amazon: Nova 2 LiteAmazon$0.30$2.501M18.0
TNG: DeepSeek R1T2 ChimeraTngtech$0.30$1.10164K--
xAI: Grok 3 MinixAI$0.30$0.50131K32.1
Google: Gemini 2.5 FlashGoogle$0.30$2.501.0M19.4
Google: Nano Banana (Gemini 2.5 Flash Image)Google$0.30$2.5033K--
Mistral: Codestral 2508Mistral AI$0.30$0.90256K--
MiniMax: MiniMax M2-herMiniMax$0.30$1.2066K--
TheDrummer: Cydonia 24B V4.1Thedrummer$0.30$0.50131K--
DeepSeek: DeepSeek V3DeepSeek$0.32$0.89164K--
Z.ai: GLM 4.6Z Ai$0.39$1.90205K--
Z.ai: GLM 4.7Z Ai$0.39$1.75203K--
Qwen: Qwen3.5 397B A17BQwen$0.39$2.34262K--
OpenAI: GPT-4.1 MiniOpenAI$0.40$1.601.0M--
TheDrummer: UnslopNemo 12BThedrummer$0.40$0.4033K--
Meta: Llama 3.1 70B InstructMeta$0.40$0.40131K14.5
Mistral: Devstral 2 2512Mistral AI$0.40$2.00262K22.0
Mistral: Mistral Medium 3.1Mistral AI$0.40$2.00131K9.0
MoonshotAI: Kimi K2 0905Moonshotai$0.40$2.00131K--
Mistral: Devstral MediumMistral AI$0.40$2.00131K18.7
Mistral: Mistral Medium 3Mistral AI$0.40$2.00131K21.3
DeepSeek: DeepSeek V3.2 SpecialeDeepSeek$0.40$1.20164K41.7
Xiaomi: MiMo-V2-OmniXiaomi$0.40$2.00262K--
MiniMax: MiniMax M1MiniMax$0.40$2.201M20.9
Baidu: ERNIE 4.5 VL 424B A47B Baidu$0.42$1.25123K--
ReMM SLERP 13BUndi95$0.45$0.656K--
DeepSeek: R1 0528DeepSeek$0.45$2.15164K27.1
MoonshotAI: Kimi K2.5Moonshotai$0.45$2.20262K--
Qwen: Qwen3 235B A22BQwen$0.45$1.82131K--
MoonshotAI: Kimi K2 ThinkingMoonshotai$0.47$2.00131K--
Google: Gemini 3 Flash PreviewGoogle$0.50$3.001.0M35.0
Mistral: Mistral Large 3 2512Mistral AI$0.50$1.50262K15.1
Arcee AI: Coder LargeArcee AI$0.50$0.8033K--
OpenAI: GPT-3.5 TurboOpenAI$0.50$1.5016K--
Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview)Google$0.50$3.0066K--
Meta: Llama 3 70B InstructMeta$0.51$0.748K--
Qwen: Qwen VL MaxQwen$0.52$2.08131K--
Mistral: Mixtral 8x7B InstructMistral AI$0.54$0.5433K7.7
TheDrummer: Skyfall 36B V2Thedrummer$0.55$0.8033K--
MoonshotAI: Kimi K2 0711Moonshotai$0.55$2.20131K--
Writer: Palmyra X5Writer$0.60$6.001.0M--
NVIDIA: Llama 3.1 Nemotron Ultra 253B v1NVIDIA$0.60$1.80131K14.4
Z.ai: GLM 4.5VZ Ai$0.60$1.8066K--
Z.ai: GLM 4.5Z Ai$0.60$2.20131K--
OpenAI: GPT Audio MiniOpenAI$0.60$2.40128K--
WizardLM-2 8x22BMicrosoft$0.62$0.6266K--
Sao10K: Llama 3.3 Euryale 70BSao10k$0.65$0.75131K--
Google: Gemma 2 27BGoogle$0.65$0.658K--
Qwen: Qwen3 Coder PlusQwen$0.65$3.251M--
Qwen2.5 Coder 32B InstructQwen$0.66$1.0033K--
AionLabs: Aion-1.0-MiniAion Labs$0.70$1.40131K--
DeepSeek: R1 Distill Llama 70BDeepSeek$0.70$0.80131K16.0
DeepSeek: R1DeepSeek$0.70$2.5064K16.0
Z.ai: GLM 5Z Ai$0.72$2.3080K--
Mancer: Weaver (alpha)Mancer$0.75$1.008K--
OpenAI: GPT-5.4 MiniOpenAI$0.75$4.50400K44.6
Arcee AI: Virtuoso LargeArcee AI$0.75$1.20131K--
Qwen: Qwen3 Max ThinkingQwen$0.78$3.90262K--
Qwen: Qwen3 MaxQwen$0.78$3.90262K--
AionLabs: Aion-RP 1.0 (8B)Aion Labs$0.80$1.6033K--
Morph: Morph V3 FastMorph$0.80$1.2082K--
Amazon: Nova Pro 1.0Amazon$0.80$3.20300K35.7
AlfredPros: CodeLLaMa 7B Instruct SolidityAlfredPros$0.80$1.204K--
AionLabs: Aion-2.0Aion Labs$0.80$1.60131K--
Anthropic: Claude 3.5 HaikuAnthropic$0.80$4.00200K18.7
EleutherAI: Llemma 7bEleutherAI$0.80$1.204K--
Qwen: Qwen2.5 VL 72B InstructQwen$0.80$0.8033K--
Sao10K: Llama 3.1 Euryale 70B v2.2Sao10k$0.85$0.85131K--
Relace: Relace Apply 3Relace$0.85$1.25256K--
Switchpoint RouterSwitchpoint$0.85$3.40131K--
Morph: Morph V3 LargeMorph$0.90$1.90262K--
Arcee AI: Maestro ReasoningArcee AI$0.90$3.30131K--
Anthropic: Claude Haiku 4.5Anthropic$1.00$5.00200K37.1
Nous: Hermes 3 405B InstructNousresearch$1.00$1.00131K--
Relace: Relace SearchRelace$1.00$3.00256K--
Nous: Hermes 4 405BNousresearch$1.00$3.00131K--
Xiaomi: MiMo-V2-ProXiaomi$1.00$3.001.0M--
Perplexity: SonarPerplexity$1.00$1.00127K--
OpenAI: GPT-3.5 Turbo (older v0613)OpenAI$1.00$2.004K9.0
Qwen: Qwen-Max Qwen$1.04$4.1633K--
OpenAI: o4 MiniOpenAI$1.10$4.40200K--
OpenAI: o3 Mini HighOpenAI$1.10$4.40200K--
OpenAI: o4 Mini HighOpenAI$1.10$4.40200K--
OpenAI: o3 MiniOpenAI$1.10$4.40200K--
Z.ai: GLM 5 TurboZ Ai$1.20$4.00203K--
NVIDIA: Llama 3.1 Nemotron 70B InstructNVIDIA$1.20$1.20131K13.4
Google: Gemini 2.5 Pro Preview 06-05Google$1.25$101.0M34.6
OpenAI: GPT-5.1OpenAI$1.25$10400K--
Google: Gemini 2.5 Pro Preview 05-06Google$1.25$101.0M--
Google: Gemini 2.5 ProGoogle$1.25$101.0M34.6
OpenAI: GPT-5 CodexOpenAI$1.25$10400K44.6
OpenAI: GPT-5 ChatOpenAI$1.25$10128K21.8
OpenAI: GPT-5OpenAI$1.25$10400K44.4
OpenAI: GPT-5.1-Codex-MaxOpenAI$1.25$10400K--
OpenAI: GPT-5.1-CodexOpenAI$1.25$10400K--
OpenAI: GPT-5.1 ChatOpenAI$1.25$10128K--
Deep Cogito: Cogito v2.1 671BDeepCogito$1.25$1.25128K--
Sao10k: Llama 3 Euryale 70B v2.1Sao10k$1.48$1.488K--
OpenAI: GPT-3.5 Turbo InstructOpenAI$1.50$2.004K9.0
OpenAI: GPT-5.3 ChatOpenAI$1.75$14128K--
OpenAI: GPT-5.2-CodexOpenAI$1.75$14400K--
OpenAI: GPT-5.2OpenAI$1.75$14400K--
OpenAI: GPT-5.3-CodexOpenAI$1.75$14400K--
OpenAI: GPT-5.2 ChatOpenAI$1.75$14128K--
OpenAI: o4 Mini Deep ResearchOpenAI$2.00$8.00200K33.1
Perplexity: Sonar Reasoning ProPerplexity$2.00$8.00128K17.9
Perplexity: Sonar Deep ResearchPerplexity$2.00$8.00128K--
OpenAI: GPT-4.1OpenAI$2.00$8.001.0M--
OpenAI: o3OpenAI$2.00$8.00200K25.9
Mistral: Pixtral Large 2411Mistral AI$2.00$6.00131K14.0
Mistral Large 2411Mistral AI$2.00$6.00131K--
AI21: Jamba Large 1.7AI21 Labs$2.00$8.00256K10.9
Mistral: Mixtral 8x22B InstructMistral AI$2.00$6.0066K9.8
xAI: Grok 4.20 Multi-Agent BetaxAI$2.00$6.002M--
Google: Gemini 3.1 Pro Preview Custom ToolsGoogle$2.00$121.0M57.2
Google: Gemini 3 Pro PreviewGoogle$2.00$121.0M41.3
Mistral LargeMistral AI$2.00$6.00128K22.8
Google: Nano Banana Pro (Gemini 3 Pro Image Preview)Google$2.00$1266K48.4
Mistral Large 2407Mistral AI$2.00$6.00131K13.0
Google: Gemini 3.1 Pro PreviewGoogle$2.00$121.0M57.2
xAI: Grok 4.20 BetaxAI$2.00$6.002M11.7
Amazon: Nova Premier 1.0Amazon$2.50$131M19.0
OpenAI: GPT-4o AudioOpenAI$2.50$10128K17.3
Cohere: Command ACohere$2.50$10256K13.5
OpenAI: GPT-5.4OpenAI$2.50$151.1M--
OpenAI: GPT-5 Image MiniOpenAI$2.50$2.00400K--
OpenAI: GPT-4o (2024-11-20)OpenAI$2.50$10128K--
OpenAI: GPT-4oOpenAI$2.50$10128K--
OpenAI: GPT AudioOpenAI$2.50$10128K--
OpenAI: GPT-4o Search PreviewOpenAI$2.50$10128K--
Inflection: Inflection 3 PiInflection$2.50$108K--
Inflection: Inflection 3 ProductivityInflection$2.50$108K--
Cohere: Command R+ (08-2024)Cohere$2.50$10128K8.3
OpenAI: GPT-4o (2024-08-06)OpenAI$2.50$10128K--
Anthropic: Claude 3.7 Sonnet (thinking)Anthropic$3.00$15200K34.7
Anthropic: Claude Sonnet 4.5Anthropic$3.00$151M--
OpenAI: GPT-3.5 Turbo 16kOpenAI$3.00$4.0016K9.0
xAI: Grok 4xAI$3.00$15256K29.7
Magnum v4 72BAnthracite$3.00$5.0016K--
Perplexity: Sonar ProPerplexity$3.00$15200K--
Anthropic: Claude 3.7 SonnetAnthropic$3.00$15200K34.7
Anthropic: Claude Sonnet 4Anthropic$3.00$15200K44.4
xAI: Grok 3xAI$3.00$15131K21.6
Anthropic: Claude Sonnet 4.6Anthropic$3.00$151M10.3
Sao10K: Llama 3.1 70B Hanami x1Sao10k$3.00$3.0016K--
xAI: Grok 3 BetaxAI$3.00$15131K--
Perplexity: Sonar Pro SearchPerplexity$3.00$15200K15.5
Goliath 120BAlpindale$3.75$7.506K--
Meta: Llama 3.1 405B (base)Meta$4.00$4.0033K--
AionLabs: Aion-1.0Aion Labs$4.00$8.00131K--
Anthropic: Claude Opus 4.5Anthropic$5.00$25200K18.0
OpenAI: GPT-4o (2024-05-13)OpenAI$5.00$15128K--
Anthropic: Claude Opus 4.6Anthropic$5.00$251M18.0
OpenAI: GPT-4o (extended)OpenAI$6.00$18128K--
Anthropic: Claude 3.5 SonnetAnthropic$6.00$30200K15.9
OpenAI: GPT-5 ImageOpenAI$10$10400K--
OpenAI: GPT-4 Turbo (older v1106)OpenAI$10$30128K--
OpenAI: GPT-4 Turbo PreviewOpenAI$10$30128K--
OpenAI: GPT-4 TurboOpenAI$10$30128K--
OpenAI: o3 Deep ResearchOpenAI$10$40200K38.4
OpenAI: GPT-5 ProOpenAI$15$120400K--
Anthropic: Claude Opus 4Anthropic$15$75200K46.5
Anthropic: Claude Opus 4.1Anthropic$15$75200K18.0
OpenAI: o1OpenAI$15$60200K23.7
OpenAI: o3 ProOpenAI$20$80200K38.4
OpenAI: GPT-5.2 ProOpenAI$21$168400K--
OpenAI: GPT-4 (older v0314)OpenAI$30$608K--
OpenAI: GPT-4OpenAI$30$608K18.6
OpenAI: GPT-5.4 ProOpenAI$30$1801.1M--
OpenAI: o1-proOpenAI$150$600200K30.8

Frequently Asked Questions

How is AI model pricing calculated?

AI models charge per token, where a token is roughly 3/4 of a word. Prices are quoted per million tokens. Most models charge separately for input (prompt) tokens and output (completion) tokens, with output typically costing 2-4x more than input.

Why do output tokens cost more than input tokens?

Output tokens require the model to generate new text, which is computationally more expensive than reading input. The model must perform inference for each output token sequentially, while input tokens can be processed in parallel.

What is the cheapest good AI model?

Among paid models with benchmark data, LiquidAI: LFM2-2.6B offers some of the lowest pricing at $0.01/1M input tokens. There are also many free models available.

How much does it cost to use AI models per month?

For a typical individual workload of 100 requests per day at 2,000 tokens each, monthly costs range from $0 (free models) to $50+ (premium models). Enterprise usage at higher volumes can cost significantly more. Use the cost examples on this page to estimate your specific usage.

Are free AI models any good?

Yes, there are 27 free models available, and some rank well on intelligence benchmarks. Free models are a great way to start, though premium models generally offer better quality for demanding tasks.