OpenAI: GPT-4.1 Nano

ID: openai/gpt-4.1-nano

GPT-4.1 nano is the fastest and cheapest model in the GPT-4.1 series, aimed at tasks that demand low latency. Despite its small size, it offers a 1 million token context window and scores 80.1% on MMLU, 50.3% on GPQA, and 9.8% on Aider polyglot coding, even higher than GPT-4o mini. It is well suited to tasks such as classification or autocompletion.

Pricing per 1M Tokens

Input (Prompt)	$0.10
Output (Completion)	$0.40
Cache Read	$0.02
Cache Write	Free
Image	N/A
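
The per-token rates above can be turned into a rough per-request cost estimate. The following is a minimal sketch, not an official billing formula: the function name `estimate_cost` and the assumption that cached tokens are a subset of the input tokens billed at the Cache Read rate are illustrative choices, and actual billing may differ.

```python
from decimal import Decimal

# USD per 1M tokens, copied from the pricing table above.
PRICE_PER_MILLION = {
    "input": Decimal("0.10"),
    "output": Decimal("0.40"),
    "cache_read": Decimal("0.02"),
}


def estimate_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumption: cached_tokens is the part of input_tokens served from the
    cache and billed at the cache-read rate instead of the input rate.
    Cache writes are listed as free, so they add nothing.
    """
    if min(input_tokens, output_tokens, cached_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")

    uncached_tokens = input_tokens - cached_tokens
    total_per_million = (
        uncached_tokens * PRICE_PER_MILLION["input"]
        + cached_tokens * PRICE_PER_MILLION["cache_read"]
        + output_tokens * PRICE_PER_MILLION["output"]
    )
    return total_per_million / 1_000_000


if __name__ == "__main__":
    # 10,000 prompt tokens (8,000 of them cached) and 500 completion tokens.
    print(estimate_cost(10_000, 500, cached_tokens=8_000))
```

`Decimal` is used instead of floats so that small per-request amounts add up without rounding drift when summed over many calls.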

Specifications

Context Length	1.0M
Max Output Tokens	33K
Input Modalities	Image + Text + File
Output Modalities	Text
Tokenizer	GPT
Instruct Type	N/A
Top Provider Context	1.0M
Top Provider Max Output	33K
Moderated	Yes
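
The limits above can be checked before sending a request. This is a sketch under stated assumptions: "1.0M" and "33K" are treated as the rounded values 1,000,000 and 33,000 because the exact limits are not given here, the prompt and the completion are assumed to share the context window, and `fits_limits` is a hypothetical helper name.

```python
# Rounded limits taken from the specifications table (assumed, not exact).
CONTEXT_LENGTH = 1_000_000    # "1.0M"
MAX_OUTPUT_TOKENS = 33_000    # "33K"


def fits_limits(prompt_tokens: int, requested_output_tokens: int) -> bool:
    """Return True if a request stays within the listed limits.

    Assumption: prompt and completion tokens together must fit inside
    the context window, and the completion alone must not exceed the
    maximum output size.
    """
    if prompt_tokens < 0 or requested_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    within_output_cap = requested_output_tokens <= MAX_OUTPUT_TOKENS
    within_context = prompt_tokens + requested_output_tokens <= CONTEXT_LENGTH
    return within_output_cap and within_context


if __name__ == "__main__":
    print(fits_limits(900_000, 20_000))   # large prompt, modest completion
    print(fits_limits(990_000, 20_000))   # prompt plus completion overflows the window
    print(fits_limits(1_000, 40_000))     # completion exceeds the output cap
```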