OpenAI: GPT-4.1 Nano

OpenAIID: openai/gpt-4.1-nano

For tasks that demand low latency, GPT‑4.1 nano is the fastest and cheapest model in the GPT-4.1 series. It delivers exceptional performance at a small size with its 1 million token context window, and scores 80.1% on MMLU, 50.3% on GPQA, and 9.8% on Aider polyglot coding – even higher than GPT‑4o mini. It’s ideal for tasks like classification or autocompletion.

Pricing per 1M Tokens

Input (Prompt)$0.10
Output (Completion)$0.40
Cache Read$0.02
Cache WriteFree
ImageN/A

Specifications

Context Length1.0M
Max Output Tokens33K
Input ModalitiesImage + Text + File
Output ModalitiesText
TokenizerGPT
Instruct TypeN/A
Top Provider Context1.0M
Top Provider Max Output33K
ModeratedYes

Compare this model

See how OpenAI: GPT-4.1 Nano stacks up against other models.

More from OpenAI

Last updated: March 23, 2026

First tracked: March 23, 2026