OpenAI: GPT-4.1 Nano
ID: openai/gpt-4.1-nano
For tasks that demand low latency, GPT-4.1 nano is the fastest and cheapest model in the GPT-4.1 series. Despite its small size, it offers a 1 million token context window and scores 80.1% on MMLU, 50.3% on GPQA, and 9.8% on Aider polyglot coding, even higher than GPT-4o mini. It is well suited to tasks like classification or autocompletion.
Pricing per 1M Tokens
| Input (Prompt) | $0.10 |
| Output (Completion) | $0.40 |
| Cache Read | $0.02 |
| Cache Write | Free |
| Image | N/A |
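The per-million-token rates above can be turned into a per-request estimate. The following is a minimal sketch, not an official billing formula: the function name `estimate_cost` and its parameters are hypothetical, and it assumes that cached prompt tokens are billed at the cache-read rate instead of the regular input rate.

```python
# Prices in USD per 1M tokens, copied from the pricing table above.
PRICE_PER_MILLION = {
    "input": 0.10,
    "output": 0.40,
    "cache_read": 0.02,
    "cache_write": 0.00,  # listed as "Free"
}


def estimate_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> float:
    """Estimate the USD cost of one request.

    Assumption (not stated in the listing): `cached_tokens` is the part of
    `input_tokens` served from cache, billed at the cache-read rate instead
    of the regular input rate.
    """
    if min(input_tokens, output_tokens, cached_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")

    fresh_tokens = input_tokens - cached_tokens
    total = (
        fresh_tokens * PRICE_PER_MILLION["input"]
        + cached_tokens * PRICE_PER_MILLION["cache_read"]
        + output_tokens * PRICE_PER_MILLION["output"]
    )
    return total / 1_000_000


if __name__ == "__main__":
    # 200K-token prompt, 150K of it cached, 1K-token completion.
    print(f"${estimate_cost(200_000, 1_000, cached_tokens=150_000):.6f}")
```

Under these assumptions, the example request costs (50,000 × $0.10 + 150,000 × $0.02 + 1,000 × $0.40) / 1,000,000 = $0.0084.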
Specifications
| Context Length | 1.0M |
| Max Output Tokens | 33K |
| Input Modalities | Image + Text + File |
| Output Modalities | Text |
| Tokenizer | GPT |
| Instruct Type | N/A |
| Top Provider Context | 1.0M |
| Top Provider Max Output | 33K |
| Moderated | Yes |
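The context length and max output figures interact when sizing a request. The sketch below is illustrative only: it assumes that the prompt and the completion share the same context window, and it uses the rounded values shown in the table ("1.0M" and "33K"), so the exact provider limits may differ slightly. The function name `max_allowed_output` is hypothetical.

```python
# Rounded limits taken from the specifications table; exact values may differ.
CONTEXT_LENGTH = 1_000_000   # "1.0M"
MAX_OUTPUT_TOKENS = 33_000   # "33K"


def max_allowed_output(prompt_tokens: int) -> int:
    """Return the largest completion budget that fits alongside the prompt.

    Assumption: prompt and completion tokens count against one shared
    context window. Returns 0 when the prompt alone fills or exceeds it.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = CONTEXT_LENGTH - prompt_tokens
    return max(0, min(MAX_OUTPUT_TOKENS, remaining))


if __name__ == "__main__":
    for prompt in (10_000, 990_000, 1_200_000):
        print(prompt, "->", max_allowed_output(prompt))
```

A short prompt is capped by the output limit, a prompt near the window size is capped by the remaining context, and an oversized prompt leaves no room for output at all.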
Last updated: March 23, 2026
First tracked: March 23, 2026