OpenAI: GPT-5.4 Nano
OpenAIID: openai/gpt-5.4-nano
GPT-5.4 nano is the most lightweight and cost-efficient variant of the GPT-5.4 family, optimized for speed-critical and high-volume tasks. It supports text and image inputs and is designed for low-latency use cases such as classification, data extraction, ranking, and sub-agent execution. The model prioritizes responsiveness and efficiency over deep reasoning, making it ideal for pipelines that require fast, reliable outputs at scale. GPT-5.4 nano is well suited for background tasks, real-time systems, and distributed agent architectures where minimizing cost and latency is essential.
Pricing per 1M Tokens
| Input (Prompt) | $0.20 |
| Output (Completion) | $1.25 |
| Cache Read | $0.02 |
| Cache Write | Free |
| Image | N/A |
Specifications
| Context Length | 400K |
| Max Output Tokens | 128K |
| Input Modalities | File + Image + Text |
| Output Modalities | Text |
| Tokenizer | GPT |
| Instruct Type | N/A |
| Top Provider Context | 400K |
| Top Provider Max Output | 128K |
| Moderated | Yes |
Compare this model
See how OpenAI: GPT-5.4 Nano stacks up against other models.
More from OpenAI
Last updated: March 23, 2026
First tracked: March 23, 2026