OpenAI: GPT-5.4 Mini
GPT-5.4 mini brings the core capabilities of GPT-5.4 to a faster, more efficient model optimized for high-throughput workloads. It supports text and image inputs with strong performance across reasoning, coding, and tool use, while reducing latency and cost for large-scale deployments. The model is designed for production environments that require a balance of capability and efficiency, making it well suited for chat applications, coding assistants, and agent workflows that operate at scale. GPT-5.4 mini delivers reliable instruction following, solid multi-step reasoning, and consistent performance across diverse tasks with improved cost efficiency.
Pricing per 1M Tokens
| Input (Prompt) | $0.75 |
| Output (Completion) | $4.50 |
| Cache Read | $0.07 |
| Cache Write | Free |
| Image | N/A |
Specifications
| Context Length | 400K |
| Max Output Tokens | 128K |
| Input Modalities | File + Image + Text |
| Output Modalities | Text |
| Tokenizer | GPT |
| Instruct Type | N/A |
| Top Provider Context | 400K |
| Top Provider Max Output | 128K |
| Moderated | Yes |
Compare this model
See how OpenAI: GPT-5.4 Mini stacks up against other models.
More from OpenAI
Last updated: March 23, 2026
First tracked: March 23, 2026