GPT-4.1 is a multimodal AI model by OpenAI. It features a 1.0M-token context window, an output speed of 84 tok/s, and input pricing of $2.00 per 1M tokens. It scores 26.3 on the intelligence index.
Output Speed: 84 tok/s
Latency (TTFT): 0.52s
Blended Price: $3.50 per 1M tokens

| Specification | Value |
| --- | --- |
| Input (Prompt) | $2.00 per 1M tokens |
| Output (Completion) | $8.00 per 1M tokens |
| Cache Read | Free |
| Cache Write | Free |
| Context Length | 1.0M tokens |
| Max Output Tokens | N/A |
| Input Modalities | Image + Text + File |
| Output Modalities | Text |
| Tokenizer | N/A |
GPT-4.1 costs $2.00/1M input tokens and $8.00/1M output tokens.
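As a rough illustration of how these per-token prices translate into a bill, the sketch below computes the cost of a single request and a blended per-million price. The function names are hypothetical, and the 3:1 input-to-output weighting is an assumption: it reproduces the $3.50/M blended figure listed on this page, but the page does not state how that figure is derived. Cache pricing is ignored.

```python
# Listed GPT-4.1 prices in USD per 1M tokens (taken from this page).
INPUT_PRICE_PER_M = 2.00
OUTPUT_PRICE_PER_M = 8.00


def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost of one request, ignoring any cache pricing."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


def blended_price(input_parts: int = 3, output_parts: int = 1) -> float:
    """Weighted average price per 1M tokens.

    The default 3:1 input:output weighting is an assumption, not a
    documented formula; it happens to match the $3.50/M shown above.
    """
    weighted = input_parts * INPUT_PRICE_PER_M + output_parts * OUTPUT_PRICE_PER_M
    return weighted / (input_parts + output_parts)


if __name__ == "__main__":
    # 10,000 prompt tokens and 1,000 completion tokens.
    print(f"request cost: ${request_cost(10_000, 1_000):.4f}")
    print(f"blended price: ${blended_price():.2f} per 1M tokens")
```

Under these prices, a request with 10,000 input tokens and 1,000 output tokens comes to about $0.028, with output tokens contributing $0.008 of that despite being a tenth of the volume.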
GPT-4.1 supports a context window of 1.0M tokens, which is approximately 524 pages of text.
GPT-4.1 has a relatively low coding score of 21.8. For demanding coding tasks, consider a model with a higher coding benchmark score.
GPT-4.1 generates output at 84 tok/s. Time to first token is 0.52s.
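To get a feel for what these two numbers mean for a full response, the sketch below adds the time to first token to the time needed to stream the remaining output. It is a simple estimate that assumes a steady 84 tok/s after the first token and ignores network overhead and load-dependent variation; the function name is hypothetical.

```python
# Figures listed on this page for GPT-4.1.
TTFT_SECONDS = 0.52
OUTPUT_TOKENS_PER_SECOND = 84.0


def estimated_response_seconds(output_tokens: int) -> float:
    """Rough wall-clock estimate: time to first token plus streaming time.

    Assumes a constant output rate, which real deployments only approximate.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return TTFT_SECONDS + output_tokens / OUTPUT_TOKENS_PER_SECOND


if __name__ == "__main__":
    for tokens in (100, 500, 2000):
        seconds = estimated_response_seconds(tokens)
        print(f"{tokens} tokens: about {seconds:.1f}s")
```

By this estimate, a 500-token answer takes roughly 6.5 seconds end to end, so for long completions the streaming rate matters far more than the 0.52s startup latency.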
No, GPT-4.1 is a paid model. Check the free models page for zero-cost alternatives.
See the alternatives section above for models with similar capabilities. You can also compare GPT-4.1 head-to-head with any model on our comparison page.