GPT-3.5 Turbo is a text-based AI model by OpenAI. It features a 16K token context window, 97 tok/s output speed, $0.50/1M input tokens pricing. It scores 9.0 on the intelligence index.
Output Speed
97 tok/s
Latency (TTFT)
0.42s
Blended Price
$0.75/M
| Input (Prompt) | $0.50 |
| Output (Completion) | $1.50 |
| Cache Read | Free |
| Cache Write | Free |
| Context Length | 16K |
| Max Output Tokens | N/A |
| Input Modalities | Text |
| Output Modalities | Text |
| Tokenizer | N/A |
GPT-3.5 Turbo costs $0.50/1M input tokens and $1.50/1M output tokens.
GPT-3.5 Turbo supports a context window of 16K tokens, which is approximately 8 pages of text.
GPT-3.5 Turbo has a lower coding score of 10.7. For demanding coding tasks, consider a model with a higher coding benchmark.
GPT-3.5 Turbo generates output at 97 tok/s. Time to first token is 0.42s.
Last updated:
No, GPT-3.5 Turbo is a paid model. Check the free models page for zero-cost alternatives.
See the alternatives section above for models with similar capabilities. You can also compare GPT-3.5 Turbo head-to-head with any model on our comparison page.