GLM-4.6 (Reasoning) is a text-based AI model by Z AI. It generates output at 88 tok/s, costs $0.57 per 1M input tokens, and scores 32.5 on the intelligence index.
Output Speed: 88 tok/s
Latency (TTFT): 0.76s
Blended Price: $0.98/M
| Specification | Value |
| --- | --- |
| Input (Prompt) | $0.57/1M tokens |
| Output (Completion) | $2.20/1M tokens |
| Cache Read | Free |
| Cache Write | Free |
| Context Length | N/A |
| Max Output Tokens | N/A |
| Input Modalities | Text |
| Output Modalities | Text |
| Tokenizer | N/A |
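The $0.98/M blended price is consistent with a weighted average of the input and output prices at a 3:1 input-to-output token ratio. That ratio is an assumption: the page does not state how the blend is computed, and the function name and weights in this sketch are illustrative only.

```python
def blended_price(
    input_price: float,
    output_price: float,
    input_weight: float = 3.0,   # assumed weighting, not stated on the page
    output_weight: float = 1.0,  # assumed weighting, not stated on the page
) -> float:
    """Weighted average of per-1M-token prices."""
    total_weight = input_weight + output_weight
    return (input_price * input_weight + output_price * output_weight) / total_weight


# Listed prices: $0.57/1M input tokens, $2.20/1M output tokens.
price = blended_price(0.57, 2.20)
print(f"${price:.2f}/M")
```

A workload with a different input-to-output mix will see a different effective price, so the weights are worth adjusting to match real traffic.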
GLM-4.6 (Reasoning) costs $0.57/1M input tokens and $2.20/1M output tokens.
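As a rough illustration of what these rates mean per request, the sketch below multiplies token counts by the listed prices. The function name and the example token counts are hypothetical, and the calculation assumes no cache discounts or other adjustments.

```python
INPUT_PRICE_PER_M = 0.57   # USD per 1M input tokens (listed price)
OUTPUT_PRICE_PER_M = 2.20  # USD per 1M output tokens (listed price)


def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost of one request at the listed per-token rates."""
    return (
        input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    ) / 1_000_000


# Hypothetical request: 10,000 prompt tokens and 2,000 completion tokens.
cost = request_cost(10_000, 2_000)
print(f"${cost:.4f}")
```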
GLM-4.6 (Reasoning) has a moderate coding score of 29.5. It can handle basic programming tasks but may not be the best choice for complex coding work.
GLM-4.6 (Reasoning) generates output at 88 tok/s. Time to first token is 0.76s.
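These two figures can be combined into a rough end-to-end response time: time to first token plus the output length divided by the output speed. This is a simplified sketch that assumes a constant generation rate; the function name and the example token count are hypothetical.

```python
TTFT_SECONDS = 0.76              # listed time to first token
OUTPUT_TOKENS_PER_SECOND = 88.0  # listed output speed


def estimated_response_seconds(output_tokens: int) -> float:
    """Approximate wall-clock time to receive a full response."""
    return TTFT_SECONDS + output_tokens / OUTPUT_TOKENS_PER_SECOND


# Hypothetical response of 880 output tokens.
seconds = estimated_response_seconds(880)
print(f"{seconds:.2f}s")
```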
GLM-4.6 (Reasoning) is not free; it is a paid model. Check the free models page for zero-cost alternatives.
See the alternatives section above for models with similar capabilities. You can also compare GLM-4.6 (Reasoning) head-to-head with any model on our comparison page.