Z.ai: GLM 4.7 Flash
Z AiID: z-ai/glm-4.7-flash
As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficiency. It is further optimized for agentic coding use cases, strengthening coding capabilities, long-horizon task planning, and tool collaboration, and has achieved leading performance among open-source models of the same size on several current public benchmark leaderboards.
Pricing per 1M Tokens
| Input (Prompt) | $0.06 |
| Output (Completion) | $0.40 |
| Cache Read | $0.01 |
| Cache Write | Free |
| Image | N/A |
Specifications
| Context Length | 203K |
| Max Output Tokens | N/A |
| Input Modalities | Text |
| Output Modalities | Text |
| Tokenizer | Other |
| Instruct Type | N/A |
| Top Provider Context | 203K |
| Top Provider Max Output | N/A |
| Moderated | No |
Compare this model
See how Z.ai: GLM 4.7 Flash stacks up against other models.
More from Z Ai
Last updated: March 23, 2026
First tracked: March 23, 2026