Z.ai: GLM 4.7 Flash

Z AiID: z-ai/glm-4.7-flash

As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficiency. It is further optimized for agentic coding use cases, strengthening coding capabilities, long-horizon task planning, and tool collaboration, and has achieved leading performance among open-source models of the same size on several current public benchmark leaderboards.

Pricing per 1M Tokens

Input (Prompt)$0.06
Output (Completion)$0.40
Cache Read$0.01
Cache WriteFree
ImageN/A

Specifications

Context Length203K
Max Output TokensN/A
Input ModalitiesText
Output ModalitiesText
TokenizerOther
Instruct TypeN/A
Top Provider Context203K
Top Provider Max OutputN/A
ModeratedNo

Compare this model

See how Z.ai: GLM 4.7 Flash stacks up against other models.

More from Z Ai

Last updated: March 23, 2026

First tracked: March 23, 2026