Z.ai: GLM 4.7 Flash

ID: z-ai/glm-4.7-flash

GLM-4.7-Flash is a 30B-class state-of-the-art (SOTA) model that balances performance and efficiency. It is further optimized for agentic coding, with stronger coding capabilities, long-horizon task planning, and tool collaboration. On several current public benchmark leaderboards, it leads open-source models of the same size.

Pricing per 1M Tokens

Input (Prompt)	$0.06
Output (Completion)	$0.40
Cache Read	$0.01
Cache Write	Free
Image	N/A
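
As a rough illustration of how the per-1M-token rates above translate into a per-request cost, here is a minimal Python sketch. The function name `estimate_cost` and its parameters are hypothetical, not part of any provider API. It also assumes that cached prompt tokens are billed at the Cache Read rate instead of the Input rate, which the table does not state explicitly; check the provider's billing rules before relying on it.

```python
from decimal import Decimal

# USD per 1M tokens, copied from the pricing table above.
PRICE_PER_MILLION = {
    "input": Decimal("0.06"),
    "output": Decimal("0.40"),
    "cache_read": Decimal("0.01"),
    "cache_write": Decimal("0"),  # listed as Free
}

ONE_MILLION = Decimal(1_000_000)


def estimate_cost(prompt_tokens, completion_tokens, cached_tokens=0):
    """Estimate the USD cost of one request (hypothetical helper).

    Assumption: `cached_tokens` is the part of the prompt served from
    cache and is billed at the cache-read rate instead of the input rate.
    """
    if min(prompt_tokens, completion_tokens, cached_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    if cached_tokens > prompt_tokens:
        raise ValueError("cached_tokens cannot exceed prompt_tokens")

    uncached_tokens = prompt_tokens - cached_tokens
    total = (
        uncached_tokens * PRICE_PER_MILLION["input"]
        + cached_tokens * PRICE_PER_MILLION["cache_read"]
        + completion_tokens * PRICE_PER_MILLION["output"]
    )
    return total / ONE_MILLION


if __name__ == "__main__":
    # 100K-token prompt (80K of it cached) with a 2K-token completion.
    print(estimate_cost(100_000, 2_000, cached_tokens=80_000))
```

Under these assumptions, that example request costs about $0.0028, and a request with 1M uncached prompt tokens and 1M completion tokens costs $0.46. Decimal arithmetic is used so that the small per-token rates do not accumulate floating-point error.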

Specifications

Context Length	203K tokens
Max Output Tokens	N/A
Input Modalities	Text
Output Modalities	Text
Tokenizer	Other
Instruct Type	N/A
Top Provider Context	203K
Top Provider Max Output	N/A
Moderated	No