Qwen: Qwen-Max

QwenID: qwen/qwen-max

Qwen-Max, based on Qwen2.5, provides the best inference performance among [Qwen models](/qwen), especially for complex multi-step tasks. It's a large-scale MoE model that has been pretrained on over 20 trillion tokens and further post-trained with curated Supervised Fine-Tuning (SFT) and Reinforcement Learning from Human Feedback (RLHF) methodologies. The parameter count is unknown.

Pricing per 1M Tokens

Input (Prompt)$1.04
Output (Completion)$4.16
Cache Read$0.21
Cache WriteFree
ImageN/A

Specifications

Context Length33K
Max Output Tokens8K
Input ModalitiesText
Output ModalitiesText
TokenizerQwen
Instruct TypeN/A
Top Provider Context33K
Top Provider Max Output8K
ModeratedNo

More from Qwen

Last updated: March 23, 2026

First tracked: March 23, 2026