Qwen: Qwen3 235B A22B

QwenID: qwen/qwen3-235b-a22b

Qwen3-235B-A22B is a 235B parameter mixture-of-experts (MoE) model developed by Qwen, activating 22B parameters per forward pass. It supports seamless switching between a "thinking" mode for complex reasoning, math, and code tasks, and a "non-thinking" mode for general conversational efficiency. The model demonstrates strong reasoning ability, multilingual support (100+ languages and dialects), advanced instruction-following, and agent tool-calling capabilities. It natively handles a 32K token context window and extends up to 131K tokens using YaRN-based scaling.

Pricing per 1M Tokens

Input (Prompt)$0.45
Output (Completion)$1.82
Cache ReadFree
Cache WriteFree
ImageN/A

Specifications

Context Length131K
Max Output Tokens8K
Input ModalitiesText
Output ModalitiesText
TokenizerQwen3
Instruct Typeqwen3
Top Provider Context131K
Top Provider Max Output8K
ModeratedNo

Compare this model

See how Qwen: Qwen3 235B A22B stacks up against other models.

More from Qwen

Last updated: March 23, 2026

First tracked: March 23, 2026