Qwen: Qwen3 235B A22B
ID: qwen/qwen3-235b-a22b
Qwen3-235B-A22B is a 235B-parameter mixture-of-experts (MoE) model developed by Qwen that activates 22B parameters per forward pass. It can switch between a "thinking" mode for complex reasoning, math, and code tasks and a "non-thinking" mode for efficient general conversation. The model offers strong reasoning, multilingual support (100+ languages and dialects), advanced instruction following, and agent tool calling. It natively handles a 32K-token context window and extends to 131K tokens using YaRN-based scaling.
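As a rough illustration of the context extension described above, the sketch below computes the scaling factor implied by going from the native window to the extended one. It assumes that "32K" and "131K" mean 32,768 and 131,072 tokens, and the `rope_scaling` key names follow a common Hugging Face-style convention; both are assumptions rather than details stated in this listing.

```python
# Assumed exact token counts; the listing only says "32K" and "131K".
NATIVE_CONTEXT = 32_768
EXTENDED_CONTEXT = 131_072


def yarn_factor(target_context: int, native_context: int = NATIVE_CONTEXT) -> float:
    """Return the scaling factor needed to reach target_context tokens."""
    if target_context <= native_context:
        return 1.0  # within the native window, no scaling is needed
    return target_context / native_context


# Hypothetical config fragment: the key names are an assumption and may
# differ between inference frameworks.
rope_scaling = {
    "rope_type": "yarn",
    "factor": yarn_factor(EXTENDED_CONTEXT),
    "original_max_position_embeddings": NATIVE_CONTEXT,
}

print(rope_scaling["factor"])  # 4.0
```

Under these assumptions the extended window is four times the native one, so prompts that stay under 32K tokens would not need any scaling at all.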
Pricing per 1M Tokens
| Token Type | Price (USD per 1M tokens) |
| --- | --- |
| Input (Prompt) | $0.45 |
| Output (Completion) | $1.82 |
| Cache Read | Free |
| Cache Write | Free |
| Image | N/A |
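To make the per-token rates concrete, here is a minimal sketch that estimates the cost of a single request from the input and output prices listed above. The function name and the example token counts are hypothetical, and the sketch ignores caching, which the table lists as free.

```python
from decimal import Decimal

# Prices in USD per 1M tokens, taken from the pricing table.
INPUT_PRICE_PER_M = Decimal("0.45")
OUTPUT_PRICE_PER_M = Decimal("1.82")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper)."""
    total = (
        Decimal(prompt_tokens) * INPUT_PRICE_PER_M
        + Decimal(completion_tokens) * OUTPUT_PRICE_PER_M
    )
    return total / TOKENS_PER_M


# Example: a 20,000-token prompt with a 2,000-token completion.
cost = estimate_cost(20_000, 2_000)
print(f"${cost:.5f}")  # $0.01264
```

`Decimal` is used instead of floats so that small per-request amounts add up without rounding drift when summed over many requests.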
Specifications
| Specification | Value |
| --- | --- |
| Context Length | 131K |
| Max Output Tokens | 8K |
| Input Modalities | Text |
| Output Modalities | Text |
| Tokenizer | Qwen3 |
| Instruct Type | qwen3 |
| Top Provider Context | 131K |
| Top Provider Max Output | 8K |
| Moderated | No |
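The context and output limits above bound how large a prompt can be. The following sketch shows one way to budget for that, assuming "131K" and "8K" mean 131,072 and 8,192 tokens and that prompt and completion tokens share the same context window; neither assumption is confirmed by this listing, and the function name is hypothetical.

```python
# Assumed exact values; the listing only says "131K" and "8K".
CONTEXT_LENGTH = 131_072
MAX_OUTPUT_TOKENS = 8_192


def max_prompt_tokens(requested_output: int = MAX_OUTPUT_TOKENS) -> int:
    """Return the largest prompt that leaves room for requested_output tokens.

    Assumes prompt and completion count against one shared context window.
    """
    if not 0 < requested_output <= MAX_OUTPUT_TOKENS:
        raise ValueError(
            f"requested_output must be between 1 and {MAX_OUTPUT_TOKENS}"
        )
    return CONTEXT_LENGTH - requested_output


print(max_prompt_tokens())       # 122880
print(max_prompt_tokens(1_024))  # 130048
```

Reserving fewer output tokens leaves more room for the prompt, which can matter for long-document inputs.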
Last updated: March 23, 2026
First tracked: March 23, 2026