Qwen: Qwen3 4B (free)
FreeQwenID: qwen/qwen3-4b:free
Qwen3-4B is a 4 billion parameter dense language model from the Qwen3 series, designed to support both general-purpose and reasoning-intensive tasks. It introduces a dual-mode architecture—thinking and non-thinking—allowing dynamic switching between high-precision logical reasoning and efficient dialogue generation. This makes it well-suited for multi-turn chat, instruction following, and complex agent workflows.
Pricing per 1M Tokens
| Input (Prompt) | Free |
| Output (Completion) | Free |
| Cache Read | Free |
| Cache Write | Free |
| Image | N/A |
Specifications
| Context Length | 41K |
| Max Output Tokens | N/A |
| Input Modalities | Text |
| Output Modalities | Text |
| Tokenizer | Qwen3 |
| Instruct Type | qwen3 |
| Top Provider Context | 41K |
| Top Provider Max Output | N/A |
| Moderated | No |
Compare this model
See how Qwen: Qwen3 4B (free) stacks up against other models.
More from Qwen
Last updated: March 23, 2026
First tracked: March 23, 2026