Qwen: Qwen3 4B (free)

Free

QwenID: qwen/qwen3-4b:free

Qwen3-4B is a 4 billion parameter dense language model from the Qwen3 series, designed to support both general-purpose and reasoning-intensive tasks. It introduces a dual-mode architecture—thinking and non-thinking—allowing dynamic switching between high-precision logical reasoning and efficient dialogue generation. This makes it well-suited for multi-turn chat, instruction following, and complex agent workflows.

Pricing per 1M Tokens

Input (Prompt)	Free
Output (Completion)	Free
Cache Read	Free
Cache Write	Free
Image	N/A

Specifications

Context Length	41K
Max Output Tokens	N/A
Input Modalities	Text
Output Modalities	Text
Tokenizer	Qwen3
Instruct Type	qwen3
Top Provider Context	41K
Top Provider Max Output	N/A
Moderated	No