Qwen3 VL 8B Instruct is an AI model by Alibaba, listed here with text input and text output. It generates output at 137 tok/s, costs $0.18 per 1M input tokens, and scores 14.3 on the intelligence index.
| Output Speed | 137 tok/s |
| Latency (TTFT) | 1.01s |
| Blended Price | $0.31/M |
| Input (Prompt) | $0.18/1M tokens |
| Output (Completion) | $0.70/1M tokens |
| Cache Read | Free |
| Cache Write | Free |
| Context Length | N/A |
| Max Output Tokens | N/A |
| Input Modalities | Text |
| Output Modalities | Text |
| Tokenizer | N/A |
Qwen3 VL 8B Instruct costs $0.18/1M input tokens and $0.70/1M output tokens.
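The per-token prices above can be turned into a per-request estimate with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and the 3:1 input-to-output weighting used for the blended figure is an assumption (it happens to reproduce the listed $0.31/M, but the page does not state how that number is derived).

```python
# Prices taken from the listing, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.18
OUTPUT_PRICE_PER_M = 0.70


def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost of one request, ignoring caching (listed as free)."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


def blended_price(input_parts: float = 3, output_parts: float = 1) -> float:
    """Weighted average price per 1M tokens.

    The default 3:1 input:output ratio is an assumption, not a documented rule.
    """
    weighted = input_parts * INPUT_PRICE_PER_M + output_parts * OUTPUT_PRICE_PER_M
    return weighted / (input_parts + output_parts)


if __name__ == "__main__":
    # 2,000 prompt tokens and 500 completion tokens.
    print(f"{request_cost(2_000, 500):.6f}")  # 0.000710
    print(f"{blended_price():.2f}")           # 0.31
```

At these rates, a workload dominated by long completions costs noticeably more than the blended figure suggests, since output tokens are priced at roughly four times input tokens.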
Qwen3 VL 8B Instruct has a relatively low coding score of 7.3. For demanding coding tasks, consider a model with a higher coding benchmark score.
Qwen3 VL 8B Instruct generates output at 137 tok/s. Time to first token is 1.01s.
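As a rough guide, the two figures above combine into an end-to-end response time: time to first token plus the number of output tokens divided by the output speed. The sketch below assumes a constant generation rate after the first token, which real deployments will not match exactly; the function name is hypothetical.

```python
# Figures taken from the listing.
OUTPUT_SPEED_TOK_PER_S = 137.0
TTFT_S = 1.01


def estimated_response_time(output_tokens: int) -> float:
    """Approximate seconds until the full response has been generated.

    Assumes steady throughput once streaming starts (a simplification).
    """
    return TTFT_S + output_tokens / OUTPUT_SPEED_TOK_PER_S


if __name__ == "__main__":
    print(f"{estimated_response_time(500):.2f}")  # 4.66
```

For short replies the 1.01s time to first token dominates; for long replies the generation rate does.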
Qwen3 VL 8B Instruct is a paid model, not a free one. Check the free models page for zero-cost alternatives.
See the alternatives section above for models with similar capabilities. You can also compare Qwen3 VL 8B Instruct head-to-head with any model on our comparison page.