Qwen: Qwen VL Plus

QwenID: qwen/qwen-vl-plus

Qwen's Enhanced Large Visual Language Model. Significantly upgraded for detailed recognition capabilities and text recognition abilities, supporting ultra-high pixel resolutions up to millions of pixels and extreme aspect ratios for image input. It delivers significant performance across a broad range of visual tasks.

Pricing per 1M Tokens

Input (Prompt)$0.14
Output (Completion)$0.41
Cache Read$0.03
Cache WriteFree
ImageN/A

Specifications

Context Length131K
Max Output Tokens8K
Input ModalitiesText + Image
Output ModalitiesText
TokenizerQwen
Instruct TypeN/A
Top Provider Context131K
Top Provider Max Output8K
ModeratedNo

Compare this model

See how Qwen: Qwen VL Plus stacks up against other models.

More from Qwen

Last updated: March 23, 2026

First tracked: March 23, 2026