Xiaomi: MiMo-V2-Flash

XiaomiID: xiaomi/mimo-v2-flash

MiMo-V2-Flash is an open-source foundation language model developed by Xiaomi. It is a Mixture-of-Experts model with 309B total parameters and 15B active parameters, adopting hybrid attention architecture. MiMo-V2-Flash supports a hybrid-thinking toggle and a 256K context window, and excels at reasoning, coding, and agent scenarios. On SWE-bench Verified and SWE-bench Multilingual, MiMo-V2-Flash ranks as the top #1 open-source model globally, delivering performance comparable to Claude Sonnet 4.5 while costing only about 3.5% as much. Users can control the reasoning behaviour with the `reasoning` `enabled` boolean. [Learn more in our docs](https://openrouter.ai/docs/use-cases/reasoning-tokens#enable-reasoning-with-default-config).

Pricing per 1M Tokens

Input (Prompt)	$0.09
Output (Completion)	$0.29
Cache Read	$0.04
Cache Write	Free
Image	N/A

Specifications

Context Length	262K
Max Output Tokens	66K
Input Modalities	Text
Output Modalities	Text
Tokenizer	Other
Instruct Type	N/A
Top Provider Context	262K
Top Provider Max Output	66K
Moderated	No

Compare this model

See how Xiaomi: MiMo-V2-Flash stacks up against other models.

vs Xiaomi: MiMo-V2-Omni vs Xiaomi: MiMo-V2-Pro

More from Xiaomi

Last updated: March 23, 2026

First tracked: March 23, 2026