Xiaomi expanded its AI model lineup with two additional releases alongside MiMo-V2-Pro. MiMo-V2-Omni handles multimodal tasks across text, image, and audio, while MiMo-V2-TTS delivers expressive voice synthesis for conversational AI applications.
The triple launch is backed by an $8.7 billion AI investment, signaling Xiaomi's aggressive push into the foundation model space. The announcement sent the company's Hong Kong-listed shares up 5.8%, reflecting investor confidence in the strategy.
For developers building voice-enabled or multimodal applications, the MiMo-V2 family offers a cost-effective alternative to U.S. providers: the Pro model already delivers competitive benchmark performance at a fraction of the price.