xiaomi

Xiaomi: MiMo-V2.5

MiMo-V2.5 is a native omnimodal model by Xiaomi. It delivers Pro-level agentic performance at roughly half the inference cost, while surpassing MiMo-V2-Omni in multimodal perception across image and video understanding tasks. Its 1M context window supports complete documents, extended conversations, and complex task contexts in a single pass, making it ideal for integration with agent frameworks where strong reasoning, rich perception, and cost efficiency all matter.

Try in playground API reference

1,048,576 context

Modalities:text, image, audio, video->text

Released:4/22/2026

Weekly tokens

66.9B

Tokens generated this week (network-wide)

Rankings (last periods)

No ranking data yet for this model.