Xiaomi logo

MiMo V2.5


MiMo V2.5 is Xiaomi logoXiaomi's language model with a 1.1M context window and up to 131K output tokens, available from 2 providers, starting at $0.400 / 1M input and $2.00 / 1M output. Xiaomi's native omnimodal LLM delivering Pro-level agentic performance at reduced inference cost, with strong multimodal perception across image and video inputs.
Spec
Canonical IDxiaomi-mimo-2-5
TypeLanguage
StatusActive
CreatorXiaomiXiaomi
Providers
Context Window1.1M tokens
Max Output131K tokens
Input ModalitiesAudioImagePdfTextVideo
Output ModalitiesText
Reasoning Effortsdefault
Release Date · 9 days ago

Capabilities

Input5/5
Text
Image
Audio
Video
PDF
Output1/5
Text
Image·
Audio·
Video·
Embedding·
Capabilities4/13
Reasoning
Adaptive Reasoning·
Function Calling
Parallel Function Calling·
Structured Outputs
Native JSON Schema·
Web Search·
URL Context·
Computer Use·
Code Execution·
File Search·
Prompt Caching
Assistant Prefill·

Pricing by Provider

ProviderStandard
Input
$ / 1M
Output
$ / 1M
Cache Read
$ / 1M
OpenRouter logo
OpenRouter
xiaomi/mimo-v2.5
$0.400$2.00$0.080
Vercel AI Gateway logo
Vercel AI Gateway
xiaomi/mimo-v2.5
$0.400$2.00$0.080

Cost Calculator

Preset:
Compares every provider & tier in USD

Versions

VersionReleasedContextInput / 1MOutput / 1MStatus
MiMo V2.51.1M$0.400$2.00Current
MiMo V2.5 Pro1.1M$1.00$3.00Available
MiMo V2 Omni262K$0.400$2.00Available
MiMo V2 Pro1.0M$1.00$3.00Available
MiMo V2 Flash262K$0.090$0.290Available
MiMo V2Available
MiMo V2 Flash ReasoningAvailable
MiMo V2 OmniAvailable
MiMo V2 TTSAvailable
MiMo V2.5 424BAvailable

Model IDs