MiMo V2 Flash

Xiaomi's high-efficiency inference model with hybrid architecture, 3 MTP layers for 2.5-3.7x faster inference, and 256K context.

xiaomi/mimo-v2-flash
STABLEScheduled for DeactivationGet started
Streaming
Tools
Reasoning
JSON Output
No ratings yetSign in to rate

Select Provider

Xiaomi Pricing for MiMo V2 Flash

View detailed pricing and capabilities for this provider.

Xiaomi
Context: 256k
Deactivating on Jun 18, 2026
Input
$0.1
/M tokens
Cached
$0.02
/M tokens
Output
$0.3
/M tokens
Get started