AI Models Platforms

Xiaomi Opens Beta for MiMo-V2.5-Pro Agentic Model

Chinese tech giant pushes flagship coding model into public beta, with open-source release pending.

Andrés Martínez
Andrés MartínezAI Content Writer
April 23, 20262 min read
Share:
Abstract visualization of a neural network processing long chains of tool calls, with Xiaomi branding hues of orange and deep blue

Xiaomi pushed its MiMo-V2.5-Pro model into public beta on Wednesday, positioning the release as a flagship for long-horizon agentic work. The announcement post frames V2.5-Pro as a step up from March's MiMo-V2-Pro in software engineering and multi-turn tool use.

The company's headline demos lean hard on coding endurance. V2.5-Pro built a complete SysY compiler in Rust from scratch in 4.3 hours across 672 tool calls, scoring 233 out of 233 on a Peking University course test suite. A separate run produced a working desktop video editor at 8,192 lines of code over 1,868 tool calls and 11.5 hours of autonomous work. Xiaomi also points to an analog-chip design task (an FVF-LDO regulator in TSMC 180nm) that it says the model closed in about an hour, with Claude Code acting as the harness.

The token-efficiency claim is the real pitch to developers. On Xiaomi's own ClawEval numbers, V2.5-Pro hits 64% Pass^3 using roughly 70,000 tokens per trajectory, which the company says is 40 to 60% fewer than Claude Opus 4.6, Gemini 3.1 Pro, and GPT-5.4 at similar scores. All benchmarks are self-reported.

Access during beta runs through the API and Xiaomi's web studio. OpenRouter lists pricing at $1 per million input tokens and $3 per million output, with a 1-million-token context window. Xiaomi says it plans to open-source the full MiMo-V2.5 series soon, though no date has been committed.


Bottom Line

MiMo-V2.5-Pro is live in API-only beta at $1 per million input tokens and $3 per million output, with weights promised but not dated.

Quick Facts

  • 4.3 hours to build a complete SysY Rust compiler (672 tool calls, 233/233 tests, company-reported)
  • 8,192 lines of code for the video editor demo, across 1,868 tool calls over 11.5 hours
  • 64% Pass^3 on ClawEval at ~70K tokens per trajectory (Xiaomi self-reported)
  • 40 to 60% fewer tokens than Opus 4.6, Gemini 3.1 Pro, and GPT-5.4 on the company's own benchmarks
  • 1,048,576-token context window, priced at $1/M input and $3/M output on OpenRouter
Tags:XiaomiMiMoagentic AIcoding modelsopen sourceLLM benchmarks
Andrés Martínez

Andrés Martínez

AI Content Writer

Andrés reports on the AI stories that matter right now. No hype, just clear, daily coverage of the tools, trends, and developments changing industries in real time. He makes the complex feel routine.

Related Articles

Stay Ahead of the AI Curve

Get the latest AI news, reviews, and deals delivered straight to your inbox. Join 100,000+ AI enthusiasts.

By subscribing, you agree to our Privacy Policy. Unsubscribe anytime.

Xiaomi Launches MiMo-V2.5-Pro Beta for Agentic Coding | aiHola