WeSearch

Xiaomi releases MiMo-v2.5 Family weights with strong coding and agent benchmarks

·13 min read · 0 reactions · 0 comments · 0 views
Xiaomi releases MiMo-v2.5 Family weights with strong coding and agent benchmarks

Peking University gives its computer science students a compiler project every semester. Build a complete SysY compiler in Rust including lexer, parser, abstract syntax tree, IR code generation, assembly backend, performance optimization. The whole thing. Students typically need several weeks. MiMo-V2.5-Pro finished it in 4.3 hours. Perfect score. 233 out of 233 tests passed on a hidden test suite it had never seen. That's a real university project and a model that scored higher than most students who spent weeks on it. Xiaomi built this, which is still a sentence that takes a moment to process. V2.5-Pro is the next step up from MiMo-V2-Flash and its closed source for now, but Xiaomi has confirmed open source is coming for the V2.5 series. What V2.5-Pro adds over Flash is meaningful. Better long-horizon coherence, stronger agentic capabilities, and the ability to sustain complex tasks across more than a thousand tool calls without losing the thread.

Original article
Firethering
Read full at Firethering →
Full article excerpt tap to expand

HomeTechMiMo-V2.5-Pro Is Now Open Source and It's Sitting Right Next to Claude... { "@context": "https://schema.org", "@type": "BreadcrumbList", "itemListElement": [{ "@type": "ListItem", "position": 1, "item": { "@type": "WebSite", "@id": "https://firethering.com/", "name": "Home" } },{ "@type": "ListItem", "position": 2, "item": { "@type": "WebPage", "@id": "https://firethering.com/tech/", "name": "Tech" } },{ "@type": "ListItem", "position": 3, "item": { "@type": "WebPage", "@id": "", "name": "MiMo-V2.5-Pro Is Now Open Source and It's Sitting Right Next to Claude..." } } ] } MiMo-V2.5-Pro Is Now Open Source and It’s Sitting Right Next to Claude Opus 4.6 on Coding By Mohit Geryani April 23, 2026 0 .tdi_60_rand_style > .td-element-style-before { content:'' !important; width:100% !important; height:100% !important; position:absolute !important; top:0 !important; left:0 !important; display:block !important; z-index:0 !important; background-repeat: no-repeat !important; background-size:contain !important; background-position:center top !important; } Last updated: April 28, 2026 Share FacebookTwitterPinterestWhatsApp - Advertisement - (adsbygoogle = window.adsbygoogle || []).push({}); Peking University gives its computer science students a compiler project every semester. Build a complete SysY compiler in Rust including lexer, parser, abstract syntax tree, IR code generation, assembly backend, performance optimization. The whole thing. Students typically need several weeks. MiMo-V2.5-Pro finished it in 4.3 hours. Perfect score. 233 out of 233 tests passed on a hidden test suite it had never seen. That’s a real university project and a model that scored higher than most students who spent weeks on it. Xiaomi built this, which is still a sentence that takes a moment to process. V2.5-Pro is the next step up from MiMo-V2-Flash and its now Open Source. What V2.5-Pro adds over Flash is meaningful. Better long-horizon coherence, stronger agentic capabilities, and the ability to sustain complex tasks across more than a thousand tool calls without losing the thread. That’s not a benchmark row. That’s a story. And it’s the most honest way to explain what Xiaomi thinks it has built here. Table of ContentsThree things it built while nobody was watchingWhat the benchmarks sayMiMo vs DeepSeek V4 Pro: two open source giants, one decisionWhat actually changed at long contextThe token efficiency nobody is talking aboutThere’s also a multimodal version: The one that sees, hears, and readsHow to run itWho should care Three things it built while nobody was watching The compiler story is the most dramatic but it’s not alone. After the compiler, Xiaomi gave it a vaguer prompt like build a video editor. No detailed spec or anything specific. What came back after 11.5 hours and 1,868 tool calls was a working desktop application with a multi-track timeline, clip trimming, crossfades, audio mixing, and an export pipeline. The final codebase was 8,192 lines. A working product built start to finish while the humans presumably went home. The third test went somewhere most coding benchmarks don’t touch. A graduate-level analog circuit design task specifically a Flipped-Voltage-Follower low-dropout regulator in a TSMC 180nm process. This is the kind of work that takes trained analog engineers several days. MiMo-V2.5-Pro was wired into an ngspice simulation loop, called the simulator, read the waveforms, adjusted parameters, and iterated. About an hour later every…

This excerpt is published under fair use for community discussion. Read the full article at Firethering.

Anonymous · no account needed
Share 𝕏 Facebook Reddit LinkedIn Email

Discussion

0 comments

More from Firethering