Search: "coding benchmark"

5 stories match your query across our 700+ source catalog. Ranked by relevance and recency.

5 results for "coding benchmark"

Xiaomi releases MiMo-v2.5 Family weights with strong coding and agent benchmarks

Peking University gives its computer science students a compiler project every semester. Build a complete SysY compiler in Rust including lexer, parser, abstract syntax tree, IR code generation, assem…

Tue, 28 Apr 2026 12:24:59 GMT · 3 views

SIMON WILLISON'S WEBLOG

Qwen3.6-27B: Flagship-Level Coding in a 27B Dense Model

Qwen3.6-27B: Flagship-Level Coding in a 27B Dense Model Big claims from Qwen about their latest open weight model: Qwen3.6-27B delivers flagship-level agentic coding performance, surpassing the previo…

Sun, 26 Apr 2026 22:44:22 GMT · 9 views

DEV COMMUNITY

"Use Claude Code for FREE" is a Trap

Why free AI coding via Nvidia NIM and OpenRouter is a trap. The Cheap-Intelligent-Fast trilemma, 40 RPM rate limits, Opus 4.7 vs GPT-5.5 vs MiniMax M2.7 benchmarks, and why your first AI coding experi…

Tue, 28 Apr 2026 08:54:13 GMT · 3 views

ARXIV.ORG

StoryTR: Narrative-Centric Video Temporal Retrieval with Theory of Mind Reasoning

Current video moment retrieval excels at action-centric tasks but struggles with narrative content. Models can see \textit{what is happening} but fail to reason \textit{why it matters}. This semantic …

Tue, 28 Apr 2026 04:13:21 GMT · 3 views

ARXIV.ORG

AdaMamba: Adaptive Frequency-Gated Mamba for Long-Term Time Series Forecasting

Accurate long-term time series forecasting (LTSF) requires the capture of complex long-range dependencies and dynamic periodic patterns. Recent advances in frequency-domain analysis offer a global per…