WeSearch
Hub / Search / coding benchmark
SEARCH · CODING BENCHMARK

Results for "coding benchmark".

5 stories match your query across our 700+ source catalog. Ranked by relevance and recency.

5 results for "coding benchmark"

FIRETHERING

Xiaomi releases MiMo-v2.5 Family weights with strong coding and agent benchmarks

Peking University gives its computer science students a compiler project every semester. Build a complete SysY compiler in Rust including lexer, parser, abstract syntax tree, IR code generation, assem…

· 3 views
SIMON WILLISON'S WEBLOG

Qwen3.6-27B: Flagship-Level Coding in a 27B Dense Model

Qwen3.6-27B: Flagship-Level Coding in a 27B Dense Model Big claims from Qwen about their latest open weight model: Qwen3.6-27B delivers flagship-level agentic coding performance, surpassing the previo…

· 9 views
DEV COMMUNITY

"Use Claude Code for FREE" is a Trap

Why free AI coding via Nvidia NIM and OpenRouter is a trap. The Cheap-Intelligent-Fast trilemma, 40 RPM rate limits, Opus 4.7 vs GPT-5.5 vs MiniMax M2.7 benchmarks, and why your first AI coding experi…

· 3 views
ARXIV.ORG

StoryTR: Narrative-Centric Video Temporal Retrieval with Theory of Mind Reasoning

Current video moment retrieval excels at action-centric tasks but struggles with narrative content. Models can see \textit{what is happening} but fail to reason \textit{why it matters}. This semantic …

· 3 views
ARXIV.ORG

AdaMamba: Adaptive Frequency-Gated Mamba for Long-Term Time Series Forecasting

Accurate long-term time series forecasting (LTSF) requires the capture of complex long-range dependencies and dynamic periodic patterns. Recent advances in frequency-domain analysis offer a global per…

· 3 views