WeSearch

I tracked Claude Code and Codex pass-rates for 95 days — what "getting dumber" actually looks like

·3 min read · 0 reactions · 0 comments · 9 views
#ai#machinelearning#programming
I tracked Claude Code and Codex pass-rates for 95 days — what "getting dumber" actually looks like
⚡ TL;DR · AI summary

A recent analysis tracked the pass rates of Claude Code and Codex over 95 days to assess claims of declining performance. The data revealed that while Claude Code improved significantly after an update, it has recently shown a decline below its established baseline. In contrast, Codex's performance has remained relatively flat across its updates, indicating no significant improvement over time.

Key facts
Original article
DEV.to (Top)
Read full at DEV.to (Top) →
Opening excerpt (first ~120 words) tap to expand

try { if(localStorage) { let currentUser = localStorage.getItem('current_user'); if (currentUser) { currentUser = JSON.parse(currentUser); if (currentUser.id === 3845674) { document.getElementById('article-show-container').classList.add('current-user-is-article-author'); } } } } catch (e) { console.error(e); } Dylan Brown Posted on May 30 I tracked Claude Code and Codex pass-rates for 95 days — what "getting dumber" actually looks like #programming #ai #machinelearning #productivity Every few weeks a thread blows up: "Is Claude Code getting worse?" Someone swears Opus got lazy after an update; someone else says it's placebo. The arguments are always vibes — nobody posts numbers. So I built a tracker.

Excerpt limited to ~120 words for fair-use compliance. The full article is at DEV.to (Top).

Anonymous · no account needed
Share 𝕏 Facebook Reddit LinkedIn Threads WhatsApp Bluesky Mastodon Email

Discussion

0 comments

More from DEV.to (Top)