13 stories tagged with #reliability, in publish-time order across the WeSearch catalog. Tag pages update as new stories ingest.
⌘ RSS feed for this tag → or search "Reliability"
An Agent Run Is Not Done When the Model Stops Talking
An Agent Run Is Not Done When the Model Stops Talking The Problem You prompt an...…
Code Orange: Fail Small is complete. The result is a stronger Cloudflare network
We have completed a massive engineering effort to make our infrastructure more resilient. Through new tools like Snapstone and the Engineering Codex, we've implemented safer config…
When Retries Turn Hostile — How Control Logic Kills Production Systems
"Your retries are killing us." A service team received this message from a downstream dependency...…
Crop Undercount Raises Questions About Reliability of U.S.D.A. Data
Corn estimates were off by 4.5 million acres last year. A lack of survey responses, not job cuts, led to the miss, the Agriculture Department said.…
Microsoft releases first big update after Nadella's vow to 'win back fans'
Lots of fixes, some performance tweaks. Fingers crossed there's no out-of-band patch to follow Microsoft is following through on its promise to prioritize Windows stability with it…
controller staleness is the hidden tax of platform automation
Why stale control loops, not missing automation, are becoming the real platform engineering tax.…
Illegal vs. Unwanted States
Keep Unwanted States Representable…
DO-Bench: An Attributable Benchmark for Diagnosing Object Hallucination in Vision-Language Models
Object level hallucination remains a central reliability challenge for vision language models (VLMs), particularly in binary object existence verification. Existing benchmarks emph…
Ghostty is ditching GitHub over chronic reliability failures, and no one knows where it's going yet
There were more bad days than good ones during April.…
Horror Stories from Former Azure Engineer
Inside the complacency and decisions that eroded trust in Azure—from a former Azure Core engineer.…
Why Your AI Agents Keep Breaking Your Workflows
Your AI investment isn’t paying off the way you expected.…
GitHub addresses two recent incidents and says it aims to improve reliability amid AI growth, focusing on "availability first, then capacity, then new features" (Vlad Fedorov/The GitHub Blog)
By Vlad Fedorov / The GitHub Blog. View the full context on Techmeme.…
Microsoft Outlook for iOS still down and out for many after 'service change'
Sign-in failures, unexpected sign-outs... just another day for users Users of Microsoft Outlook on iOS are continuing to experience outages more than 24 hours after glitches first …