60 stories tagged with #optimization, in publish-time order across the WeSearch catalog. Tag pages update as new stories are ingested.
Governor – a Claude Code plugin to reduce token/context waste
You can beat the binary search
A Physics Engine with Incremental Rollback for Multiplayer Games
We want Easel to be powerful enough to make the kinds of games you would play for hours.…
Chrome Web Store SEO: How I Optimized 17 Extensions for Search
Chrome Web Store Has a Search Engine, and Nobody Optimizes for It. Most Chrome extension...…
Contrarian View: 2026 Startups Should Skip Vercel for Next.js 15 – Use Cloudflare Pages for 40% Lower Hosting Costs
By Q2 2026, Vercel’s standard Next.js 15 hosting tier will cost startups an average of $0.08 per GB...…
Finally, Windows 11 desktop PC users can enjoy Xbox Mode — and Microsoft has a new gift for Ally X users
Microsoft isn't resting on its laurels with Windows 11 gaming optimization, with Xbox Mode now being rolled out to desktop PC users.…
CSS Performance Optimization: How to Achieve 100 Points in Google PageSpeed
CSS performance optimization: how to squeeze 100 out of PageSpeed without losing your mind. Imagine: you...…
Step-level Optimization for Efficient Computer-use Agents
arXiv:2604.27151v1. Abstract: Computer-use agents provide a promising path toward general software automation because they can interact directly with arbitrary gr…
My home Wi-Fi was full of dead zones - here are 6 solutions that actually worked
I struggled with Wi-Fi dead spots throughout my home for years. Here's what helped connect the dots.…
Your AI Agent Is Sending 10x More API Calls Than You Think — Here's Where the Cost Hides
The hidden multiplier nobody budgets for. When we moved from single-turn chatbots to...…
Linux exploit instantly grants administrator access on most distributions since 2017 — cryptography optimization snafu grants root privileges to local users
Zero-day exploit instantly grants administrator access on most Linux distributions since 2017…
New Android 17 trick could make ChatGPT’s screen sharing way smoother on your phone
Though allowing Accessibility access could raise concerns for some users.…
Understanding when high availability infrastructure becomes a bottleneck
When your failover systems become the failure point. Your carefully designed high...…
Lightweight OpenCode profile for routine dev work with focused agents
Lightweight OpenCode profile for routine dev work with focused agents, local skills, and conductor-based track management. - gc-victor/supersimple…
Internals: How Spring Boot 3.3’s New AOT Compilation Reduces Startup Times by 40% for 50k LOC Apps
2 Lines of Code Saved 6.4x Memory on My Snake AI
Greetings all! In my previous post I covered Binary Plane Encoding, a 3-channel grid representation...…
I asked ChatGPT to recreate Bryan Johnson’s $2 million anti-aging routine for $20 — here’s what worked
AI turned one of the world’s most expensive anti-aging routines into a plan I could actually follow…
Ask HN: Any good ways to extend Codex sessions?
KV Cache Locality: The Hidden Variable in Your LLM Serving Cost
Every time your load balancer sends a request to the wrong GPU, that GPU recomputes a prefill it already computed somewhere else. The KV cache for that 4,000-token system prompt ex…
AI Skills as loader spec, not prompts – why the architecture changes everything
INTERNALS.md #2 · Skills are programs, not prompts. How the skills runtime actually loads, and why the architecture is everything.…
LLM Quantization
Performance Analysis of AI Query Approximation Using Lightweight Proxy Models
Several data warehouse and database providers have recently introduced extensions to SQL called AI Queries, enabling users to specify functions and conditions in SQL that are evalu…
The Descartes Systems Group Inc. (DSG:CA) Discusses AI-Driven Transformation of Last Mile Delivery and Fleet Performance Optimization Transcript
The Descartes Systems Group Inc. (DSG:CA) Discusses AI-Driven Transformation of Last Mile Delivery and Fleet Performance Optimization April 30, 2026 11:00...…
BSc CS junior looking to pivot into Operations Research / Optimization. Advice needed to pick BSc Math optionals, and MSc.
These 3 tweaks improved my thermals more than any cooler upgrade
Splurging on high-end AIOs is overrated when you have these options.…
ASP.NET Middleware: Complete Guide from Basics to Advanced Patterns, Tips, and Performance
The ASP.NET Core middleware pipeline is the backbone of every HTTP request your application...…
A Gentle Introduction to Stochastic Programming
How to make decisions when your spreadsheet is lying about the future…
Cut AI token usage by 96%?
AWS developer advocate Morgan Willis on Strands Agents, intent-based tools, MCP gateways, and how smarter tool design cut agent token usage from 52K to 2K.…
5 Project Reactor Techniques That Turned My Blocking Java Code Into High-Performance Pipelines
Master reactive Java with Project Reactor. Learn 5 techniques — Flux/Mono, backpressure, schedulers, error handling & testing — to build fast, non-blocking pipelines.…
Recursive Refinement
Reviewers default to grading tolerance. Recursive refinement keeps them asking what the original ask still requires, until the team has fully closed the gap.…
I over-engineered my simple AI backend: distillation, router, embedding etc.
I have extensively edited this article after an LLM agent combed through my codebase and prepared the initial draft.…
I asked Claude to improve my home lab and it was wild
Home lab architect powered by AI.…
We Ditched New Relic for Grafana 10 and Loki 2.9: 60% Monitoring Cost Savings
In Q3 2023, our 14-person platform team was spending $42,000 per month on New Relic for a modest...…
War Story: We Ditched Heroku for AWS EKS 1.32 and Saved 50% on Hosting
In Q3 2024, our 12-person engineering team at a Series B fintech startup ripped our production...…
PostgreSQL Is Not Slow. Your Queries Are
PostgreSQL isn’t slow, your queries are. Learn the 7 real causes of slow database performance, from missing indexes and N+1 queries to lock contention and connection overload, with…
AMD Posts Newest Linux Patches To Accelerate Page Migration For More Performance
Posted to the Linux kernel mailing list this week was the newest revision of a patch series originally started in early 2025 by an NVIDIA engineer for accelerating page migration…
Android 17 might finally fix the app problem holding Android tablets and foldables back
Google asked nicely, now it is less polite…
Cache Stampede Prevention
When Your Cache Chokes: Taming the Cache Stampede. Ever felt that exhilarating rush when...…
Ask HN: What did you streamline with AI agents?
Satya Nadella wants Windows to use less RAM as part of a bigger push to win back consumers
It's part of a huge drive across all of Microsoft's brands to regain trust.…
Pre-made Pouch Packaging Market to Reach USD 24.6 Billion by 2036, Driven by Efficiency, Retail Optimization, and Sustainable Packaging Shift - Morningstar
War Story: We Migrated from Socket.io 4.7 to Ably 2.0 and Cut Chat Latency 40% for Global Users
In Q3 2024, our team at a Series C messaging startup serving 1.2M monthly active users across 42...…
Deep Dive: How JetBrains Fleet Indexes 1M Line Codebases with Rust 1.85 and Kotlin 2.0
Indexing a 1,000,000-line Java codebase in under 800ms with 40% lower memory overhead than...…
Cloudless-Training: A Framework to Improve Efficiency of Geo-Distributed ML Training
Geo-distributed ML training can benefit many emerging ML scenarios (e.g., large model training, federated learning) with multi-regional cloud resources and wide area network. Howev…
RedParrot: Accelerating NL-to-DSL for Business Analytics via Query Semantic Caching
Recently, at Xiaohongshu, the rapid expansion of e-commerce and advertising demands real-time business analytics with high accuracy and low latency. To meet this demand, systems ty…
Stochastic KV Routing: Enabling Adaptive Depth-Wise Cache Sharing
Serving transformer language models with high throughput requires caching Key-Values (KVs) to avoid redundant computation during autoregressive generation. The memory footprint of …
Neural Network Optimization Reimagined: Decoupled Techniques for Scratch and Fine-Tuning
With the accumulation of resources in the era of big data and the rise of pre-trained models in deep learning, optimizing neural networks for various tasks often involves different…
SwarmDrive: Semantic V2V Coordination for Latency-Constrained Cooperative Autonomous Driving
Cloud-hosted LLM inference for autonomous driving adds round-trip delay and depends on stable connectivity, while purely local edge models struggle under occlusion. We present Swar…
Indie Dev ASO Complete Guide — Climbing App Store Rankings with Optimization
Building a...…
Common Salesforce admin mistakes that cost companies money
As a Salesforce admin who’s managed orgs for Fortune 500 companies in healthcare, finance, and...…
vLLM-Compile: Bringing Compiler Optimizations to LLM Inference
Luka Govedič, vLLM Committer and Senior Machine Learning Engineer, Red Hat…
We decreased our LLM costs with Opus
We switched to a frontier model and our costs went down. Here's the architecture that made it possible.…
Your AI agent wastes 13,000 tokens before saying "hello"
If you have an agent with 50 MCP tools installed, you're spending up to 13,000 tokens on the catalog alone — before processing any user message. Introducing TTC, a TERSE Format ext…
Your AI agent is wasting 13,000 tokens before saying "hi"
If you have an agent with 50 MCP tools installed, you are spending up to 13,000 tokens on the catalog alone, before processing any message. I present TTC, an extension of the TERSE For…
Synthetic.new – Limits and Pricing
Subscription pricing: the Synthetic subscription (not to be confused with usage-based pricing) works in terms of “packs”. A base subscription is $3…
What I learned shipping a 5-day auction marketplace in 30 days (Cloudflare Pages + Supabase)
Notes-from-the-field on shipping ExitBid (auction marketplace for online businesses) — RLS-first architecture, SECURITY DEFINER RPCs, denormalized counters via triggers, realtime p…
How well does S3 checkpointing hold up when running Airflow on spot?
This article explores what actually happens when Apache Airflow runs on spot instances, using real experiments to simulate node preemption across both control plane and worker node…
Intel says software, not more cache, is key to beating AMD in gaming
Speaking to the German media outlet PC Games Hardware about Intel's plans to compete with AMD's X3D line of gaming CPUs, Vice President Robert Hallock said that...…
Release PiClaw v2.0.4 – Chapek 9 · rcarmo/piclaw
PiClaw v2.0.4 — "Chapek 9" Welcome to the planet of the robots. Present your identification or be destroyed. The settings dialog will open in approximately 47 milliseconds. Feature…
Can agents replace the search stack?
Instead of deploying the traditional query understanding + reranking combo, can we let an agent do all the work?…