25 stories tagged with #ai-inference, in publish-time order across the WeSearch catalog. Tag pages update as new stories ingest.
⌘ RSS feed for this tag → or search "Ai Inference"
Show HN: Hive Trust – Ed25519-signed benchmarks for every AI inference primitive
Hive primitives benchmarked against published SOTA adversaries. Every result is a signed Ed25519 receipt from hivemorph — queryable, tamper-evident, reproducible.…
FingerMotion shares rise on entry into edge AI inference computing market
With Nvidia Groq 3, the Era of AI Inference Is (Probably) Here (⌛ March 2026)
What makes Nvidia's new Groq 3 LPU chip a must-watch in the AI world?…
Silicon Motion new SM2524XT PCIe 5 controller achieves 14GB/s read and 12GB/s write speeds with up to 2.5 million IOPS and up to 25% higher performance-per-watt, designed for AI inference
Sources: ByteDance has partnered with chipmaker InnoStar to develop an AI inference chip modeled after Groq's LPUs, which are built to run AI models at low cost (The Information)
Argonne flexes spare supercompute to build private AI inference service
Think ChatDoE…
Imece – Distributed AI inference using volunteer GPUs and FLOP token
A decentralized AI compute cooperative where contributors earn inference credits by donating idle GPU/CPU time — measured in FLOPs, not crypto. - aslankose/imece…
I Squared Capital buys $225M data center portfolio from Cogent Fiber to build AI inference platform
I Squared Capital acquires 10 data center facilities from Cogent Fiber for $225M, committing up to $1B to build a US platform focused on AI inference workloads.…
Source: AI inference provider Baseten is in talks to raise $1B at a post-money valuation of $11B, up from $5B after its $300M Series E announced in January (The Information)
Show HN: MurrDB: A RocksDB-based NVMe/S3 cache for AI inference workloads
I Squared bets on AI inference with $225 million data center buy from Cogent
Is AI inference platform really that saturated now? [D]
What is a good setup for a beginner's homelab "server" that just runs plex + some AI inference stuff?
This Artificial Intelligence (AI) Stock Will Beat Nvidia, AMD, Broadcom, and Intel to Become the Biggest Winner in AI Inference
AMD CEO Lisa Su projects the CPU market will grow over 35% annually through 2031, up from 3% to 4% historically, driven by AI inference and agentic AI demand (Cheng Ting-Fang/Nikkei Asia)
Cheng Ting-Fang / Nikkei Asia : AMD CEO Lisa Su projects the CPU market will grow over 35% annually through 2031, up from 3% to 4% historically, driven by AI inference and agentic …
Modal Labs, which offers a serverless cloud platform to build AI apps and run AI inference, raised a $355M Series C at a $4.65B valuation, up from $1.1B in 2025 (Deepa Seetharaman/Reuters)
Powering the AI inference boom: Is it time to downsize the data centre?
AI Inference Costs: The Wake-Up Call for 2026 and 2027
These Super Stocks Could Be the Biggest Winners in the AI Inference and Agentic AI Economy
The AI Inference Supercycle Is Here. These 2 Stocks Will Be the Biggest Winners of This Megatrend (Hint: It's Not Broadcom or Intel)
Apple Silicon costs more than OpenRouter
Local LLMs can be very very cheap…
How to Achieve Truly Serverless GPUs
A deep dive on Modal's deep tech for fast boots.…
A Developer's Guide to AI Inference Costs in 2026
GPU rental, API pricing, and the infrastructure math that determines whether your AI feature makes money.…
Comparison: vLLM 0.6 vs. Text Generation Inference 1.4 for Serving Code LLMs
Serving code LLMs at production scale is 3.2x more expensive than general-purpose LLMs when using...…