24 results for "gemini"
KIOKU v0.7.0: completing multi-agent — Gemini and Codex CLI now get automatic session logging
v0.6 made KIOKU's skills agent-portable. v0.7 ports the hook layer too — Gemini CLI and Codex CLI now write to the same session-logs/ pipeline as Claude Code, with masking and security boundaries pres…
I just got Gemini on my Google Home and Nest speakers, and I’d like a refund
Gemini just rolled out to my Google Home and Nest speakers, and the experience has been slow, unpredictable, and frustrating. Here's what's wrong with it.…
Gemini could soon show you exactly how much AI you use
Google is working on a Gemini usage dashboard to help users track their AI consumption and quota resets. There are also new icons.…
I switched from Gemini to Claude and it’s a mixed bag
Claude isn't better or worse than Gemini. It's just different. Different enough that I can't switch fully, so I'll just use both.…
Google could soon say goodbye to current Gemini voices (APK teardown)
Like Gemini's current selection of voices? There's bad news as an Android Authority teardown has revealed that they'll be discontinued.…
Here’s your first look at Google’s upcoming ‘Proactive Assistance’ feature for Gemini
Google is developing a new Proactive Assistance feature for Gemini to delivers timely, context-aware suggestions even before you ask.…
E.U. is pushing Google to give rival AI services the same Android access as Gemini
Home Assistant's local LLM support outperforms Gemini for Home, and Google knows it
Search Live gets ready to follow Gemini with a colorful visual update
Experimenting with the latest beta version of the Google app, we were able to see a refreshed Search Live interface.…
'Just Adds More Complexity' — Gemini’s AI Trading Launch Triggers Warnings Across Crypto Community
I'm finally the 'organized' friend, thanks to a little help from Gemini
I have become organized in the most unexpected way…
Google Gemini wants to become a more proactive assistant
An assistant that acts before you ask…
Show HN: OSS Agent I built topped the TerminalBench on Gemini-3-flash-preview
Scored 65.2% vs google's official 47.8%, and the existing top closed source model Junie CLI's 64.3%. Since there are a lot of reports of deliberate cheating on TerminalBench 2.0 lately ( ), I would li…
I asked AI to book dinner. It made me want to use the app instead
ChatGPT, Claude, and Gemini may be aces at coding, but they’re less than magical when it comes to booking a table for three.…
EU tells Google to open up AI on Android; Google says "unwarranted intervention"
Gemini gets preferential treatment on Android, but maybe not for long (in Europe).…
Don't Make the LLM Read the Graph: Make the Graph Think
We investigate whether explicit belief graphs improve LLM performance in cooperative multi-agent reasoning. Through 3,000+ controlled trials across four LLM families in the cooperative card game Hanab…
StoryTR: Narrative-Centric Video Temporal Retrieval with Theory of Mind Reasoning
Current video moment retrieval excels at action-centric tasks but struggles with narrative content. Models can see \textit{what is happening} but fail to reason \textit{why it matters}. This semantic …
GSAR: Typed Grounding for Hallucination Detection and Recovery in Multi-Agent LLMs
Autonomous multi-agent LLM systems are increasingly deployed to investigate operational incidents and produce structured diagnostic reports. Their trustworthiness hinges on whether each claim is groun…
Beyond the Attention Stability Boundary: Agentic Self-Synthesizing Reasoning Protocols
As LLM agents transition to autonomous digital coworkers, maintaining deterministic goal-directedness in non-linear multi-turn conversations emerged as an architectural bottleneck. We identify and for…
A systematic evaluation of vision-language models for observational astronomical reasoning tasks
Vision-language models (VLMs) are increasingly proposed as general-purpose tools for scientific data interpretation, yet their reliability on real astronomical observations across diverse modalities r…
Thoughts on using an AMD Alveo V80 FPGA PCI card as a poor man’s Taalas HC1 (LLM-burned-onto-a-chip).
TL:DR - Remembered FPGA PCI boards being a big thing from my crypto days. Wondered if AMD Alveo V80 FPGA card could be used to approximate the performance of a Taalas HC1 (LLM-on-a-chip). Ran the idea…
Speculative decoding with Gemma-4-31B + Gemma-4-E2B enables 120 - 200 tok/s output speed for specific tasks
So for my project I was using up until now either Gemini 3 / 2.5 Flash or Flash-lite. All my use cases are not agentic, simply LLM workflows for atomic tasks like extracting references from the law, c…
Claude 4.7 named a journalist from 125 words of unpublished writing
Surprised this isn't a bigger topic but you tell me! In short: writer Kelsey Piper pasted 125 words of an unpublished political column into 4.7 and got her own name back. She'd logged out, run it via …
Decreased Intelligence Density in DeepSeek V4 Pro
In the V3.2 paper, they mentioned: Second, token efficiency remains a challenge; DeepSeek-V3.2 typically requires longer generation trajectories (i.e., more tokens) to match the output quality of mode…