PageGuide – a browser agent that grounds AI directly on the webpage
PageGuide is a browser extension that grounds LLM answers directly in the HTML DOM, helping users find information, complete multi-step tasks, and filter distractions — with full in-situ verification.
Opening excerpt (first ~120 words) tap to expand
Users browsing the web daily struggle to locate relevant information on cluttered pages, complete unfamiliar multi-step tasks, and stay focused amid distracting content. State-of-the-art AI assistants and browser agents return answers without showing where information comes from, forcing users to manually verify results and blindly trust every automated step. We present 🍊 PageGuide, a browser extension that grounds LLM answers directly in the HTML DOM via visual overlays, addressing three core user needs: Find — locating and highlighting relevant evidence in-situ so users can instantly verify answers on the page; Guide — showing step-by-step instructions one at a time so users can follow and perform actions by themselves; Hide — hiding distracting content with a per-element justification…
Excerpt limited to ~120 words for fair-use compliance. The full article is at Github.