Show HN: Cursed Browser – a VLM reads the HTML and hallucinates the page
Cursed Browser is a rendering engine that hands a page's HTML to a visual LLM (VLM) and has the model draw what it thinks the page looks like. Because the output is generated by the model, each page load produces a different rendering, which the project describes as a work of art. The project is open source and aims to break away from legacy web dependencies.
- Cursed Browser uses a visual LLM to interpret HTML and generate a visual rendering of the page.
- Each page load produces a unique rendering, which the project describes as a work of art.
- The current version is open source and aims to break away from legacy web dependencies.
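The "HTML in, picture out" idea can be sketched in a few lines. This is a minimal illustration, not the project's code: `ImageModel`, `build_prompt`, `render_page`, `fake_model`, and the `MAX_HTML_CHARS` budget are all hypothetical names chosen for this example, and the model is a stand-in so the sketch runs without any external service.

```python
from typing import Callable

# Hypothetical type: any function that maps a text prompt to image bytes.
# A real setup would wrap whatever model API is in use; none is assumed here.
ImageModel = Callable[[str], bytes]

# Assumed character budget for the HTML passed to the model (not from the project).
MAX_HTML_CHARS = 20_000


def build_prompt(html: str, max_chars: int = MAX_HTML_CHARS) -> str:
    """Wrap the (possibly truncated) HTML in an instruction for the model."""
    snippet = html[:max_chars]
    return (
        "Here is the HTML source of a web page. "
        "Draw what you think a browser would show.\n\n"
        + snippet
    )


def render_page(html: str, model: ImageModel) -> bytes:
    """Return whatever image bytes the model produces for this HTML."""
    return model(build_prompt(html))


def fake_model(prompt: str) -> bytes:
    """Stand-in for a real model: returns deterministic placeholder bytes."""
    return f"<image for {len(prompt)} chars of prompt>".encode("utf-8")


if __name__ == "__main__":
    picture = render_page("<h1>Hello</h1><p>World</p>", fake_model)
    print(len(picture), "bytes of placeholder image data")
```

Passing the model in as a plain callable keeps the sketch independent of any particular provider. With a real generative model in place of `fake_model`, two loads of the same HTML would generally not return the same bytes, which is the behaviour the summary describes.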
Opening excerpt (first ~120 words)
Cursed Browser: Rendering Engine using Visual-LLMs

Cursed Browser asks an LLM to look at the page's HTML and draw what it thinks it looks like. Every page load is a surprise. Every render is a work of art. It's better than correct, it's AI Native.

Examples: Cursed vs Safari

Compared to other "AI Native" browsers

| Feature | Arc | Dia | Comet | Atlas | Cursed |
| --- | --- | --- | --- | --- | --- |
| HTML parsed by an LLM token-by-token | ❌ | ❌ | ❌ | ❌ | ✅ |
| CSS interpreted via next-token prediction | ❌ | ❌ | ❌ | ❌ | ✅ |
| Pixels hallucinated by a VLM | ❌ | ❌ | ❌ | ❌ | ✅ |
| All hyphens upgraded to em—dashes | ❌ | ❌ | ❌ | ❌ | ✅ |

Roadmap

V1: An LLM looks at HTML and draws what it thinks a browser would show. Technically a browser. Legally, probably also a browser. Morally, questionable.
…
Excerpt limited to ~120 words for fair-use compliance. The full article is at GitHub.