An LLM verifier rated math proofs near-perfect; an expert found 17% correct

Jun 26, 2026 · 6:09 AM UTC ·5 min read · 0 reactions · 0 comments · 1 view

Two posts ago I quoted a warning: an AI will find it easier to convince you it has a proof than to write one. A middling new paper finally put a number on that gap — 0.99 against 0.55.

Original article

Korbonits

Read full at Korbonits →

Opening excerpt (first ~120 words) tap to expand

Jun 12, 2026 · 6 min read Easier to Convince Than to Prove aimathematicsverification Contents Easier to Convince Than to Prove The paper, in its place The number Why it doesn’t escape Footnotes Easier to Convince Than to Prove Three weeks ago I ended a post on a sentence from Melanie Wood: “in many cases, it will be easier for AI to convince humans it has a proof than to come up with a correct mathematical argument, and I believe that we as mathematicians are not sufficiently prepared for this.” I called it the dark twin of the verification story. I’ve now spent two posts on it — once as Wood’s forecast, once as DeepMind’s failure analysis, the agent burying a problem’s difficulty inside a sorry or citing a lemma it had hallucinated. The gap between convincing and correct can be sized.

…

Excerpt limited to ~120 words for fair-use compliance. The full article is at Korbonits.

Anonymous · no account needed

Discussion

0 comments

An LLM verifier rated math proofs near-perfect; an expert found 17% correct

Discussion

More from Korbonits