WeSearch

ThoughtFold: Folding Reasoning Chains via Introspective Preference Learning

·3 min read · 0 reactions · 0 comments · 11 views
#artificial intelligence#machine learning#reasoning
ThoughtFold: Folding Reasoning Chains via Introspective Preference Learning
⚡ TL;DR · AI summary

The paper introduces ThoughtFold, a framework designed to improve the efficiency of Large Reasoning Models (LRMs) by addressing redundant explorations in long reasoning chains. By employing introspective preference learning, ThoughtFold penalizes unnecessary steps and encourages more concise reasoning paths. Experimental results indicate that this approach significantly reduces token usage while maintaining high accuracy.

Key facts
Original article
arXiv cs.AI
Read full at arXiv cs.AI →
Opening excerpt (first ~120 words) tap to expand

Computer Science > Artificial Intelligence arXiv:2606.03503 (cs) [Submitted on 2 Jun 2026] Title:ThoughtFold: Folding Reasoning Chains via Introspective Preference Learning Authors:Ziyan Liu, Xueda Shen, Yuzhe Gu, Songyang Gao, Kuikun Liu, Guangran Cheng, Chengqi Lyu, Dahua Lin, Wenwei Zhang, Kai Chen View a PDF of the paper titled ThoughtFold: Folding Reasoning Chains via Introspective Preference Learning, by Ziyan Liu and 9 other authors View PDF Abstract:Large Reasoning Models (LRMs) have achieved remarkable progress thanks to Reinforcement Learning with Verifiable Rewards (RLVR) on Chain-of-Thoughts (CoTs).

Excerpt limited to ~120 words for fair-use compliance. The full article is at arXiv cs.AI.

Anonymous · no account needed
Share 𝕏 Facebook Reddit LinkedIn Threads WhatsApp Bluesky Mastodon Email

Discussion

0 comments

More from arXiv cs.AI