Internalizing the Future: A Unified Agentic Training Paradigm for World Model Planning

Jun 29, 2026 · 4:00 AM UTC ·3 min read · 0 reactions · 0 comments · 8 views

arXiv:2606.27483v1 Announce Type: new Abstract: Large language model (LLM) agents have demonstrated strong capability in sequential decision-making, yet they remains fundamentally reactive in long-horizon tasks. Unlike humans who employ "what-if" reasoning to evaluate potential plans before commitment, standard agents lack an internal world model to simulate future outcomes. Therefore, we propose to internalize future-aware planning by training a single autoregressive model to verbalize both a p

Original article

arXiv.org

Read full at arXiv.org →

Opening excerpt (first ~120 words) tap to expand

Computer Science > Artificial Intelligence arXiv:2606.27483 (cs) [Submitted on 25 Jun 2026] Title:Internalizing the Future: A Unified Agentic Training Paradigm for World Model Planning Authors:Xuan Zhang, Zhijian Zhou, Lingfeng Qiao, Yulei Qin, Ke Li, Xing Sun, Xiaoyu Tan, Chao Qu, Yuan Qi View a PDF of the paper titled Internalizing the Future: A Unified Agentic Training Paradigm for World Model Planning, by Xuan Zhang and 8 other authors View PDF Abstract:Large language model (LLM) agents have demonstrated strong capability in sequential decision-making, yet they remains fundamentally reactive in long-horizon tasks. Unlike humans who employ "what-if" reasoning to evaluate potential plans before commitment, standard agents lack an internal world model to simulate future outcomes.

…

Excerpt limited to ~120 words for fair-use compliance. The full article is at arXiv.org.

Anonymous · no account needed

Discussion

0 comments

Internalizing the Future: A Unified Agentic Training Paradigm for World Model Planning

Discussion

More from arXiv.org