:: Library Catalog

Bibliographic Details
Main Authors:	Lee, Kuang-Huei; Fischer, Ian; Wu, Yueh-Hua; Marwood, Dave; Baluja, Shumeet; Schuurmans, Dale; Chen, Xinyun
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2501.09891

Similar Items

Making Images from Images: Interleaving Denoising and Transformation
by: Baluja, Shumeet, et al.
Published: (2024)

A Human-Inspired Reading Agent with Gist Memory of Very Long Contexts
by: Lee, Kuang-Huei, et al.
Published: (2024)

Large Language Models can Learn Rules
by: Zhu, Zhaocheng, et al.
Published: (2023)

ResNets Are Deeper Than You Think
by: Mehmeti-Göpel, Christian H. X. Ali, et al.
Published: (2025)

Thinking Deeper, Not Longer: Depth-Recurrent Transformers for Compositional Generalization
by: Chen, Hung-Hsuan
Published: (2026)

Improving Large Language Model Planning with Action Sequence Similarity
by: Zhao, Xinran, et al.
Published: (2025)

Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence
by: McLeish, Sean, et al.
Published: (2025)

Spectral Representation-based Reinforcement Learning
by: Gao, Chenxiao, et al.
Published: (2025)

Soft Preference Optimization: Aligning Language Models to Expert Distributions
by: Sharifnassab, Arsalan, et al.
Published: (2024)

Provable Representation with Efficient Planning for Partial Observable Reinforcement Learning
by: Zhang, Hongming, et al.
Published: (2023)

The World Is Bigger! A Computationally-Embedded Perspective on the Big World Hypothesis
by: Lewandowski, Alex, et al.
Published: (2025)

Do Agents Think Deeper? A Mechanistic Investigation of Layer-Wise Dynamics in Sequential Planning
by: Cui, Zhenyu, et al.
Published: (2026)

Training-free Diffusion Model Alignment with Sampling Demons
by: Yeh, Po-Hung, et al.
Published: (2024)

Watch Wider and Think Deeper: Collaborative Cross-modal Chain-of-Thought for Complex Visual Reasoning
by: Lu, Wenting, et al.
Published: (2026)

See Further, Think Deeper: Advancing VLM's Reasoning Ability with Low-level Visual Cues and Reflection
by: Wu, Zhiheng, et al.
Published: (2026)

Longer Context, Deeper Thinking: Uncovering the Role of Long-Context Ability in Reasoning
by: Yang, Wang, et al.
Published: (2025)

ThinkAct: Vision-Language-Action Reasoning via Reinforced Visual Latent Planning
by: Huang, Chi-Pin, et al.
Published: (2025)

ThinkRec: Thinking-based recommendation via LLM
by: Yu, Qihang, et al.
Published: (2025)

Continuous Reasoning for Vision-Language-Action
by: Wu, Yueh-Hua, et al.
Published: (2026)

Learning Interactive Real-World Simulators
by: Yang, Sherry, et al.
Published: (2023)

Beyond Expectations: Learning with Stochastic Dominance Made Practical
by: Cen, Shicong, et al.
Published: (2024)

Wider or Deeper? Scaling LLM Inference-Time Compute with Adaptive Branching Tree Search
by: Inoue, Yuichi, et al.
Published: (2025)

DNAct: Diffusion Guided Multi-Task 3D Policy Learning
by: Yan, Ge, et al.
Published: (2024)

Optimize Wider, Not Deeper: Consensus Aggregation for Policy Optimization
by: Su, Zelal, et al.
Published: (2026)

Reversible Diffusion Decoding for Diffusion Language Models
by: Wang, Xinyun, et al.
Published: (2026)

PPO-Clip Attains Global Optimality: Towards Deeper Understandings of Clipping
by: Huang, Nai-Chieh, et al.
Published: (2023)

SmartSwitch: Advancing LLM Reasoning by Overcoming Underthinking via Promoting Deeper Thought Exploration
by: Zhang, Xichen, et al.
Published: (2025)

Unlocking Exploration in RLVR: Uncertainty-aware Advantage Shaping for Deeper Reasoning
by: Xie, Can, et al.
Published: (2025)

Fast Think-on-Graph: Wider, Deeper and Faster Reasoning of Large Language Model on Knowledge Graph
by: Liang, Xujian, et al.
Published: (2025)

Integrating Artificial Intelligence with Human Expertise: An In-depth Analysis of ChatGPT's Capabilities in Generating Metamorphic Relations
by: Zhang, Yifan, et al.
Published: (2025)

Scalable Diffusion for Materials Generation
by: Yang, Sherry, et al.
Published: (2023)

Flow-of-Options: Diversified and Improved LLM Reasoning by Thinking Through Options
by: Nair, Lakshmi, et al.
Published: (2025)

Deeper Insights into Learning Performance of Stochastic Configuration Networks
by: Yan, Xiufeng, et al.
Published: (2024)

EvolveR: Self-Evolving LLM Agents through an Experience-Driven Lifecycle
by: Wu, Rong, et al.
Published: (2025)

Training Agents to Self-Report Misbehavior
by: Lee, Bruce W., et al.
Published: (2026)

Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model
by: Ling Team, et al.
Published: (2025)

Self-Evolving Curriculum for LLM Reasoning
by: Chen, Xiaoyin, et al.
Published: (2025)

Multimodal Web Navigation with Instruction-Finetuned Foundation Models
by: Furuta, Hiroki, et al.
Published: (2023)

Improving Dynamic Object Interactions in Text-to-Video Generation with AI Feedback
by: Furuta, Hiroki, et al.
Published: (2024)

HyperTree Planning: Enhancing LLM Reasoning via Hierarchical Thinking
by: Gui, Runquan, et al.
Published: (2025)