Saved in:
| Main Authors: | Lee, Kuang-Huei, Fischer, Ian, Wu, Yueh-Hua, Marwood, Dave, Baluja, Shumeet, Schuurmans, Dale, Chen, Xinyun |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2501.09891 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Making Images from Images: Interleaving Denoising and Transformation
by: Baluja, Shumeet, et al.
Published: (2024)
by: Baluja, Shumeet, et al.
Published: (2024)
A Human-Inspired Reading Agent with Gist Memory of Very Long Contexts
by: Lee, Kuang-Huei, et al.
Published: (2024)
by: Lee, Kuang-Huei, et al.
Published: (2024)
Large Language Models can Learn Rules
by: Zhu, Zhaocheng, et al.
Published: (2023)
by: Zhu, Zhaocheng, et al.
Published: (2023)
ResNets Are Deeper Than You Think
by: Mehmeti-Göpel, Christian H. X. Ali, et al.
Published: (2025)
by: Mehmeti-Göpel, Christian H. X. Ali, et al.
Published: (2025)
Thinking Deeper, Not Longer: Depth-Recurrent Transformers for Compositional Generalization
by: Chen, Hung-Hsuan
Published: (2026)
by: Chen, Hung-Hsuan
Published: (2026)
Improving Large Language Model Planning with Action Sequence Similarity
by: Zhao, Xinran, et al.
Published: (2025)
by: Zhao, Xinran, et al.
Published: (2025)
Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence
by: McLeish, Sean, et al.
Published: (2025)
by: McLeish, Sean, et al.
Published: (2025)
Spectral Representation-based Reinforcement Learning
by: Gao, Chenxiao, et al.
Published: (2025)
by: Gao, Chenxiao, et al.
Published: (2025)
Soft Preference Optimization: Aligning Language Models to Expert Distributions
by: Sharifnassab, Arsalan, et al.
Published: (2024)
by: Sharifnassab, Arsalan, et al.
Published: (2024)
Provable Representation with Efficient Planning for Partial Observable Reinforcement Learning
by: Zhang, Hongming, et al.
Published: (2023)
by: Zhang, Hongming, et al.
Published: (2023)
The World Is Bigger! A Computationally-Embedded Perspective on the Big World Hypothesis
by: Lewandowski, Alex, et al.
Published: (2025)
by: Lewandowski, Alex, et al.
Published: (2025)
Do Agents Think Deeper? A Mechanistic Investigation of Layer-Wise Dynamics in Sequential Planning
by: Cui, Zhenyu, et al.
Published: (2026)
by: Cui, Zhenyu, et al.
Published: (2026)
Training-free Diffusion Model Alignment with Sampling Demons
by: Yeh, Po-Hung, et al.
Published: (2024)
by: Yeh, Po-Hung, et al.
Published: (2024)
Watch Wider and Think Deeper: Collaborative Cross-modal Chain-of-Thought for Complex Visual Reasoning
by: Lu, Wenting, et al.
Published: (2026)
by: Lu, Wenting, et al.
Published: (2026)
See Further, Think Deeper: Advancing VLM's Reasoning Ability with Low-level Visual Cues and Reflection
by: Wu, Zhiheng, et al.
Published: (2026)
by: Wu, Zhiheng, et al.
Published: (2026)
Longer Context, Deeper Thinking: Uncovering the Role of Long-Context Ability in Reasoning
by: Yang, Wang, et al.
Published: (2025)
by: Yang, Wang, et al.
Published: (2025)
ThinkAct: Vision-Language-Action Reasoning via Reinforced Visual Latent Planning
by: Huang, Chi-Pin, et al.
Published: (2025)
by: Huang, Chi-Pin, et al.
Published: (2025)
ThinkRec: Thinking-based recommendation via LLM
by: Yu, Qihang, et al.
Published: (2025)
by: Yu, Qihang, et al.
Published: (2025)
Continuous Reasoning for Vision-Language-Action
by: Wu, Yueh-Hua, et al.
Published: (2026)
by: Wu, Yueh-Hua, et al.
Published: (2026)
Learning Interactive Real-World Simulators
by: Yang, Sherry, et al.
Published: (2023)
by: Yang, Sherry, et al.
Published: (2023)
Beyond Expectations: Learning with Stochastic Dominance Made Practical
by: Cen, Shicong, et al.
Published: (2024)
by: Cen, Shicong, et al.
Published: (2024)
Wider or Deeper? Scaling LLM Inference-Time Compute with Adaptive Branching Tree Search
by: Inoue, Yuichi, et al.
Published: (2025)
by: Inoue, Yuichi, et al.
Published: (2025)
DNAct: Diffusion Guided Multi-Task 3D Policy Learning
by: Yan, Ge, et al.
Published: (2024)
by: Yan, Ge, et al.
Published: (2024)
Optimize Wider, Not Deeper: Consensus Aggregation for Policy Optimization
by: Su, Zelal, et al.
Published: (2026)
by: Su, Zelal, et al.
Published: (2026)
Reversible Diffusion Decoding for Diffusion Language Models
by: Wang, Xinyun, et al.
Published: (2026)
by: Wang, Xinyun, et al.
Published: (2026)
PPO-Clip Attains Global Optimality: Towards Deeper Understandings of Clipping
by: Huang, Nai-Chieh, et al.
Published: (2023)
by: Huang, Nai-Chieh, et al.
Published: (2023)
SmartSwitch: Advancing LLM Reasoning by Overcoming Underthinking via Promoting Deeper Thought Exploration
by: Zhang, Xichen, et al.
Published: (2025)
by: Zhang, Xichen, et al.
Published: (2025)
Unlocking Exploration in RLVR: Uncertainty-aware Advantage Shaping for Deeper Reasoning
by: Xie, Can, et al.
Published: (2025)
by: Xie, Can, et al.
Published: (2025)
Fast Think-on-Graph: Wider, Deeper and Faster Reasoning of Large Language Model on Knowledge Graph
by: Liang, Xujian, et al.
Published: (2025)
by: Liang, Xujian, et al.
Published: (2025)
Integrating Artificial Intelligence with Human Expertise: An In-depth Analysis of ChatGPT's Capabilities in Generating Metamorphic Relations
by: Zhang, Yifan, et al.
Published: (2025)
by: Zhang, Yifan, et al.
Published: (2025)
Scalable Diffusion for Materials Generation
by: Yang, Sherry, et al.
Published: (2023)
by: Yang, Sherry, et al.
Published: (2023)
Flow-of-Options: Diversified and Improved LLM Reasoning by Thinking Through Options
by: Nair, Lakshmi, et al.
Published: (2025)
by: Nair, Lakshmi, et al.
Published: (2025)
Deeper Insights into Learning Performance of Stochastic Configuration Networks
by: Yan, Xiufeng, et al.
Published: (2024)
by: Yan, Xiufeng, et al.
Published: (2024)
EvolveR: Self-Evolving LLM Agents through an Experience-Driven Lifecycle
by: Wu, Rong, et al.
Published: (2025)
by: Wu, Rong, et al.
Published: (2025)
Training Agents to Self-Report Misbehavior
by: Lee, Bruce W., et al.
Published: (2026)
by: Lee, Bruce W., et al.
Published: (2026)
Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model
by: Ling Team, et al.
Published: (2025)
by: Ling Team, et al.
Published: (2025)
Self-Evolving Curriculum for LLM Reasoning
by: Chen, Xiaoyin, et al.
Published: (2025)
by: Chen, Xiaoyin, et al.
Published: (2025)
Multimodal Web Navigation with Instruction-Finetuned Foundation Models
by: Furuta, Hiroki, et al.
Published: (2023)
by: Furuta, Hiroki, et al.
Published: (2023)
Improving Dynamic Object Interactions in Text-to-Video Generation with AI Feedback
by: Furuta, Hiroki, et al.
Published: (2024)
by: Furuta, Hiroki, et al.
Published: (2024)
HyperTree Planning: Enhancing LLM Reasoning via Hierarchical Thinking
by: Gui, Runquan, et al.
Published: (2025)
by: Gui, Runquan, et al.
Published: (2025)
Similar Items
-
Making Images from Images: Interleaving Denoising and Transformation
by: Baluja, Shumeet, et al.
Published: (2024) -
A Human-Inspired Reading Agent with Gist Memory of Very Long Contexts
by: Lee, Kuang-Huei, et al.
Published: (2024) -
Large Language Models can Learn Rules
by: Zhu, Zhaocheng, et al.
Published: (2023) -
ResNets Are Deeper Than You Think
by: Mehmeti-Göpel, Christian H. X. Ali, et al.
Published: (2025) -
Thinking Deeper, Not Longer: Depth-Recurrent Transformers for Compositional Generalization
by: Chen, Hung-Hsuan
Published: (2026)