| Main authors: | Zhu, Hanlin; Hao, Shibo; Hu, Zhiting; Jiao, Jiantao; Russell, Stuart; Tian, Yuandong |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online access: | https://arxiv.org/abs/2505.12514 |
Similar items
Emergence of Superposition: Unveiling the Training Dynamics of Chain of Continuous Thought
by: Zhu, Hanlin, et al.
Published: (2025)
Towards a Theoretical Understanding of the 'Reversal Curse' via Training Dynamics
by: Zhu, Hanlin, et al.
Published: (2024)
Transformers Provably Learn to Internalize Chain-of-Thought
by: Huang, Yixiao, et al.
Published: (2026)
Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning
by: Su, DiJia, et al.
Published: (2025)
GSM-Agent: Understanding Agentic Reasoning Using Controllable Environments
by: Zhu, Hanlin, et al.
Published: (2025)
Generalization or Hallucination? Understanding Out-of-Context Reasoning in Transformers
by: Huang, Yixiao, et al.
Published: (2025)
Efficient Prompt Caching via Embedding Similarity
by: Zhu, Hanlin, et al.
Published: (2024)
On Representation Complexity of Model-based and Model-free Reinforcement Learning
by: Zhu, Hanlin, et al.
Published: (2023)
ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings
by: Hao, Shibo, et al.
Published: (2023)
Avoiding Catastrophe in Online Learning by Asking for Help
by: Plaut, Benjamin, et al.
Published: (2024)
On the Cost and Benefit of Chain of Thought: A Learning-Theoretic Perspective
by: Zhang, Yue, et al.
Published: (2026)
Continuous Chain of Thought Enables Parallel Exploration and Reasoning
by: Gozeten, Halil Alperen, et al.
Published: (2025)
Training Large Language Models to Reason in a Continuous Latent Space
by: Hao, Shibo, et al.
Published: (2024)
Composing Global Solutions to Reasoning Tasks via Algebraic Objects in Neural Nets
by: Tian, Yuandong
Published: (2024)
Safe Learning Under Irreversible Dynamics via Asking for Help
by: Plaut, Benjamin, et al.
Published: (2025)
LLM Pretraining with Continuous Concepts
by: Tack, Jihoon, et al.
Published: (2025)
A Theoretical Understanding of Chain-of-Thought: Coherent Reasoning and Error-Aware Demonstration
by: Cui, Yingqian, et al.
Published: (2024)
Deep Thinking by Markov Chain of Continuous Thoughts
by: Liu, Jiayu, et al.
Published: (2025)
Provable Scaling Laws of Feature Emergence from Learning Dynamics of Grokking
by: Tian, Yuandong
Published: (2025)
Bridging Formal Language with Chain-of-Thought Reasoning to Geometry Problem Solving
by: Yang, Tianyun, et al.
Published: (2025)
Towards Efficient Large Language Reasoning Models via Extreme-Ratio Chain-of-Thought Compression
by: Tang, Yuntian, et al.
Published: (2026)
Rethinking Chain-of-Thought Reasoning for Videos
by: Zhong, Yiwu, et al.
Published: (2025)
CTRLS: Chain-of-Thought Reasoning via Latent State-Transition
by: Wu, Junda, et al.
Published: (2025)
Fractured Chain-of-Thought Reasoning
by: Liao, Baohao, et al.
Published: (2025)
On Learning Verifiers and Implications to Chain-of-Thought Reasoning
by: Balcan, Maria-Florina, et al.
Published: (2025)
Learning Modal-Mixed Chain-of-Thought Reasoning with Latent Embeddings
by: Shao, Yifei, et al.
Published: (2026)
When does Chain-of-Thought Help: A Markovian Perspective
by: Wang, Zihan, et al.
Published: (2026)
A Theory of Online Learning with Autoregressive Chain-of-Thought Reasoning
by: Doron-Arad, Ilan, et al.
Published: (2026)
Dissecting Long-Chain-of-Thought Reasoning Models: An Empirical Study
by: Mu, Yongyu, et al.
Published: (2025)
Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs
by: Zhang, Xuan, et al.
Published: (2024)
Understanding Reasoning in Chain-of-Thought from the Hopfieldian View
by: Hu, Lijie, et al.
Published: (2024)
Reasoning Models Sometimes Output Illegible Chains of Thought
by: Jose, Arun
Published: (2025)
Demystifying Long Chain-of-Thought Reasoning in LLMs
by: Yeo, Edward, et al.
Published: (2025)
Unveiling Confirmation Bias in Chain-of-Thought Reasoning
by: Wan, Yue, et al.
Published: (2025)
Understanding Hidden Computations in Chain-of-Thought Reasoning
by: Bharadwaj, Aryasomayajula Ram
Published: (2024)
Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models
by: Ye, Jiacheng, et al.
Published: (2024)
Towards Optimal Statistical Watermarking
by: Huang, Baihe, et al.
Published: (2023)
Scaling Graph Chain-of-Thought Reasoning: A Multi-Agent Framework with Efficient LLM Serving
by: Huan, Chengying, et al.
Published: (2025)
Robotic Control via Embodied Chain-of-Thought Reasoning
by: Zawalski, Michał, et al.
Published: (2024)
Training Nonlinear Transformers for Chain-of-Thought Inference: A Theoretical Generalization Analysis
by: Li, Hongkang, et al.
Published: (2024)