Saved in:
| Main Authors: | Feng, Yunzhen, Kempe, Julia, Zhang, Cheng, Jain, Parag, Hartshorn, Anthony |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.19284 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Don't Waste Mistakes: Leveraging Negative RL-Groups via Confidence Reweighting
by: Feng, Yunzhen, et al.
Published: (2025)
by: Feng, Yunzhen, et al.
Published: (2025)
Model Collapse Demystified: The Case of Regression
by: Dohmatob, Elvis, et al.
Published: (2024)
by: Dohmatob, Elvis, et al.
Published: (2024)
Strong Model Collapse
by: Dohmatob, Elvis, et al.
Published: (2024)
by: Dohmatob, Elvis, et al.
Published: (2024)
PILAF: Optimal Human Preference Sampling for Reward Modeling
by: Feng, Yunzhen, et al.
Published: (2025)
by: Feng, Yunzhen, et al.
Published: (2025)
Beyond Model Collapse: Scaling Up with Synthesized Data Requires Verification
by: Feng, Yunzhen, et al.
Published: (2024)
by: Feng, Yunzhen, et al.
Published: (2024)
A Tale of Tails: Model Collapse as a Change of Scaling Laws
by: Dohmatob, Elvis, et al.
Published: (2024)
by: Dohmatob, Elvis, et al.
Published: (2024)
Attacking Bayes: On the Adversarial Robustness of Bayesian Neural Networks
by: Feng, Yunzhen, et al.
Published: (2024)
by: Feng, Yunzhen, et al.
Published: (2024)
Self-Verifying Reflection Helps Transformers with CoT Reasoning
by: Yu, Zhongwei, et al.
Published: (2025)
by: Yu, Zhongwei, et al.
Published: (2025)
Outcome-based Exploration for LLM Reasoning
by: Song, Yuda, et al.
Published: (2025)
by: Song, Yuda, et al.
Published: (2025)
From Explicit CoT to Implicit CoT: Learning to Internalize CoT Step by Step
by: Deng, Yuntian, et al.
Published: (2024)
by: Deng, Yuntian, et al.
Published: (2024)
Exploring the Limitations of Mamba in COPY and CoT Reasoning
by: Ren, Ruifeng, et al.
Published: (2024)
by: Ren, Ruifeng, et al.
Published: (2024)
CDW-CoT: Clustered Distance-Weighted Chain-of-Thoughts Reasoning
by: Fang, Yuanheng, et al.
Published: (2025)
by: Fang, Yuanheng, et al.
Published: (2025)
Distribution-Aligned Sequence Distillation for Superior Long-CoT Reasoning
by: Yan, Shaotian, et al.
Published: (2026)
by: Yan, Shaotian, et al.
Published: (2026)
How Likely Do LLMs with CoT Mimic Human Reasoning?
by: Bao, Guangsheng, et al.
Published: (2024)
by: Bao, Guangsheng, et al.
Published: (2024)
The Quest for Efficient Reasoning: A Data-Centric Benchmark to CoT Distillation
by: Zhang, Ruichen, et al.
Published: (2025)
by: Zhang, Ruichen, et al.
Published: (2025)
Divide-and-Conquer CoT: RL for Reducing Latency via Parallel Reasoning
by: Mahankali, Arvind, et al.
Published: (2026)
by: Mahankali, Arvind, et al.
Published: (2026)
What's Behind PPO's Collapse in Long-CoT? Value Optimization Holds the Secret
by: Yuan, Yufeng, et al.
Published: (2025)
by: Yuan, Yufeng, et al.
Published: (2025)
Emergent properties with repeated examples
by: Charton, François, et al.
Published: (2024)
by: Charton, François, et al.
Published: (2024)
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning
by: Sprague, Zayne, et al.
Published: (2024)
by: Sprague, Zayne, et al.
Published: (2024)
UML-CoT: Structured Reasoning and Planning with Unified Modeling Language for Robotic Room Cleaning
by: Chen, Hongyu, et al.
Published: (2025)
by: Chen, Hongyu, et al.
Published: (2025)
M3Hop-CoT: Misogynous Meme Identification with Multimodal Multi-hop Chain-of-Thought
by: Kumari, Gitanjali, et al.
Published: (2024)
by: Kumari, Gitanjali, et al.
Published: (2024)
Demonstrations, CoT, and Prompting: A Theoretical Analysis of ICL
by: Tong, Xuhan, et al.
Published: (2026)
by: Tong, Xuhan, et al.
Published: (2026)
LayerCraft: Enhancing Text-to-Image Generation with CoT Reasoning and Layered Object Integration
by: Zhang, Yuyao, et al.
Published: (2025)
by: Zhang, Yuyao, et al.
Published: (2025)
The Ends Justify the Thoughts: RL-Induced Motivated Reasoning in LLM CoTs
by: Howe, Nikolaus, et al.
Published: (2025)
by: Howe, Nikolaus, et al.
Published: (2025)
Compositional Generalization from Learned Skills via CoT Training: A Theoretical and Structural Analysis for Reasoning
by: Yao, Xinhao, et al.
Published: (2025)
by: Yao, Xinhao, et al.
Published: (2025)
Zebra-CoT: A Dataset for Interleaved Vision Language Reasoning
by: Li, Ang, et al.
Published: (2025)
by: Li, Ang, et al.
Published: (2025)
Unveiling and Causalizing CoT: A Causal Pespective
by: Fu, Jiarun, et al.
Published: (2025)
by: Fu, Jiarun, et al.
Published: (2025)
EPiC: Towards Lossless Speedup for Reasoning Training through Edge-Preserving CoT Condensation
by: Jia, Jinghan, et al.
Published: (2025)
by: Jia, Jinghan, et al.
Published: (2025)
LatentChem: From Textual CoT to Latent Thinking in Chemical Reasoning
by: Ye, Xinwu, et al.
Published: (2026)
by: Ye, Xinwu, et al.
Published: (2026)
Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability
by: Sundaram, Shobhita, et al.
Published: (2026)
by: Sundaram, Shobhita, et al.
Published: (2026)
Deconstructing the Goldilocks Zone of Neural Network Initialization
by: Vysogorets, Artem, et al.
Published: (2024)
by: Vysogorets, Artem, et al.
Published: (2024)
DeLTa: A Decoding Strategy based on Logit Trajectory Prediction Improves Factuality and Reasoning Ability
by: He, Yunzhen, et al.
Published: (2025)
by: He, Yunzhen, et al.
Published: (2025)
The Illusion of Reasoning: Exposing Evasive Data Contamination in LLMs via Zero-CoT Truncation
by: Lan, Yifan, et al.
Published: (2026)
by: Lan, Yifan, et al.
Published: (2026)
Hindsight Hint Distillation: Scaffolded Reasoning for SWE Agents from CoT-free Answers
by: Wang, Shengjie, et al.
Published: (2026)
by: Wang, Shengjie, et al.
Published: (2026)
RedStar: Does Scaling Long-CoT Data Unlock Better Slow-Reasoning Systems?
by: Xu, Haotian, et al.
Published: (2025)
by: Xu, Haotian, et al.
Published: (2025)
Audio Flamingo Sound-CoT Technical Report: Improving Chain-of-Thought Reasoning in Sound Understanding
by: Kong, Zhifeng, et al.
Published: (2025)
by: Kong, Zhifeng, et al.
Published: (2025)
On the Robustness of Neural Collapse and the Neural Collapse of Robustness
by: Su, Jingtong, et al.
Published: (2023)
by: Su, Jingtong, et al.
Published: (2023)
Training on Documents About Monitoring Leads to CoT Obfuscation
by: Haskins, Reilly, et al.
Published: (2026)
by: Haskins, Reilly, et al.
Published: (2026)
Cross Domain Evaluation of Multimodal Chain-of-Thought Reasoning of different datasets into the Amazon CoT Framework
by: Tiwari, Nitya, et al.
Published: (2025)
by: Tiwari, Nitya, et al.
Published: (2025)
CoT-VLA: Visual Chain-of-Thought Reasoning for Vision-Language-Action Models
by: Zhao, Qingqing, et al.
Published: (2025)
by: Zhao, Qingqing, et al.
Published: (2025)
Similar Items
-
Don't Waste Mistakes: Leveraging Negative RL-Groups via Confidence Reweighting
by: Feng, Yunzhen, et al.
Published: (2025) -
Model Collapse Demystified: The Case of Regression
by: Dohmatob, Elvis, et al.
Published: (2024) -
Strong Model Collapse
by: Dohmatob, Elvis, et al.
Published: (2024) -
PILAF: Optimal Human Preference Sampling for Reward Modeling
by: Feng, Yunzhen, et al.
Published: (2025) -
Beyond Model Collapse: Scaling Up with Synthesized Data Requires Verification
by: Feng, Yunzhen, et al.
Published: (2024)