| Main Authors: | Chen, Zihan, Zhang, Yiming, Geng, Wenxiang, Ding, Zenghui, Sun, Yining |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2606.00674 |
Similar Items
Rethinking RL Evaluation: Can Benchmarks Truly Reveal Failures of RL Methods?
by: Chen, Zihan, et al.
Published: (2025)
RankCLIP: Ranking-Consistent Language-Image Pretraining
by: Zhang, Yiming, et al.
Published: (2024)
Information-Theoretic Causal Bounds under Unmeasured Confounding
by: Jung, Yonghan, et al.
Published: (2026)
Inductive Subgraphs as Shortcuts: Causal Disentanglement for Heterophilic Graph Learning
by: Wang, Xiangmeng, et al.
Published: (2026)
The Reliability Paradox: Exploring How Shortcut Learning Undermines Language Model Calibration
by: Bihani, Geetanjali, et al.
Published: (2024)
Networked Restless Multi-Arm Bandits with Reinforcement Learning
by: Zhang, Hanmo, et al.
Published: (2025)
DINORANKCLIP: DINOv3 Distillation and Injection for Vision-Language Pretraining with High-Order Ranking Consistency
by: Jiang, Shuyang, et al.
Published: (2026)
Meaningful Causal Aggregation and Paradoxical Confounding
by: Zhu, Yuchen, et al.
Published: (2023)
Single-stream Policy Optimization
by: Xu, Zhongwen, et al.
Published: (2025)
The Reasoning Trap: An Information-Theoretic Bound on Closed-System Multi-Step LLM Reasoning
by: Shin, Kwan Soo
Published: (2026)
LGMT: Logic-Grounded Metamorphic Testing for Evaluating the Reasoning Reliability of LLMs
by: Zhou, Zenghui, et al.
Published: (2026)
TimeMKG: Knowledge-Infused Causal Reasoning for Multivariate Time Series Modeling
by: Sun, Yifei, et al.
Published: (2025)
CARE: Turning LLMs Into Causal Reasoning Expert
by: Dong, Juncheng, et al.
Published: (2025)
Can Post-Training Transform LLMs into Causal Reasoners?
by: Chen, Junqi, et al.
Published: (2026)
BEARS Make Neuro-Symbolic Models Aware of their Reasoning Shortcuts
by: Marconato, Emanuele, et al.
Published: (2024)
GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic Evaluations
by: Duan, Jinhao, et al.
Published: (2024)
A Neuro-Symbolic Benchmark Suite for Concept Quality and Reasoning Shortcuts
by: Bortolotti, Samuele, et al.
Published: (2024)
MiMu: Mitigating Multiple Shortcut Learning Behavior of Transformers
by: Zhao, Lili, et al.
Published: (2025)
Theoretical Analysis of Meta Reinforcement Learning: Generalization Bounds and Convergence Guarantees
by: Wang, Cangqing, et al.
Published: (2024)
Meta-Aligner: Bidirectional Preference-Policy Optimization for Multi-Objective LLMs Alignment
by: Xu, Wenzhe, et al.
Published: (2026)
Calibration-Aware Policy Optimization for Reasoning LLMs
by: Wang, Ziqi, et al.
Published: (2026)
Reshaping Reasoning in LLMs: A Theoretical Analysis of RL Training Dynamics through Pattern Selection
by: Chen, Xingwu, et al.
Published: (2025)
Symbol Grounding in Neuro-Symbolic AI: A Gentle Introduction to Reasoning Shortcuts
by: Marconato, Emanuele, et al.
Published: (2025)
Information-Theoretic Safe Bayesian Optimization
by: Bottero, Alessandro G., et al.
Published: (2024)
Learning to Correct for QA Reasoning with Black-box LLMs
by: Kim, Jaehyung, et al.
Published: (2024)
Reasoning Stabilization Point: A Training-Time Signal for Stable Evidence and Shortcut Reliance
by: Dhayalkar, Sahil Rajesh
Published: (2026)
Generalization of RLVR Using Causal Reasoning as a Testbed
by: Lu, Brian, et al.
Published: (2025)
Efficient Transfer Learning via Causal Bounds
by: Gong, Xueping, et al.
Published: (2023)
Learning with Logical Constraints but without Shortcut Satisfaction
by: Li, Zenan, et al.
Published: (2024)
Rectifying Shortcut Behaviors in Preference-based Reward Learning
by: Ye, Wenqian, et al.
Published: (2025)
Probing Neural Combinatorial Optimization Models
by: Zhang, Zhiqin, et al.
Published: (2025)
Revisiting Meta-Learning with Noisy Labels: Reweighting Dynamics and Theoretical Guarantees
by: Zhang, Yiming, et al.
Published: (2025)
InfoRM: Mitigating Reward Hacking in RLHF via Information-Theoretic Reward Modeling
by: Miao, Yuchun, et al.
Published: (2024)
Gradient-based Model Shortcut Detection for Time Series Classification
by: Ibarra, Salomon, et al.
Published: (2025)
From Static Analysis to Audience Dissemination: A Training-Free Multimodal Controversy Detection Multi-Agent Framework
by: Ding, Zihan, et al.
Published: (2026)
PrismAgent: Illuminating Harm in Memes via a Zero-Shot Interpretable Multi-Agent Framework
by: Ding, Zihan, et al.
Published: (2026)
On the Empirical Complexity of Reasoning and Planning in LLMs
by: Kang, Liwei, et al.
Published: (2024)
Transferring Information Across Interventions in Causal Bayesian Optimization
by: Javidian, Mohammad Ali
Published: (2026)
UniPruning: Unifying Local Metric and Global Feedback for Scalable Sparse LLMs
by: Ding, Yizhuo, et al.
Published: (2025)
Deciphering Scientific Reasoning Steps from Outcome Data for Molecule Optimization
by: Liu, Zequn, et al.
Published: (2026)