:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Chen, Zihan, Zhang, Yiming, Geng, Wenxiang, Ding, Zenghui, Sun, Yining
Format:	Preprint
Published:	2026
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2606.00674
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Rethinking RL Evaluation: Can Benchmarks Truly Reveal Failures of RL Methods?
by: Chen, Zihan, et al.
Published: (2025)

RankCLIP: Ranking-Consistent Language-Image Pretraining
by: Zhang, Yiming, et al.
Published: (2024)

Information-Theoretic Causal Bounds under Unmeasured Confounding
by: Jung, Yonghan, et al.
Published: (2026)

Inductive Subgraphs as Shortcuts: Causal Disentanglement for Heterophilic Graph Learning
by: Wang, Xiangmeng, et al.
Published: (2026)

The Reliability Paradox: Exploring How Shortcut Learning Undermines Language Model Calibration
by: Bihani, Geetanjali, et al.
Published: (2024)

Networked Restless Multi-Arm Bandits with Reinforcement Learning
by: Zhang, Hanmo, et al.
Published: (2025)

DINORANKCLIP: DINOv3 Distillation and Injection for Vision-Language Pretraining with High-Order Ranking Consistency
by: Jiang, Shuyang, et al.
Published: (2026)

Meaningful Causal Aggregation and Paradoxical Confounding
by: Zhu, Yuchen, et al.
Published: (2023)

Single-stream Policy Optimization
by: Xu, Zhongwen, et al.
Published: (2025)

The Reasoning Trap: An Information-Theoretic Bound on Closed-System Multi-Step LLM Reasoning
by: Shin, Kwan Soo
Published: (2026)

LGMT: Logic-Grounded Metamorphic Testing for Evaluating the Reasoning Reliability of LLMs
by: Zhou, Zenghui, et al.
Published: (2026)

TimeMKG: Knowledge-Infused Causal Reasoning for Multivariate Time Series Modeling
by: Sun, Yifei, et al.
Published: (2025)

CARE: Turning LLMs Into Causal Reasoning Expert
by: Dong, Juncheng, et al.
Published: (2025)

Can Post-Training Transform LLMs into Causal Reasoners?
by: Chen, Junqi, et al.
Published: (2026)

BEARS Make Neuro-Symbolic Models Aware of their Reasoning Shortcuts
by: Marconato, Emanuele, et al.
Published: (2024)

GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic Evaluations
by: Duan, Jinhao, et al.
Published: (2024)

A Neuro-Symbolic Benchmark Suite for Concept Quality and Reasoning Shortcuts
by: Bortolotti, Samuele, et al.
Published: (2024)

MiMu: Mitigating Multiple Shortcut Learning Behavior of Transformers
by: Zhao, Lili, et al.
Published: (2025)

Theoretical Analysis of Meta Reinforcement Learning: Generalization Bounds and Convergence Guarantees
by: Wang, Cangqing, et al.
Published: (2024)

Meta-Aligner: Bidirectional Preference-Policy Optimization for Multi-Objective LLMs Alignment
by: Xu, Wenzhe, et al.
Published: (2026)

Calibration-Aware Policy Optimization for Reasoning LLMs
by: Wang, Ziqi, et al.
Published: (2026)

Reshaping Reasoning in LLMs: A Theoretical Analysis of RL Training Dynamics through Pattern Selection
by: Chen, Xingwu, et al.
Published: (2025)

Symbol Grounding in Neuro-Symbolic AI: A Gentle Introduction to Reasoning Shortcuts
by: Marconato, Emanuele, et al.
Published: (2025)

Information-Theoretic Safe Bayesian Optimization
by: Bottero, Alessandro G., et al.
Published: (2024)

Learning to Correct for QA Reasoning with Black-box LLMs
by: Kim, Jaehyung, et al.
Published: (2024)

Reasoning Stabilization Point: A Training-Time Signal for Stable Evidence and Shortcut Reliance
by: Dhayalkar, Sahil Rajesh
Published: (2026)

Generalization of RLVR Using Causal Reasoning as a Testbed
by: Lu, Brian, et al.
Published: (2025)

Efficient Transfer Learning via Causal Bounds
by: Gong, Xueping, et al.
Published: (2023)

Learning with Logical Constraints but without Shortcut Satisfaction
by: Li, Zenan, et al.
Published: (2024)

Rectifying Shortcut Behaviors in Preference-based Reward Learning
by: Ye, Wenqian, et al.
Published: (2025)

Probing Neural Combinatorial Optimization Models
by: Zhang, Zhiqin, et al.
Published: (2025)

Revisiting Meta-Learning with Noisy Labels: Reweighting Dynamics and Theoretical Guarantees
by: Zhang, Yiming, et al.
Published: (2025)

InfoRM: Mitigating Reward Hacking in RLHF via Information-Theoretic Reward Modeling
by: Miao, Yuchun, et al.
Published: (2024)

Gradient-based Model Shortcut Detection for Time Series Classification
by: Ibarra, Salomon, et al.
Published: (2025)

From Static Analysis to Audience Dissemination: A Training-Free Multimodal Controversy Detection Multi-Agent Framework
by: Ding, Zihan, et al.
Published: (2026)

PrismAgent: Illuminating Harm in Memes via a Zero-Shot Interpretable Multi-Agent Framework
by: Ding, Zihan, et al.
Published: (2026)

On the Empirical Complexity of Reasoning and Planning in LLMs
by: Kang, Liwei, et al.
Published: (2024)

Transferring Information Across Interventions in Causal Bayesian Optimization
by: Javidian, Mohammad Ali
Published: (2026)

UniPruning: Unifying Local Metric and Global Feedback for Scalable Sparse LLMs
by: Ding, Yizhuo, et al.
Published: (2025)

Deciphering Scientific Reasoning Steps from Outcome Data for Molecule Optimization
by: Liu, Zequn, et al.
Published: (2026)