| Main Authors: | Ding, Chenlu, Wu, Jiancan, Luo, Yanchen, Liu, Zheyuan, Yuan, Yancheng, Wang, Xiang |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.14636 |
Similar Items
MLLMEraser: Achieving Test-Time Unlearning in Multimodal Large Language Models through Activation Steering
by: Ding, Chenlu, et al.
Published: (2025)
Delayed Feedback Modeling with Influence Functions
by: Ding, Chenlu, et al.
Published: (2025)
Unified Parameter-Efficient Unlearning for LLMs
by: Ding, Chenlu, et al.
Published: (2024)
Enhancing Temporal Sensitivity of Large Language Model for Recommendation with Counterfactual Tuning
by: Liu, Yutian, et al.
Published: (2025)
Adaptive Self-supervised Robust Clustering for Unstructured Data with Unknown Cluster Number
by: Ding, Chen-Lu, et al.
Published: (2024)
Text-guided Diffusion Model for 3D Molecule Generation
by: Luo, Yanchen, et al.
Published: (2024)
On Negative-aware Preference Optimization for Recommendation
by: Ding, Chenlu, et al.
Published: (2025)
Process Supervision via Verbal Critique Improves Reasoning in Large Language Models
by: Chen, Hao-Yuan
Published: (2026)
Teaching Language Models to Critique via Reinforcement Learning
by: Xie, Zhihui, et al.
Published: (2025)
KnowRL: Teaching Language Models to Know What They Know
by: Kale, Sahil, et al.
Published: (2025)
Reasoning about Uncertainty: Do Reasoning Models Know When They Don't Know?
by: Mei, Zhiting, et al.
Published: (2025)
CaRT: Teaching LLM Agents to Know When They Know Enough
by: Liu, Grace, et al.
Published: (2025)
Teaching Language Models to Reason with Tools
by: Li, Chengpeng, et al.
Published: (2025)
Quantile Advantage Estimation: Stabilizing RLVR for LLM Reasoning
by: Wu, Junkang, et al.
Published: (2025)
Base Models Know How to Reason, Thinking Models Learn When
by: Venhoff, Constantin, et al.
Published: (2025)
Large Language Models Often Know When They Are Being Evaluated
by: Needham, Joe, et al.
Published: (2025)
Do Retrieval Augmented Language Models Know When They Don't Know?
by: Zhou, Youchao, et al.
Published: (2025)
Self-Evolving Critique Abilities in Large Language Models
by: Tang, Zhengyang, et al.
Published: (2025)
Scaling Retrieval-Augmented Reasoning with Parallel Search and Explicit Merging
by: Liu, Jiabei, et al.
Published: (2026)
Do Large Language Models Mentalize When They Teach?
by: Harootonian, Sevan K., et al.
Published: (2026)
The Critique of Critique
by: Sun, Shichao, et al.
Published: (2024)
Invariant Graph Learning Meets Information Bottleneck for Out-of-Distribution Generalization
by: Mao, Wenyu, et al.
Published: (2024)
Does Your Reasoning Model Implicitly Know When to Stop Thinking?
by: Huang, Zixuan, et al.
Published: (2026)
CritiqueLLM: Towards an Informative Critique Generation Model for Evaluation of Large Language Model Generation
by: Ke, Pei, et al.
Published: (2023)
Customizing Language Models with Instance-wise LoRA for Sequential Recommendation
by: Kong, Xiaoyu, et al.
Published: (2024)
LookWise: Knowing When and Where to Look for Fine-Grained Visual Reasoning in Multimodal Large Language Models
by: Shen, Yuxiang, et al.
Published: (2026)
When Models Know When They Do Not Know: Calibration, Cascading, and Cleaning
by: Hao, Chenjie, et al.
Published: (2026)
Computing Ex Ante Equilibrium in Heterogeneous Zero-Sum Team Games
by: Liu, Naming, et al.
Published: (2024)
Critique-RL: Training Language Models for Critiquing through Two-Stage Reinforcement Learning
by: Xi, Zhiheng, et al.
Published: (2025)
Critique of Impure Reason: Unveiling the reasoning behaviour of medical Large Language Models
by: Sim, Shamus, et al.
Published: (2024)
Towards Agents That Know When They Don't Know: Uncertainty as a Control Signal for Structured Reasoning
by: Stoisser, Josefa Lia, et al.
Published: (2025)
Ex Ante Evaluation of AI-Induced Idea Diversity Collapse
by: Azad, Nafis Saami, et al.
Published: (2026)
Epistemic Deep Learning: Enabling Machine Learning Models to Know When They Do Not Know
by: Manchingal, Shireen Kudukkil
Published: (2025)
Know When to Explore: Difficulty-Aware Certainty as a Guide for LLM Reinforcement Learning
by: Li, Ang, et al.
Published: (2025)
Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization
by: Wu, Junkang, et al.
Published: (2024)
Epistemic Artificial Intelligence is Essential for Machine Learning Models to Truly 'Know When They Do Not Know'
by: Manchingal, Shireen Kudukkil, et al.
Published: (2025)
RePO: Understanding Preference Learning Through ReLU-Based Optimization
by: Wu, Junkang, et al.
Published: (2025)
Knowing When Not to Answer: Abstention-Aware Scientific Reasoning
by: Abdaljalil, Samir, et al.
Published: (2026)
MuggleMath: Assessing the Impact of Query and Response Augmentation on Math Reasoning
by: Li, Chengpeng, et al.
Published: (2023)
Reasoning on Graphs: Faithful and Interpretable Large Language Model Reasoning
by: Luo, Linhao, et al.
Published: (2023)