Saved in:
| Main Authors: | Ghasemabadi, Amirhosein, Niu, Di |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2512.20578 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Guided by Gut: Efficient Test-Time Scaling with Reinforced Intrinsic Confidence
by: Ghasemabadi, Amirhosein, et al.
Published: (2025)
by: Ghasemabadi, Amirhosein, et al.
Published: (2025)
Learning Truncated Causal History Model for Video Restoration
by: Ghasemabadi, Amirhosein, et al.
Published: (2024)
by: Ghasemabadi, Amirhosein, et al.
Published: (2024)
Can LLMs Detect Their Own Hallucinations?
by: Kadotani, Sora, et al.
Published: (2025)
by: Kadotani, Sora, et al.
Published: (2025)
When Can LLMs Actually Correct Their Own Mistakes? A Critical Survey of Self-Correction of LLMs
by: Kamoi, Ryo, et al.
Published: (2024)
by: Kamoi, Ryo, et al.
Published: (2024)
CascadedGaze: Efficiency in Global Context Extraction for Image Restoration
by: Ghasemabadi, Amirhosein, et al.
Published: (2024)
by: Ghasemabadi, Amirhosein, et al.
Published: (2024)
Grounding Degradations in Natural Language for All-In-One Video Restoration
by: Janjua, Muhammad Kamran, et al.
Published: (2025)
by: Janjua, Muhammad Kamran, et al.
Published: (2025)
LLMs Can Generate a Better Answer by Aggregating Their Own Responses
by: Li, Zichong, et al.
Published: (2025)
by: Li, Zichong, et al.
Published: (2025)
Language Models Can Predict Their Own Behavior
by: Ashok, Dhananjay, et al.
Published: (2025)
by: Ashok, Dhananjay, et al.
Published: (2025)
Self-Interpretability: LLMs Can Describe Complex Internal Processes that Drive Their Decisions
by: Plunkett, Dillon, et al.
Published: (2025)
by: Plunkett, Dillon, et al.
Published: (2025)
SelfReflect: Can LLMs Communicate Their Internal Answer Distribution?
by: Kirchhof, Michael, et al.
Published: (2025)
by: Kirchhof, Michael, et al.
Published: (2025)
Read Your Own Mind: Reasoning Helps Surface Self-Confidence Signals in LLMs
by: Podolak, Jakub, et al.
Published: (2025)
by: Podolak, Jakub, et al.
Published: (2025)
BYOL: Bring Your Own Language Into LLMs
by: Zamir, Syed Waqas, et al.
Published: (2026)
by: Zamir, Syed Waqas, et al.
Published: (2026)
Can AI Debias the News? LLM Interventions Improve Cross-Partisan Receptivity but LLMs Overestimate Their Own Effectiveness
by: Feroz, Faisal, et al.
Published: (2026)
by: Feroz, Faisal, et al.
Published: (2026)
Policy-Guided World Model Planning for Language-Conditioned Visual Navigation
by: Chahe, Amirhosein, et al.
Published: (2026)
by: Chahe, Amirhosein, et al.
Published: (2026)
Do LLMs Benefit From Their Own Words?
by: Huang, Jenny Y., et al.
Published: (2026)
by: Huang, Jenny Y., et al.
Published: (2026)
Beyond Surface Statistics: Robust Conformal Prediction for LLMs via Internal Representations
by: Wang, Yanli, et al.
Published: (2026)
by: Wang, Yanli, et al.
Published: (2026)
Two Failures of Self-Consistency in the Multi-Step Reasoning of LLMs
by: Chen, Angelica, et al.
Published: (2023)
by: Chen, Angelica, et al.
Published: (2023)
All Circuits Lead to Rome: Rethinking Functional Anisotropy in Circuit and Sheaf Discovery for LLMs
by: Chen, Xi, et al.
Published: (2026)
by: Chen, Xi, et al.
Published: (2026)
LLMs Don't Know Their Own Decision Boundaries: The Unreliability of Self-Generated Counterfactual Explanations
by: Mayne, Harry, et al.
Published: (2025)
by: Mayne, Harry, et al.
Published: (2025)
Skill-RAG: Failure-State-Aware Retrieval Augmentation via Hidden-State Probing and Skill Routing
by: Wei, Kai, et al.
Published: (2026)
by: Wei, Kai, et al.
Published: (2026)
Roll Out and Roll Back: Diffusion LLMs are Their Own Efficiency Teachers
by: Zeng, Fanqin, et al.
Published: (2026)
by: Zeng, Fanqin, et al.
Published: (2026)
Do LLMs Follow Their Own Rules? A Reflexive Audit of Self-Stated Safety Policies
by: Mittal, Avni
Published: (2026)
by: Mittal, Avni
Published: (2026)
PLDR-LLMs Learn A Generalizable Tensor Operator That Can Replace Its Own Deep Neural Net At Inference
by: Gokden, Burc
Published: (2025)
by: Gokden, Burc
Published: (2025)
Break Me If You Can: Self-Jailbreaking of Aligned LLMs via Lexical Insertion Prompting
by: Kulshreshtha, Devang, et al.
Published: (2026)
by: Kulshreshtha, Devang, et al.
Published: (2026)
From Long to Short: LLMs Excel at Trimming Own Reasoning Chains
by: Han, Wei, et al.
Published: (2025)
by: Han, Wei, et al.
Published: (2025)
Can LLMs Translate Human Instructions into a Reinforcement Learning Agent's Internal Emergent Symbolic Representation?
by: Ma, Ziqi, et al.
Published: (2025)
by: Ma, Ziqi, et al.
Published: (2025)
Can LLMs Correct Themselves? A Benchmark of Self-Correction in LLMs
by: Tie, Guiyao, et al.
Published: (2025)
by: Tie, Guiyao, et al.
Published: (2025)
Be Your Own Red Teamer: Safety Alignment via Self-Play and Reflective Experience Replay
by: Wang, Hao, et al.
Published: (2026)
by: Wang, Hao, et al.
Published: (2026)
LLMs Can Teach Themselves to Better Predict the Future
by: Turtel, Benjamin, et al.
Published: (2025)
by: Turtel, Benjamin, et al.
Published: (2025)
InternalInspector $I^2$: Robust Confidence Estimation in LLMs through Internal States
by: Beigi, Mohammad, et al.
Published: (2024)
by: Beigi, Mohammad, et al.
Published: (2024)
FAMA: Failure-Aware Meta-Agentic Framework for Open-Source LLMs in Interactive Tool Use Environments
by: Saeidi, Amir, et al.
Published: (2026)
by: Saeidi, Amir, et al.
Published: (2026)
Can Public LLMs be used for Self-Diagnosis of Medical Conditions ?
by: Balasubramanian, Nikil Sharan Prabahar, et al.
Published: (2024)
by: Balasubramanian, Nikil Sharan Prabahar, et al.
Published: (2024)
Agentic Username Suggestion and Multimodal Gender Detection in Online Platforms: Introducing the PNGT-26K Dataset
by: Bijary, Farbod, et al.
Published: (2025)
by: Bijary, Farbod, et al.
Published: (2025)
Bring Your Own Prompts: Use-Case-Specific Bias and Fairness Evaluation for LLMs
by: Bouchard, Dylan
Published: (2024)
by: Bouchard, Dylan
Published: (2024)
Hallucination Detection with the Internal Layers of LLMs
by: Preiß, Martin
Published: (2025)
by: Preiß, Martin
Published: (2025)
Your Language Model is Its Own Critic: Reinforcement Learning with Value Estimation from Actor's Internal States
by: Choi, Yunho, et al.
Published: (2026)
by: Choi, Yunho, et al.
Published: (2026)
Do LLMs Know They Are Being Tested? Evaluation Awareness and Incentive-Sensitive Failures in GPT-OSS-20B
by: Ahmed, Nisar, et al.
Published: (2025)
by: Ahmed, Nisar, et al.
Published: (2025)
Enhancing the Medical Context-Awareness Ability of LLMs via Multifaceted Self-Refinement Learning
by: Zhou, Yuxuan, et al.
Published: (2025)
by: Zhou, Yuxuan, et al.
Published: (2025)
Jinx: Unlimited LLMs for Probing Alignment Failures
by: Zhao, Jiahao, et al.
Published: (2025)
by: Zhao, Jiahao, et al.
Published: (2025)
Can Pre-training Indicators Reliably Predict Fine-tuning Outcomes of LLMs?
by: Zeng, Hansi, et al.
Published: (2025)
by: Zeng, Hansi, et al.
Published: (2025)
Similar Items
-
Guided by Gut: Efficient Test-Time Scaling with Reinforced Intrinsic Confidence
by: Ghasemabadi, Amirhosein, et al.
Published: (2025) -
Learning Truncated Causal History Model for Video Restoration
by: Ghasemabadi, Amirhosein, et al.
Published: (2024) -
Can LLMs Detect Their Own Hallucinations?
by: Kadotani, Sora, et al.
Published: (2025) -
When Can LLMs Actually Correct Their Own Mistakes? A Critical Survey of Self-Correction of LLMs
by: Kamoi, Ryo, et al.
Published: (2024) -
CascadedGaze: Efficiency in Global Context Extraction for Image Restoration
by: Ghasemabadi, Amirhosein, et al.
Published: (2024)