:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Ghasemabadi, Amirhosein, Niu, Di
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2512.20578
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Guided by Gut: Efficient Test-Time Scaling with Reinforced Intrinsic Confidence
by: Ghasemabadi, Amirhosein, et al.
Published: (2025)

Learning Truncated Causal History Model for Video Restoration
by: Ghasemabadi, Amirhosein, et al.
Published: (2024)

Can LLMs Detect Their Own Hallucinations?
by: Kadotani, Sora, et al.
Published: (2025)

When Can LLMs Actually Correct Their Own Mistakes? A Critical Survey of Self-Correction of LLMs
by: Kamoi, Ryo, et al.
Published: (2024)

CascadedGaze: Efficiency in Global Context Extraction for Image Restoration
by: Ghasemabadi, Amirhosein, et al.
Published: (2024)

Grounding Degradations in Natural Language for All-In-One Video Restoration
by: Janjua, Muhammad Kamran, et al.
Published: (2025)

LLMs Can Generate a Better Answer by Aggregating Their Own Responses
by: Li, Zichong, et al.
Published: (2025)

Language Models Can Predict Their Own Behavior
by: Ashok, Dhananjay, et al.
Published: (2025)

Self-Interpretability: LLMs Can Describe Complex Internal Processes that Drive Their Decisions
by: Plunkett, Dillon, et al.
Published: (2025)

SelfReflect: Can LLMs Communicate Their Internal Answer Distribution?
by: Kirchhof, Michael, et al.
Published: (2025)

Read Your Own Mind: Reasoning Helps Surface Self-Confidence Signals in LLMs
by: Podolak, Jakub, et al.
Published: (2025)

BYOL: Bring Your Own Language Into LLMs
by: Zamir, Syed Waqas, et al.
Published: (2026)

Can AI Debias the News? LLM Interventions Improve Cross-Partisan Receptivity but LLMs Overestimate Their Own Effectiveness
by: Feroz, Faisal, et al.
Published: (2026)

Policy-Guided World Model Planning for Language-Conditioned Visual Navigation
by: Chahe, Amirhosein, et al.
Published: (2026)

Do LLMs Benefit From Their Own Words?
by: Huang, Jenny Y., et al.
Published: (2026)

Beyond Surface Statistics: Robust Conformal Prediction for LLMs via Internal Representations
by: Wang, Yanli, et al.
Published: (2026)

Two Failures of Self-Consistency in the Multi-Step Reasoning of LLMs
by: Chen, Angelica, et al.
Published: (2023)

All Circuits Lead to Rome: Rethinking Functional Anisotropy in Circuit and Sheaf Discovery for LLMs
by: Chen, Xi, et al.
Published: (2026)

LLMs Don't Know Their Own Decision Boundaries: The Unreliability of Self-Generated Counterfactual Explanations
by: Mayne, Harry, et al.
Published: (2025)

Skill-RAG: Failure-State-Aware Retrieval Augmentation via Hidden-State Probing and Skill Routing
by: Wei, Kai, et al.
Published: (2026)

Roll Out and Roll Back: Diffusion LLMs are Their Own Efficiency Teachers
by: Zeng, Fanqin, et al.
Published: (2026)

Do LLMs Follow Their Own Rules? A Reflexive Audit of Self-Stated Safety Policies
by: Mittal, Avni
Published: (2026)

PLDR-LLMs Learn A Generalizable Tensor Operator That Can Replace Its Own Deep Neural Net At Inference
by: Gokden, Burc
Published: (2025)

Break Me If You Can: Self-Jailbreaking of Aligned LLMs via Lexical Insertion Prompting
by: Kulshreshtha, Devang, et al.
Published: (2026)

From Long to Short: LLMs Excel at Trimming Own Reasoning Chains
by: Han, Wei, et al.
Published: (2025)

Can LLMs Translate Human Instructions into a Reinforcement Learning Agent's Internal Emergent Symbolic Representation?
by: Ma, Ziqi, et al.
Published: (2025)

Can LLMs Correct Themselves? A Benchmark of Self-Correction in LLMs
by: Tie, Guiyao, et al.
Published: (2025)

Be Your Own Red Teamer: Safety Alignment via Self-Play and Reflective Experience Replay
by: Wang, Hao, et al.
Published: (2026)

LLMs Can Teach Themselves to Better Predict the Future
by: Turtel, Benjamin, et al.
Published: (2025)

InternalInspector $I^2$: Robust Confidence Estimation in LLMs through Internal States
by: Beigi, Mohammad, et al.
Published: (2024)

FAMA: Failure-Aware Meta-Agentic Framework for Open-Source LLMs in Interactive Tool Use Environments
by: Saeidi, Amir, et al.
Published: (2026)

Can Public LLMs be used for Self-Diagnosis of Medical Conditions ?
by: Balasubramanian, Nikil Sharan Prabahar, et al.
Published: (2024)

Agentic Username Suggestion and Multimodal Gender Detection in Online Platforms: Introducing the PNGT-26K Dataset
by: Bijary, Farbod, et al.
Published: (2025)

Bring Your Own Prompts: Use-Case-Specific Bias and Fairness Evaluation for LLMs
by: Bouchard, Dylan
Published: (2024)

Hallucination Detection with the Internal Layers of LLMs
by: Preiß, Martin
Published: (2025)

Your Language Model is Its Own Critic: Reinforcement Learning with Value Estimation from Actor's Internal States
by: Choi, Yunho, et al.
Published: (2026)

Do LLMs Know They Are Being Tested? Evaluation Awareness and Incentive-Sensitive Failures in GPT-OSS-20B
by: Ahmed, Nisar, et al.
Published: (2025)

Enhancing the Medical Context-Awareness Ability of LLMs via Multifaceted Self-Refinement Learning
by: Zhou, Yuxuan, et al.
Published: (2025)

Jinx: Unlimited LLMs for Probing Alignment Failures
by: Zhao, Jiahao, et al.
Published: (2025)

Can Pre-training Indicators Reliably Predict Fine-tuning Outcomes of LLMs?
by: Zeng, Hansi, et al.
Published: (2025)