| Main Authors: | Phillips, Edward, Wu, Sean, Gustafsson, Fredrik K., Gao, Boyan, Clifton, David A. |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.04577 |
Similar Items
BAS: A Decision-Theoretic Approach to Evaluating Large Language Model Confidence
by: Wu, Sean, et al.
Published: (2026)
Entropy Alone is Insufficient for Safe Selective Prediction in LLMs
by: Phillips, Edward, et al.
Published: (2026)
Geometric Uncertainty for Detecting and Correcting Hallucinations in LLMs
by: Phillips, Edward, et al.
Published: (2025)
Uncertainty Distillation: Teaching Language Models to Express Semantic Confidence
by: Hager, Sophia, et al.
Published: (2025)
SpikeLLM: Scaling up Spiking Neural Network to Large Language Models via Saliency-based Spiking
by: Xing, Xingrun, et al.
Published: (2024)
Cognition Chain for Explainable Psychological Stress Detection on Social Media
by: Wang, Xin, et al.
Published: (2024)
Trust Region On-Policy Distillation
by: Xing, Xingrun, et al.
Published: (2026)
Semantic Self-Consistency: Enhancing Language Model Reasoning via Semantic Weighting
by: Knappe, Tim, et al.
Published: (2024)
Self-Data Distillation for Recovering Quality in Pruned Large Language Models
by: Thangarasa, Vithursan, et al.
Published: (2024)
Benchmarking Pathology Foundation Models for Breast Cancer Survival Prediction
by: Gustafsson, Fredrik K., et al.
Published: (2026)
Self-Distilled Reasoner: On-Policy Self-Distillation for Large Language Models
by: Zhao, Siyan, et al.
Published: (2026)
Uncertainty in Semantic Language Modeling with PIXELS
by: Radu, Stefania, et al.
Published: (2025)
From Large to Tiny: Distilling and Refining Mathematical Expertise for Math Word Problems with Weakly Supervision
by: Lin, Qingwen, et al.
Published: (2024)
MASSV: Multimodal Adaptation and Self-Data Distillation for Speculative Decoding of Vision-Language Models
by: Ganesan, Mugilan, et al.
Published: (2025)
Multi-Granularity Semantic Revision for Large Language Model Distillation
by: Liu, Xiaoyu, et al.
Published: (2024)
Self-Distillation for Model Stacking Unlocks Cross-Lingual NLU in 200+ Languages
by: Schmidt, Fabian David, et al.
Published: (2024)
FE-Adapter: Adapting Image-based Emotion Classifiers to Videos
by: Gowda, Shreyank N, et al.
Published: (2024)
Can Language Models Be Tricked by Language Illusions? Easier with Syntax, Harder with Semantics
by: Zhang, Yuhan, et al.
Published: (2023)
PLPP: Prompt Learning with Perplexity Is Self-Distillation for Vision-Language Models
by: Liu, Biao, et al.
Published: (2024)
HumorGen: Cognitive Synergy for Humor Generation in Large Language Models via Persona-Based Distillation
by: Ajayi, Edward, et al.
Published: (2026)
Self-Updatable Large Language Models by Integrating Context into Model Parameters
by: Wang, Yu, et al.
Published: (2024)
D2LLM: Decomposed and Distilled Large Language Models for Semantic Search
by: Liao, Zihan, et al.
Published: (2024)
Language Confusion Gate: Language-Aware Decoding Through Model Self-Distillation
by: Zhang, Collin, et al.
Published: (2025)
On-Policy Context Distillation for Language Models
by: Ye, Tianzhu, et al.
Published: (2026)
Are Large Language Models Good Statisticians?
by: Zhu, Yizhang, et al.
Published: (2024)
Self-Distillation Bridges Distribution Gap in Language Model Fine-Tuning
by: Yang, Zhaorui, et al.
Published: (2024)
Mind's Mirror: Distilling Self-Evaluation Capability and Comprehensive Thinking from Large Language Models
by: Liu, Weize, et al.
Published: (2023)
Self-Distilled Trajectory-Aware Boltzmann Modeling: Bridging the Training-Inference Discrepancy in Diffusion Language Models
by: Chen, Kecheng, et al.
Published: (2026)
Estimating the Black-box LLM Uncertainty with Distribution-Aligned Adversarial Distillation
by: Cui, Huizi, et al.
Published: (2026)
ROSD: Reflective On-Policy Self-Distillation for Language Model Reasoning across Domains
by: Zhao, Ziqi, et al.
Published: (2026)
No Reliable Evidence of Self-Reported Sentience in Small Large Language Models
by: Kaiser, Caspar, et al.
Published: (2026)
Memorization Dynamics in Knowledge Distillation for Language Models
by: Borkar, Jaydeep, et al.
Published: (2026)
OPSDL: On-Policy Self-Distillation for Long-Context Language Models
by: Zhang, Xinsen, et al.
Published: (2026)
Self-Refining Language Model Anonymizers via Adversarial Distillation
by: Kim, Kyuyoung, et al.
Published: (2025)
Semantic Density: Uncertainty Quantification for Large Language Models through Confidence Measurement in Semantic Space
by: Qiu, Xin, et al.
Published: (2024)
BattleAgentBench: A Benchmark for Evaluating Cooperation and Competition Capabilities of Language Models in Multi-Agent Systems
by: Wang, Wei, et al.
Published: (2024)
Retrieval-Augmented and Knowledge-Grounded Language Models for Faithful Clinical Medicine
by: Liu, Fenglin, et al.
Published: (2022)
Advantage-Guided Distillation for Preference Alignment in Small Language Models
by: Gao, Shiping, et al.
Published: (2025)
Quantification of Large Language Model Distillation
by: Lee, Sunbowen, et al.
Published: (2025)
Evaluating Deep Regression Models for WSI-Based Gene-Expression Prediction
by: Gustafsson, Fredrik K., et al.
Published: (2024)