Saved in:
| Main Authors: | Niu, Wenqi, Wang, Yingchao, Cai, Guohui, Hou, Hanpo |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2410.06561 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Federated Progressive Self-Distillation with Logits Calibration for Personalized IIoT Edge Intelligence
by: Wang, Yingchao, et al.
Published: (2024)
by: Wang, Yingchao, et al.
Published: (2024)
Robust Knowledge Distillation Based on Feature Variance Against Backdoored Teacher Model
by: Chen, Jinyin, et al.
Published: (2024)
by: Chen, Jinyin, et al.
Published: (2024)
Multi-Teacher Knowledge Distillation via Teacher-Informed Mixture Priors
by: Fang, Luyang, et al.
Published: (2026)
by: Fang, Luyang, et al.
Published: (2026)
CLIP-Embed-KD: Computationally Efficient Knowledge Distillation Using Embeddings as Teachers
by: Nair, Lakshmi
Published: (2024)
by: Nair, Lakshmi
Published: (2024)
Model Merging via Multi-Teacher Knowledge Distillation
by: Dalili, Seyed Arshan, et al.
Published: (2025)
by: Dalili, Seyed Arshan, et al.
Published: (2025)
Toward Student-Oriented Teacher Network Training For Knowledge Distillation
by: Dong, Chengyu, et al.
Published: (2022)
by: Dong, Chengyu, et al.
Published: (2022)
Robust and Resource-Efficient Data-Free Knowledge Distillation by Generative Pseudo Replay
by: Binici, Kuluhan, et al.
Published: (2022)
by: Binici, Kuluhan, et al.
Published: (2022)
DUET: Distilled LLM Unlearning from an Efficiently Contextualized Teacher
by: Zhong, Yisheng, et al.
Published: (2026)
by: Zhong, Yisheng, et al.
Published: (2026)
Group Relative Knowledge Distillation: Learning from Teacher's Relational Inductive Bias
by: Li, Chao, et al.
Published: (2025)
by: Li, Chao, et al.
Published: (2025)
Generalizing Teacher Networks for Effective Knowledge Distillation Across Student Architectures
by: Binici, Kuluhan, et al.
Published: (2024)
by: Binici, Kuluhan, et al.
Published: (2024)
GSTAM: Efficient Graph Distillation with Structural Attention-Matching
by: Rasti-Meymandi, Arash, et al.
Published: (2024)
by: Rasti-Meymandi, Arash, et al.
Published: (2024)
The Role of Teacher Calibration in Knowledge Distillation
by: Kim, Suyoung, et al.
Published: (2025)
by: Kim, Suyoung, et al.
Published: (2025)
FedMTFI: Feature Importance Based Optimized Multi Teacher Knowledge Distillation in Heterogeneous Federated Learning Environment
by: Shadin, Nazmus Shakib, et al.
Published: (2026)
by: Shadin, Nazmus Shakib, et al.
Published: (2026)
Membership and Memorization in LLM Knowledge Distillation
by: Zhang, Ziqi, et al.
Published: (2025)
by: Zhang, Ziqi, et al.
Published: (2025)
Knowledge Distillation in Wide Neural Networks: Risk Bound, Data Efficiency and Imperfect Teacher
by: Ji, Guangda, et al.
Published: (2020)
by: Ji, Guangda, et al.
Published: (2020)
Trajectory as the Teacher: Few-Step Discrete Flow Matching via Energy-Navigated Distillation
by: Monsefi, Amin Karimi, et al.
Published: (2026)
by: Monsefi, Amin Karimi, et al.
Published: (2026)
Geometric Flow Matching for Molecular Conformation Generation via Manifold Decomposition
by: Liu, Yunqing, et al.
Published: (2026)
by: Liu, Yunqing, et al.
Published: (2026)
Teacher as a Lenient Expert: Teacher-Agnostic Data-Free Knowledge Distillation
by: Shin, Hyunjune, et al.
Published: (2024)
by: Shin, Hyunjune, et al.
Published: (2024)
Good Teachers Explain: Explanation-Enhanced Knowledge Distillation
by: Parchami-Araghi, Amin, et al.
Published: (2024)
by: Parchami-Araghi, Amin, et al.
Published: (2024)
Beyond Answers: Transferring Reasoning Capabilities to Smaller LLMs Using Multi-Teacher Knowledge Distillation
by: Tian, Yijun, et al.
Published: (2024)
by: Tian, Yijun, et al.
Published: (2024)
Dimer-Enhanced Optimization: A First-Order Approach to Escaping Saddle Points in Neural Network Training
by: Hu, Yue, et al.
Published: (2025)
by: Hu, Yue, et al.
Published: (2025)
Confident or Seek Stronger: Exploring Uncertainty-Based On-device LLM Routing From Benchmarking to Generalization
by: Chuang, Yu-Neng, et al.
Published: (2025)
by: Chuang, Yu-Neng, et al.
Published: (2025)
Distilling Genomic Models for Efficient mRNA Representation Learning via Embedding Matching
by: Haidari, Rasched, et al.
Published: (2026)
by: Haidari, Rasched, et al.
Published: (2026)
DistDD: Distributed Data Distillation Aggregation through Gradient Matching
by: Wang, Peiran, et al.
Published: (2024)
by: Wang, Peiran, et al.
Published: (2024)
Efficient Knowledge Distillation via Curriculum Extraction
by: Gupta, Shivam, et al.
Published: (2025)
by: Gupta, Shivam, et al.
Published: (2025)
LLMBoost: Make Large Language Models Stronger with Boosting
by: Chen, Zehao, et al.
Published: (2025)
by: Chen, Zehao, et al.
Published: (2025)
Knowledge Distillation Must Account for What It Loses
by: Wang, Wenshuo
Published: (2026)
by: Wang, Wenshuo
Published: (2026)
Preference-Based Self-Distillation: Beyond KL Matching via Reward Regularization
by: Yu, Xin, et al.
Published: (2026)
by: Yu, Xin, et al.
Published: (2026)
Path-Guided Flow Matching for Dataset Distillation
by: Li, Xuhui, et al.
Published: (2026)
by: Li, Xuhui, et al.
Published: (2026)
DouRN: Improving DouZero by Residual Neural Networks
by: Chen, Yiquan, et al.
Published: (2024)
by: Chen, Yiquan, et al.
Published: (2024)
Efficient Epistemic Uncertainty Estimation for Large Language Models via Knowledge Distillation
by: Park, Seonghyeon, et al.
Published: (2026)
by: Park, Seonghyeon, et al.
Published: (2026)
Leveraging Knowledge Distillation for Efficient Deep Reinforcement Learning in Resource-Constrained Environments
by: Meng, Guanlin
Published: (2023)
by: Meng, Guanlin
Published: (2023)
Linear Projections of Teacher Embeddings for Few-Class Distillation
by: Loo, Noel, et al.
Published: (2024)
by: Loo, Noel, et al.
Published: (2024)
A Stronger Mixture of Low-Rank Experts for Fine-Tuning Foundation Models
by: Sun, Mengyang, et al.
Published: (2025)
by: Sun, Mengyang, et al.
Published: (2025)
Multi-Label Knowledge Distillation
by: Yang, Penghui, et al.
Published: (2023)
by: Yang, Penghui, et al.
Published: (2023)
Teaching the Teacher: The Role of Teacher-Student Smoothness Alignment in Genetic Programming-based Symbolic Distillation
by: Dhar, Soumyadeep, et al.
Published: (2025)
by: Dhar, Soumyadeep, et al.
Published: (2025)
Multi-Stage Knowledge-Distilled VGAE and GAT for Robust Controller-Area-Network Intrusion Detection
by: Frenken, Robert, et al.
Published: (2025)
by: Frenken, Robert, et al.
Published: (2025)
Online Adversarial Knowledge Distillation for Graph Neural Networks
by: Wang, Can, et al.
Published: (2021)
by: Wang, Can, et al.
Published: (2021)
LLM-NEO: Parameter Efficient Knowledge Distillation for Large Language Models
by: Yang, Runming, et al.
Published: (2024)
by: Yang, Runming, et al.
Published: (2024)
When Are Teacher Tokens Reliable? Position-Weighted On-Policy Self-Distillation for Reasoning
by: Liu, Xiaogeng, et al.
Published: (2026)
by: Liu, Xiaogeng, et al.
Published: (2026)
Similar Items
-
Federated Progressive Self-Distillation with Logits Calibration for Personalized IIoT Edge Intelligence
by: Wang, Yingchao, et al.
Published: (2024) -
Robust Knowledge Distillation Based on Feature Variance Against Backdoored Teacher Model
by: Chen, Jinyin, et al.
Published: (2024) -
Multi-Teacher Knowledge Distillation via Teacher-Informed Mixture Priors
by: Fang, Luyang, et al.
Published: (2026) -
CLIP-Embed-KD: Computationally Efficient Knowledge Distillation Using Embeddings as Teachers
by: Nair, Lakshmi
Published: (2024) -
Model Merging via Multi-Teacher Knowledge Distillation
by: Dalili, Seyed Arshan, et al.
Published: (2025)