Saved in:
| Main Authors: | Gholami, Mohsen, Akbari, Mohammad, Hu, Cindy, Masrani, Vaden, Wang, Z. Jane, Zhang, Yong |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2403.19754 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Task-Agnostic Language Model Watermarking via High Entropy Passthrough Layers
by: Masrani, Vaden, et al.
Published: (2024)
by: Masrani, Vaden, et al.
Published: (2024)
CASP: Compression of Large Multimodal Models Based on Attention Sparsity
by: Gholami, Mohsen, et al.
Published: (2025)
by: Gholami, Mohsen, et al.
Published: (2025)
Generation, Distillation and Evaluation of Motivational Interviewing-Style Reflections with a Foundational Language Model
by: Brown, Andrew, et al.
Published: (2024)
by: Brown, Andrew, et al.
Published: (2024)
GOLD: Geometry Problem Solver with Natural Language Description
by: Zhang, Jiaxin, et al.
Published: (2024)
by: Zhang, Jiaxin, et al.
Published: (2024)
TAIA: Large Language Models are Out-of-Distribution Data Learners
by: Jiang, Shuyang, et al.
Published: (2024)
by: Jiang, Shuyang, et al.
Published: (2024)
Extracting General-use Transformers for Low-resource Languages via Knowledge Distillation
by: Cruz, Jan Christian Blaise, et al.
Published: (2025)
by: Cruz, Jan Christian Blaise, et al.
Published: (2025)
Rainproof: An Umbrella To Shield Text Generators From Out-Of-Distribution Data
by: Darrin, Maxime, et al.
Published: (2022)
by: Darrin, Maxime, et al.
Published: (2022)
Distilling Implicit Multimodal Knowledge into Large Language Models for Zero-Resource Dialogue Generation
by: Zhang, Bo, et al.
Published: (2024)
by: Zhang, Bo, et al.
Published: (2024)
LLM-Guided Knowledge Distillation for Temporal Knowledge Graph Reasoning
by: Xing, Wang, et al.
Published: (2026)
by: Xing, Wang, et al.
Published: (2026)
Spatial Reasoning with Vision-Language Models in Ego-Centric Multi-View Scenes
by: Gholami, Mohsen, et al.
Published: (2025)
by: Gholami, Mohsen, et al.
Published: (2025)
Out-of-Distribution Detection using Synthetic Data Generation
by: Abbas, Momin, et al.
Published: (2025)
by: Abbas, Momin, et al.
Published: (2025)
Exploring and Enhancing the Transfer of Distribution in Knowledge Distillation for Autoregressive Language Models
by: Rao, Jun, et al.
Published: (2024)
by: Rao, Jun, et al.
Published: (2024)
LLM-based Privacy Data Augmentation Guided by Knowledge Distillation with a Distribution Tutor for Medical Text Classification
by: Song, Yiping, et al.
Published: (2024)
by: Song, Yiping, et al.
Published: (2024)
COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation
by: Zhou, Tianyi, et al.
Published: (2026)
by: Zhou, Tianyi, et al.
Published: (2026)
EnrichEvent: Enriching Social Data with Contextual Information for Emerging Event Extraction
by: Esfahani, Mohammadali Sefidi, et al.
Published: (2023)
by: Esfahani, Mohammadali Sefidi, et al.
Published: (2023)
Compact Language Models via Pruning and Knowledge Distillation
by: Muralidharan, Saurav, et al.
Published: (2024)
by: Muralidharan, Saurav, et al.
Published: (2024)
On the Generalization vs Fidelity Paradox in Knowledge Distillation
by: Ramesh, Suhas Kamasetty, et al.
Published: (2025)
by: Ramesh, Suhas Kamasetty, et al.
Published: (2025)
$\mathcal{X}$-KD: General Experiential Knowledge Distillation for Large Language Models
by: Cai, Yuang, et al.
Published: (2026)
by: Cai, Yuang, et al.
Published: (2026)
Distribution Corrected Offline Data Distillation for Large Language Models
by: Zhang, Yumeng, et al.
Published: (2026)
by: Zhang, Yumeng, et al.
Published: (2026)
Multi-MLLM Knowledge Distillation for Out-of-Context News Detection
by: Gu, Yimeng, et al.
Published: (2025)
by: Gu, Yimeng, et al.
Published: (2025)
LLM-Oriented Token-Adaptive Knowledge Distillation
by: Xie, Xurong, et al.
Published: (2025)
by: Xie, Xurong, et al.
Published: (2025)
Scaling Knowledge Graph Construction through Synthetic Data Generation and Distillation
by: Choubey, Prafulla Kumar, et al.
Published: (2024)
by: Choubey, Prafulla Kumar, et al.
Published: (2024)
Knowledge Graph-Guided Retrieval Augmented Generation
by: Zhu, Xiangrong, et al.
Published: (2025)
by: Zhu, Xiangrong, et al.
Published: (2025)
A Dual-Space Framework for General Knowledge Distillation of Large Language Models
by: Zhang, Xue, et al.
Published: (2025)
by: Zhang, Xue, et al.
Published: (2025)
Large Language Models are Limited in Out-of-Context Knowledge Reasoning
by: Hu, Peng, et al.
Published: (2024)
by: Hu, Peng, et al.
Published: (2024)
Enhancing Knowledge Distillation of Large Language Models through Efficient Multi-Modal Distribution Alignment
by: Peng, Tianyu, et al.
Published: (2024)
by: Peng, Tianyu, et al.
Published: (2024)
MMIDR: Teaching Large Language Model to Interpret Multimodal Misinformation via Knowledge Distillation
by: Wang, Longzheng, et al.
Published: (2024)
by: Wang, Longzheng, et al.
Published: (2024)
Pedagogically-Inspired Data Synthesis for Language Model Knowledge Distillation
by: He, Bowei, et al.
Published: (2026)
by: He, Bowei, et al.
Published: (2026)
Differentially Private Knowledge Distillation via Synthetic Text Generation
by: Flemings, James, et al.
Published: (2024)
by: Flemings, James, et al.
Published: (2024)
Does Knowledge Distillation Matter for Large Language Model based Bundle Generation?
by: Feng, Kaidong, et al.
Published: (2025)
by: Feng, Kaidong, et al.
Published: (2025)
DSG-KD: Knowledge Distillation from Domain-Specific to General Language Models
by: Cho, Sangyeon, et al.
Published: (2024)
by: Cho, Sangyeon, et al.
Published: (2024)
EGAD: Entropy-Guided Adaptive Distillation for Token-Level Knowledge Transfer
by: Zhang, Hao, et al.
Published: (2026)
by: Zhang, Hao, et al.
Published: (2026)
ELAD: Explanation-Guided Large Language Models Active Distillation
by: Zhang, Yifei, et al.
Published: (2024)
by: Zhang, Yifei, et al.
Published: (2024)
Privacy-Preserving Reasoning with Knowledge-Distilled Parametric Retrieval Augmented Generation
by: Chen, Jinwen, et al.
Published: (2025)
by: Chen, Jinwen, et al.
Published: (2025)
PromptKD: Distilling Student-Friendly Knowledge for Generative Language Models via Prompt Tuning
by: Kim, Gyeongman, et al.
Published: (2024)
by: Kim, Gyeongman, et al.
Published: (2024)
Gumbel Distillation for Parallel Text Generation
by: Zhang, Chi, et al.
Published: (2026)
by: Zhang, Chi, et al.
Published: (2026)
Knowledge Distillation for Temporal Knowledge Graph Reasoning with Large Language Models
by: Xing, Wang, et al.
Published: (2026)
by: Xing, Wang, et al.
Published: (2026)
Preconditioned Test-Time Adaptation for Out-of-Distribution Debiasing in Narrative Generation
by: Shen, Hanwen, et al.
Published: (2026)
by: Shen, Hanwen, et al.
Published: (2026)
Smoothing Out Hallucinations: Mitigating LLM Hallucination with Smoothed Knowledge Distillation
by: Nguyen, Hieu, et al.
Published: (2025)
by: Nguyen, Hieu, et al.
Published: (2025)
Routing Distilled Knowledge via Mixture of LoRA Experts for Large Language Model based Bundle Generation
by: Feng, Kaidong, et al.
Published: (2025)
by: Feng, Kaidong, et al.
Published: (2025)
Similar Items
-
Task-Agnostic Language Model Watermarking via High Entropy Passthrough Layers
by: Masrani, Vaden, et al.
Published: (2024) -
CASP: Compression of Large Multimodal Models Based on Attention Sparsity
by: Gholami, Mohsen, et al.
Published: (2025) -
Generation, Distillation and Evaluation of Motivational Interviewing-Style Reflections with a Foundational Language Model
by: Brown, Andrew, et al.
Published: (2024) -
GOLD: Geometry Problem Solver with Natural Language Description
by: Zhang, Jiaxin, et al.
Published: (2024) -
TAIA: Large Language Models are Out-of-Distribution Data Learners
by: Jiang, Shuyang, et al.
Published: (2024)