:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Gholami, Mohsen, Akbari, Mohammad, Hu, Cindy, Masrani, Vaden, Wang, Z. Jane, Zhang, Yong
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2403.19754
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Task-Agnostic Language Model Watermarking via High Entropy Passthrough Layers
by: Masrani, Vaden, et al.
Published: (2024)

CASP: Compression of Large Multimodal Models Based on Attention Sparsity
by: Gholami, Mohsen, et al.
Published: (2025)

Generation, Distillation and Evaluation of Motivational Interviewing-Style Reflections with a Foundational Language Model
by: Brown, Andrew, et al.
Published: (2024)

GOLD: Geometry Problem Solver with Natural Language Description
by: Zhang, Jiaxin, et al.
Published: (2024)

TAIA: Large Language Models are Out-of-Distribution Data Learners
by: Jiang, Shuyang, et al.
Published: (2024)

Extracting General-use Transformers for Low-resource Languages via Knowledge Distillation
by: Cruz, Jan Christian Blaise, et al.
Published: (2025)

Rainproof: An Umbrella To Shield Text Generators From Out-Of-Distribution Data
by: Darrin, Maxime, et al.
Published: (2022)

Distilling Implicit Multimodal Knowledge into Large Language Models for Zero-Resource Dialogue Generation
by: Zhang, Bo, et al.
Published: (2024)

LLM-Guided Knowledge Distillation for Temporal Knowledge Graph Reasoning
by: Xing, Wang, et al.
Published: (2026)

Spatial Reasoning with Vision-Language Models in Ego-Centric Multi-View Scenes
by: Gholami, Mohsen, et al.
Published: (2025)

Out-of-Distribution Detection using Synthetic Data Generation
by: Abbas, Momin, et al.
Published: (2025)

Exploring and Enhancing the Transfer of Distribution in Knowledge Distillation for Autoregressive Language Models
by: Rao, Jun, et al.
Published: (2024)

LLM-based Privacy Data Augmentation Guided by Knowledge Distillation with a Distribution Tutor for Medical Text Classification
by: Song, Yiping, et al.
Published: (2024)

COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation
by: Zhou, Tianyi, et al.
Published: (2026)

EnrichEvent: Enriching Social Data with Contextual Information for Emerging Event Extraction
by: Esfahani, Mohammadali Sefidi, et al.
Published: (2023)

Compact Language Models via Pruning and Knowledge Distillation
by: Muralidharan, Saurav, et al.
Published: (2024)

On the Generalization vs Fidelity Paradox in Knowledge Distillation
by: Ramesh, Suhas Kamasetty, et al.
Published: (2025)

$\mathcal{X}$-KD: General Experiential Knowledge Distillation for Large Language Models
by: Cai, Yuang, et al.
Published: (2026)

Distribution Corrected Offline Data Distillation for Large Language Models
by: Zhang, Yumeng, et al.
Published: (2026)

Multi-MLLM Knowledge Distillation for Out-of-Context News Detection
by: Gu, Yimeng, et al.
Published: (2025)

LLM-Oriented Token-Adaptive Knowledge Distillation
by: Xie, Xurong, et al.
Published: (2025)

Scaling Knowledge Graph Construction through Synthetic Data Generation and Distillation
by: Choubey, Prafulla Kumar, et al.
Published: (2024)

Knowledge Graph-Guided Retrieval Augmented Generation
by: Zhu, Xiangrong, et al.
Published: (2025)

A Dual-Space Framework for General Knowledge Distillation of Large Language Models
by: Zhang, Xue, et al.
Published: (2025)

Large Language Models are Limited in Out-of-Context Knowledge Reasoning
by: Hu, Peng, et al.
Published: (2024)

Enhancing Knowledge Distillation of Large Language Models through Efficient Multi-Modal Distribution Alignment
by: Peng, Tianyu, et al.
Published: (2024)

MMIDR: Teaching Large Language Model to Interpret Multimodal Misinformation via Knowledge Distillation
by: Wang, Longzheng, et al.
Published: (2024)

Pedagogically-Inspired Data Synthesis for Language Model Knowledge Distillation
by: He, Bowei, et al.
Published: (2026)

Differentially Private Knowledge Distillation via Synthetic Text Generation
by: Flemings, James, et al.
Published: (2024)

Does Knowledge Distillation Matter for Large Language Model based Bundle Generation?
by: Feng, Kaidong, et al.
Published: (2025)

DSG-KD: Knowledge Distillation from Domain-Specific to General Language Models
by: Cho, Sangyeon, et al.
Published: (2024)

EGAD: Entropy-Guided Adaptive Distillation for Token-Level Knowledge Transfer
by: Zhang, Hao, et al.
Published: (2026)

ELAD: Explanation-Guided Large Language Models Active Distillation
by: Zhang, Yifei, et al.
Published: (2024)

Privacy-Preserving Reasoning with Knowledge-Distilled Parametric Retrieval Augmented Generation
by: Chen, Jinwen, et al.
Published: (2025)

PromptKD: Distilling Student-Friendly Knowledge for Generative Language Models via Prompt Tuning
by: Kim, Gyeongman, et al.
Published: (2024)

Gumbel Distillation for Parallel Text Generation
by: Zhang, Chi, et al.
Published: (2026)

Knowledge Distillation for Temporal Knowledge Graph Reasoning with Large Language Models
by: Xing, Wang, et al.
Published: (2026)

Preconditioned Test-Time Adaptation for Out-of-Distribution Debiasing in Narrative Generation
by: Shen, Hanwen, et al.
Published: (2026)

Smoothing Out Hallucinations: Mitigating LLM Hallucination with Smoothed Knowledge Distillation
by: Nguyen, Hieu, et al.
Published: (2025)

Routing Distilled Knowledge via Mixture of LoRA Experts for Large Language Model based Bundle Generation
by: Feng, Kaidong, et al.
Published: (2025)