:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Huang, Haiduo, Song, Jiangcheng, Zhang, Yadong, Ren, Pengju
Format:	Preprint
Published:	2025
Subjects:	Computation and Language Artificial Intelligence
Online Access:	https://arxiv.org/abs/2510.24021
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

DeepKD: A Deeply Decoupled and Denoised Knowledge Distillation Trainer
by: Huang, Haiduo, et al.
Published: (2025)

KernelDNA: Dynamic Kernel Sharing via Decoupled Naive Adapters
by: Huang, Haiduo, et al.
Published: (2025)

"The Whole Is Greater Than the Sum of Its Parts": A Compatibility-Aware Multi-Teacher CoT Distillation Framework
by: Cui, Jin, et al.
Published: (2026)

Partial Channel Network: Compute Fewer, Perform Better
by: Huang, Haiduo, et al.
Published: (2025)

A Token is Worth over 1,000 Tokens: Efficient Knowledge Distillation through Low-Rank Clone
by: Hao, Jitai, et al.
Published: (2025)

Jakiro: Boosting Speculative Decoding with Decoupled Multi-Head via MoE
by: Huang, Haiduo, et al.
Published: (2025)

LLM-Oriented Token-Adaptive Knowledge Distillation
by: Xie, Xurong, et al.
Published: (2025)

FastEagle: Cascaded Drafting for Accelerating Speculative Decoding
by: Huang, Haiduo, et al.
Published: (2025)

Nearly Lossless Adaptive Bit Switching
by: Huang, Haiduo, et al.
Published: (2025)

GeGS-PCR: Effective and Robust 3D Point Cloud Registration with Two-Stage Color-Enhanced Geometric-3DGS Fusion
by: Tian, Jiayi, et al.
Published: (2026)

Gumiho: A Hybrid Architecture to Prioritize Early Tokens in Speculative Decoding
by: Li, Jinze, et al.
Published: (2025)

MedDialBench: Benchmarking LLM Diagnostic Robustness under Parametric Adversarial Patient Behaviors
by: Luo, Xiaotian, et al.
Published: (2026)

Explain in Your Own Words: Improving Reasoning via Token-Selective Dual Knowledge Distillation
by: Kim, Minsang, et al.
Published: (2026)

Pair-In, Pair-Out: Latent Multi-Token Prediction for Efficient LLMs
by: Tan, Wenhui, et al.
Published: (2026)

Less is More: Selective Reflection for Compatible and Efficient Knowledge Distillation in Large Language Models
by: Liu, Lingyuan, et al.
Published: (2025)

SpecVLM: Fast Speculative Decoding in Vision-Language Models
by: Huang, Haiduo, et al.
Published: (2025)

Few-Shot Knowledge Distillation of LLMs With Counterfactual Explanations
by: Hamman, Faisal, et al.
Published: (2025)

AlignDistil: Token-Level Language Model Alignment as Adaptive Policy Distillation
by: Zhang, Songming, et al.
Published: (2025)

EasyDistill: A Comprehensive Toolkit for Effective Knowledge Distillation of Large Language Models
by: Wang, Chengyu, et al.
Published: (2025)

AtlasKV: Augmenting LLMs with Billion-Scale Knowledge Graphs in 20GB VRAM
by: Huang, Haoyu, et al.
Published: (2025)

Distilling Reasoning Without Knowledge: A Framework for Reliable LLMs
by: Kietkajornrit, Auksarapak, et al.
Published: (2026)

Unlearning Backdoor Attacks for LLMs with Weak-to-Strong Knowledge Distillation
by: Zhao, Shuai, et al.
Published: (2024)

An Expert is Worth One Token: Synergizing Multiple Expert LLMs as Generalist via Expert Token Routing
by: Chai, Ziwei, et al.
Published: (2024)

RankLLM: Weighted Ranking of LLMs by Quantifying Question Difficulty
by: Zhang, Ziqian, et al.
Published: (2026)

TokenSelect: Efficient Long-Context Inference and Length Extrapolation for LLMs via Dynamic Token-Level KV Cache Selection
by: Wu, Wei, et al.
Published: (2024)

Efficient Intent-Based Filtering for Multi-Party Conversations Using Knowledge Distillation from LLMs
by: Gody, Reem, et al.
Published: (2025)

AdaSPEC: Selective Knowledge Distillation for Efficient Speculative Decoders
by: Hu, Yuezhou, et al.
Published: (2025)

Hybrid Policy Distillation for LLMs
by: Zhu, Wenhong, et al.
Published: (2026)

Reliable Reasoning Path: Distilling Effective Guidance for LLM Reasoning with Knowledge Graphs
by: Xiao, Yilin, et al.
Published: (2025)

Dual-Space Knowledge Distillation for Large Language Models
by: Zhang, Songming, et al.
Published: (2024)

ConTextual: Improving Clinical Text Summarization in LLMs with Context-preserving Token Filtering and Knowledge Graphs
by: Piya, Fahmida Liza, et al.
Published: (2025)

Multi-Stage Balanced Distillation: Addressing Long-Tail Challenges in Sequence-Level Knowledge Distillation
by: Zhou, Yuhang, et al.
Published: (2024)

A Diversity-Enhanced Knowledge Distillation Model for Practical Math Word Problem Solving
by: Zhang, Yi, et al.
Published: (2025)

JoyAI-LLM Flash: Advancing Mid-Scale LLMs with Token Efficiency
by: Cai, Aichen, et al.
Published: (2026)

LLMs are Not Just Next Token Predictors
by: Downes, Stephen M., et al.
Published: (2024)

Partial Convolution Meets Visual Attention
by: Huang, Haiduo, et al.
Published: (2025)

Cornerstones or Stumbling Blocks? Deciphering the Rock Tokens in On-Policy Distillation
by: Jiang, Yuxuan, et al.
Published: (2026)

Incorporating Domain Knowledge into Materials Tokenization
by: Oh, Yerim, et al.
Published: (2025)

Self-Distillation for Multi-Token Prediction
by: Zhao, Guoliang, et al.
Published: (2026)

Can LLMs be Good Graph Judge for Knowledge Graph Construction?
by: Huang, Haoyu, et al.
Published: (2024)