Saved in:
| Main Authors: | Tian, Huiyuan, Xu, Bonan, Li, Shijian |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2511.06848 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
From Per-Image Low-Rank to Encoding Mismatch: Rethinking Feature Distillation in Vision Transformers
by: Tian, Huiyuan, et al.
Published: (2025)
by: Tian, Huiyuan, et al.
Published: (2025)
SpectralKD: A Unified Framework for Interpreting and Distilling Vision Transformers via Spectral Analysis
by: Tian, Huiyuan, et al.
Published: (2024)
by: Tian, Huiyuan, et al.
Published: (2024)
Vision Transformers with Self-Distilled Registers
by: Chen, Yinjie, et al.
Published: (2025)
by: Chen, Yinjie, et al.
Published: (2025)
Mutual Distillation Learning For Person Re-Identification
by: Fu, Huiyuan, et al.
Published: (2024)
by: Fu, Huiyuan, et al.
Published: (2024)
CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction
by: Wu, Size, et al.
Published: (2023)
by: Wu, Size, et al.
Published: (2023)
Distilling Vision Transformers for Distortion-Robust Representation Learning
by: Alexis, Konstantinos, et al.
Published: (2026)
by: Alexis, Konstantinos, et al.
Published: (2026)
Knowledge Distillation in Vision Transformers: A Critical Review
by: Habib, Gousia, et al.
Published: (2023)
by: Habib, Gousia, et al.
Published: (2023)
Multi-Depth Branch Network for Efficient Image Super-Resolution
by: Tian, Huiyuan, et al.
Published: (2023)
by: Tian, Huiyuan, et al.
Published: (2023)
Towards Online Multi-Modal Social Interaction Understanding
by: Li, Xinpeng, et al.
Published: (2025)
by: Li, Xinpeng, et al.
Published: (2025)
Structurally Disentangled Feature Fields Distillation for 3D Understanding and Editing
by: Levy, Yoel, et al.
Published: (2025)
by: Levy, Yoel, et al.
Published: (2025)
Towards Optimal Trade-offs in Knowledge Distillation for CNNs and Vision Transformers at the Edge
by: Violos, John, et al.
Published: (2024)
by: Violos, John, et al.
Published: (2024)
Q-VDiT: Towards Accurate Quantization and Distillation of Video-Generation Diffusion Transformers
by: Feng, Weilun, et al.
Published: (2025)
by: Feng, Weilun, et al.
Published: (2025)
Dataset Distillation with Probabilistic Latent Features
by: Li, Zhe, et al.
Published: (2025)
by: Li, Zhe, et al.
Published: (2025)
X-Distill: Cross-Architecture Vision Distillation for Visuomotor Learning
by: Shao, Maanping, et al.
Published: (2026)
by: Shao, Maanping, et al.
Published: (2026)
AMMKD: Adaptive Multimodal Multi-teacher Distillation for Lightweight Vision-Language Models
by: Li, Yuqi, et al.
Published: (2025)
by: Li, Yuqi, et al.
Published: (2025)
Distill-SODA: Distilling Self-Supervised Vision Transformer for Source-Free Open-Set Domain Adaptation in Computational Pathology
by: Vray, Guillaume, et al.
Published: (2023)
by: Vray, Guillaume, et al.
Published: (2023)
3D Feature Distillation with Object-Centric Priors
by: Tziafas, Georgios, et al.
Published: (2024)
by: Tziafas, Georgios, et al.
Published: (2024)
Vision-Language Dataset Distillation
by: Wu, Xindi, et al.
Published: (2023)
by: Wu, Xindi, et al.
Published: (2023)
VideoDistill: Language-aware Vision Distillation for Video Question Answering
by: Zou, Bo, et al.
Published: (2024)
by: Zou, Bo, et al.
Published: (2024)
KD-DETR: Knowledge Distillation for Detection Transformer with Consistent Distillation Points Sampling
by: Wang, Yu, et al.
Published: (2022)
by: Wang, Yu, et al.
Published: (2022)
Distill, Diffuse, and Semanticize (DDS): Annotation-Free 3D Scene Understanding Based on Multi-Granularity Distillation and Graph-Diffusion-Based Segmentation
by: Wang, Yijing, et al.
Published: (2026)
by: Wang, Yijing, et al.
Published: (2026)
X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distillation
by: Ma, Jian, et al.
Published: (2025)
by: Ma, Jian, et al.
Published: (2025)
Comb, Prune, Distill: Towards Unified Pruning for Vision Model Compression
by: Schmitt, Jonas, et al.
Published: (2024)
by: Schmitt, Jonas, et al.
Published: (2024)
SDRT: Enhance Vision-Language Models by Self-Distillation with Diverse Reasoning Traces
by: Wu, Guande, et al.
Published: (2025)
by: Wu, Guande, et al.
Published: (2025)
Autoregressive Distillation of Diffusion Transformers
by: Kim, Yeongmin, et al.
Published: (2025)
by: Kim, Yeongmin, et al.
Published: (2025)
Brain-CLIPLM: Decoding Compressed Semantic Representations in EEG for Language Reconstruction
by: Yang, Xiaoli, et al.
Published: (2026)
by: Yang, Xiaoli, et al.
Published: (2026)
RMMSS: Towards Advanced Robust Multi-Modal Semantic Segmentation with Hybrid Prototype Distillation and Feature Selection
by: Tan, Jiaqi, et al.
Published: (2025)
by: Tan, Jiaqi, et al.
Published: (2025)
Distillation of Diffusion Features for Semantic Correspondence
by: Fundel, Frank, et al.
Published: (2024)
by: Fundel, Frank, et al.
Published: (2024)
Preserving Angles Improves Feature Distillation
by: Mannix, Evelyn J., et al.
Published: (2024)
by: Mannix, Evelyn J., et al.
Published: (2024)
Non-target Divergence Hypothesis: Toward Understanding Domain Gaps in Cross-Modal Knowledge Distillation
by: Chen, Yilong, et al.
Published: (2024)
by: Chen, Yilong, et al.
Published: (2024)
Towards Trustworthy Dataset Distillation
by: Ma, Shijie, et al.
Published: (2023)
by: Ma, Shijie, et al.
Published: (2023)
DSConv: Dynamic Splitting Convolution for Pansharpening
by: Liu, Xuanyu, et al.
Published: (2025)
by: Liu, Xuanyu, et al.
Published: (2025)
MagicDistillation: Weak-to-Strong Video Distillation for Large-Scale Few-Step Synthesis
by: Shao, Shitong, et al.
Published: (2025)
by: Shao, Shitong, et al.
Published: (2025)
Omni-MMSI: Toward Identity-attributed Social Interaction Understanding
by: Li, Xinpeng, et al.
Published: (2026)
by: Li, Xinpeng, et al.
Published: (2026)
Visual-Advantage On-Policy Distillation for Vision-Language Models
by: Liu, Ruiqi, et al.
Published: (2026)
by: Liu, Ruiqi, et al.
Published: (2026)
PartDistill: 3D Shape Part Segmentation by Vision-Language Model Distillation
by: Umam, Ardian, et al.
Published: (2023)
by: Umam, Ardian, et al.
Published: (2023)
The Role of Masking for Efficient Supervised Knowledge Distillation of Vision Transformers
by: Son, Seungwoo, et al.
Published: (2023)
by: Son, Seungwoo, et al.
Published: (2023)
DFMSD: Dual Feature Masking Stage-wise Knowledge Distillation for Object Detection
by: Zhang, Zhourui, et al.
Published: (2024)
by: Zhang, Zhourui, et al.
Published: (2024)
Visual Program Distillation: Distilling Tools and Programmatic Reasoning into Vision-Language Models
by: Hu, Yushi, et al.
Published: (2023)
by: Hu, Yushi, et al.
Published: (2023)
Knowledge Distillation Based on Transformed Teacher Matching
by: Zheng, Kaixiang, et al.
Published: (2024)
by: Zheng, Kaixiang, et al.
Published: (2024)
Similar Items
-
From Per-Image Low-Rank to Encoding Mismatch: Rethinking Feature Distillation in Vision Transformers
by: Tian, Huiyuan, et al.
Published: (2025) -
SpectralKD: A Unified Framework for Interpreting and Distilling Vision Transformers via Spectral Analysis
by: Tian, Huiyuan, et al.
Published: (2024) -
Vision Transformers with Self-Distilled Registers
by: Chen, Yinjie, et al.
Published: (2025) -
Mutual Distillation Learning For Person Re-Identification
by: Fu, Huiyuan, et al.
Published: (2024) -
CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction
by: Wu, Size, et al.
Published: (2023)