| Main Authors: | Shao, Maanping, Zhang, Feihong, Zhang, Gu, Cheng, Baiye, Xue, Zhengrong, Xu, Huazhe |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.11269 |
Similar Items
H$^3$DP: Triply-Hierarchical Diffusion Policy for Visuomotor Learning
by: Lu, Yiyang, et al.
Published: (2025)
ViTaS: Visual Tactile Soft Fusion Contrastive Learning for Visuomotor Learning
by: Tian, Yufeng, et al.
Published: (2026)
TernaryCLIP: Efficiently Compressing Vision-Language Models with Ternary Weights and Distilled Knowledge
by: Zhang, Shu-Hao, et al.
Published: (2025)
Learning to Manipulate Anywhere: A Visual Generalizable Framework For Reinforcement Learning
by: Yuan, Zhecheng, et al.
Published: (2024)
Distilling Cross-Modal Knowledge via Feature Disentanglement
by: Liu, Junhong, et al.
Published: (2025)
Continuous-Time Distribution Matching for Few-Step Diffusion Distillation
by: Liu, Tao, et al.
Published: (2026)
ArrayTac: A Closed-loop Piezoelectric Tactile Platform for Continuously Tunable Rendering of Shape, Stiffness, and Friction
by: Liang, Tianhai, et al.
Published: (2026)
FDBPL: Faster Distillation-Based Prompt Learning for Region-Aware Vision-Language Models Adaptation
by: Zhang, Zherui, et al.
Published: (2025)
MoE-DP: An MoE-Enhanced Diffusion Policy for Robust Long-Horizon Robotic Manipulation with Skill Decomposition and Failure Recovery
by: Cheng, Baiye, et al.
Published: (2025)
Learning to Project for Cross-Task Knowledge Distillation
by: Auty, Dylan, et al.
Published: (2024)
AgriKD: Cross-Architecture Knowledge Distillation for Efficient Leaf Disease Classification
by: Le, Minh-Dung, et al.
Published: (2026)
CleverDistiller: Simple and Spatially Consistent Cross-modal Distillation
by: Govindarajan, Hariprasath, et al.
Published: (2025)
Agglomerating Large Vision Encoders via Distillation for VFSS Segmentation
by: Zeng, Chengxi, et al.
Published: (2025)
Offline Semantic Guidance for Efficient Vision-Language-Action Policy Distillation
by: Shi, Jin, et al.
Published: (2026)
Align-KD: Distilling Cross-Modal Alignment Knowledge for Mobile Vision-Language Model
by: Feng, Qianhan, et al.
Published: (2024)
Adversarial Prompt Distillation for Vision-Language Models
by: Luo, Lin, et al.
Published: (2024)
Cross Knowledge Distillation between Artificial and Spiking Neural Networks
by: Ye, Shuhan, et al.
Published: (2025)
CIARD: Cyclic Iterative Adversarial Robustness Distillation
by: Lu, Liming, et al.
Published: (2025)
Taming Diffusion for Dataset Distillation with High Representativeness
by: Zhao, Lin, et al.
Published: (2025)
Is Contrastive Distillation Enough for Learning Comprehensive 3D Representations?
by: Zhang, Yifan, et al.
Published: (2024)
Multimodal Distribution Matching for Vision-Language Dataset Distillation
by: Jeong, Jongoh, et al.
Published: (2026)
AnimateDiff-Lightning: Cross-Model Diffusion Distillation
by: Lin, Shanchuan, et al.
Published: (2024)
Vision-Language Meets the Skeleton: Progressively Distillation with Cross-Modal Knowledge for 3D Action Representation Learning
by: Chen, Yang, et al.
Published: (2024)
Multimodal Robust Prompt Distillation for 3D Point Cloud Models
by: Gu, Xiang, et al.
Published: (2025)
Rebalanced Vision-Language Retrieval Considering Structure-Aware Distillation
by: Yang, Yang, et al.
Published: (2024)
COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training
by: Kim, Sanghwan, et al.
Published: (2024)
CD^2: Constrained Dataset Distillation for Few-Shot Class-Incremental Learning
by: Bao, Kexin, et al.
Published: (2026)
DKDM: Data-Free Knowledge Distillation for Diffusion Models with Any Architecture
by: Xiang, Qianlong, et al.
Published: (2024)
Distilling Channels for Efficient Deep Tracking
by: Ge, Shiming, et al.
Published: (2024)
Score Distillation of Flow Matching Models
by: Zhou, Mingyuan, et al.
Published: (2025)
CompoDistill: Attention Distillation for Compositional Reasoning in Multimodal LLMs
by: Kim, Jiwan, et al.
Published: (2025)
Accelerating Diffusion Models with One-to-Many Knowledge Distillation
by: Zhang, Linfeng, et al.
Published: (2024)
Global Intervention and Distillation for Federated Out-of-Distribution Generalization
by: Qi, Zhuang, et al.
Published: (2025)
SiNGER: A Clearer Voice Distills Vision Transformers Further
by: Yu, Geunhyeok, et al.
Published: (2025)
Enhancing Medical Large Vision-Language Models via Alignment Distillation
by: Chang, Aofei, et al.
Published: (2025)
ViT-Linearizer: Distilling Quadratic Knowledge into Linear-Time Vision Models
by: Wei, Guoyizhe, et al.
Published: (2025)
ILDiff: Generate Transparent Animated Stickers by Implicit Layout Distillation
by: Zhang, Ting, et al.
Published: (2024)
Bridging Modality Gap for Visual Grounding with Effecitve Cross-modal Distillation
by: Wang, Jiaxi, et al.
Published: (2023)
Learn where to Click from Yourself: On-Policy Self-Distillation for GUI Grounding
by: Zhang, Yan, et al.
Published: (2026)
Self-supervised Dataset Distillation: A Good Compression Is All You Need
by: Zhou, Muxin, et al.
Published: (2024)