| Main Authors: | Xie, Yucheng, Feng, Fu, Shi, Ruixiao, Wang, Jing, Rui, Yong, Geng, Xin |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.19694 |
Similar Items
DivControl: Knowledge Diversion for Controllable Image Generation
by: Xie, Yucheng, et al.
Published: (2025)
FINE: Factorizing Knowledge for Initialization of Variable-sized Diffusion Models
by: Xie, Yucheng, et al.
Published: (2024)
A Creative Agent is Worth a 64-Token Template
by: Shi, Ruixiao, et al.
Published: (2026)
KIND: Knowledge Integration and Diversion for Training Decomposable Models
by: Xie, Yucheng, et al.
Published: (2024)
Constraint-based Pre-training: From Structured Constraints to Scalable Model Initialization
by: Feng, Fu, et al.
Published: (2026)
WAVE: Weight Templates for Adaptive Initialization of Variable-sized Models
by: Feng, Fu, et al.
Published: (2024)
FAD: Frequency Adaptation and Diversion for Cross-domain Few-shot Learning
by: Shi, Ruixiao, et al.
Published: (2025)
Exploring Learngene via Stage-wise Weight Sharing for Initializing Variable-sized Models
by: Xia, Shi-Yu, et al.
Published: (2024)
Stratified Knowledge-Density Super-Network for Scalable Vision Transformers
by: Li, Longhua, et al.
Published: (2025)
Enriching Knowledge Distillation with Intra-Class Contrastive Learning
by: Yuan, Hua, et al.
Published: (2025)
Enhancing Cognition and Explainability of Multimodal Foundation Models with Self-Synthesized Data
by: Shi, Yucheng, et al.
Published: (2025)
Self-Supervised Vision Transformers for Writer Retrieval
by: Raven, Tim, et al.
Published: (2024)
Deconstructing Denoising Diffusion Models for Self-Supervised Learning
by: Chen, Xinlei, et al.
Published: (2024)
N-Tree Diffusion for Long-Horizon Wildfire Risk Forecasting
by: Xing, Yucheng, et al.
Published: (2026)
On Using Quasirandom Sequences in Machine Learning for Model Weight Initialization
by: Miranskyy, Andriy, et al.
Published: (2024)
Distribution-Conditional Generation: From Class Distribution to Creative Generation
by: Feng, Fu, et al.
Published: (2025)
Self-Supervised Scalable Deep Compressed Sensing
by: Chen, Bin, et al.
Published: (2023)
Efficient and Long-Tailed Generalization for Pre-trained Vision-Language Model
by: Shi, Jiang-Xin, et al.
Published: (2024)
Self-Improving Small Object Grounding in LVLMs
by: Yang, Tianze, et al.
Published: (2026)
Dataset Distillation for Pre-Trained Self-Supervised Vision Models
by: Cazenavette, George, et al.
Published: (2025)
Temporal Embeddings: Scalable Self-Supervised Temporal Representation Learning from Spatiotemporal Data for Multimodal Computer Vision
by: Cao, Yi, et al.
Published: (2023)
FlowDepth: Decoupling Optical Flow for Self-Supervised Monocular Depth Estimation
by: Sun, Yiyang, et al.
Published: (2024)
A Study on Self-Supervised Pretraining for Vision Problems in Gastrointestinal Endoscopy
by: Sanderson, Edward, et al.
Published: (2024)
MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero Data
by: Li, Zongxia, et al.
Published: (2026)
SupMAE: Supervised Masked Autoencoders Are Efficient Vision Learners
by: Liang, Feng, et al.
Published: (2022)
Input-Adaptive Generative Dynamics in Diffusion Models
by: Xing, Yucheng, et al.
Published: (2024)
Couple to Control: Joint Initial Noise Design in Diffusion Models
by: Jia, Jing, et al.
Published: (2026)
Self-Calibrated Tuning of Vision-Language Models for Out-of-Distribution Detection
by: Yu, Geng, et al.
Published: (2024)
Elastic Attention Cores for Scalable Vision Transformers
by: Song, Alan Z., et al.
Published: (2026)
Knowledge Diversion for Efficient Morphology Control and Policy Transfer
by: Feng, Fu, et al.
Published: (2025)
Redefining <Creative> in Dictionary: Towards an Enhanced Semantic Understanding of Creative Generation
by: Feng, Fu, et al.
Published: (2024)
Vision-Language Models are Strong Noisy Label Detectors
by: Wei, Tong, et al.
Published: (2024)
Stable Consistency Tuning: Understanding and Improving Consistency Models
by: Wang, Fu-Yun, et al.
Published: (2024)
Hyperspectral Anomaly Detection with Self-Supervised Anomaly Prior
by: Liu, Yidan, et al.
Published: (2024)
VLM-AD: End-to-End Autonomous Driving through Vision-Language Model Supervision
by: Xu, Yi, et al.
Published: (2024)
In-Context Symmetries: Self-Supervised Learning through Contextual World Models
by: Gupta, Sharut, et al.
Published: (2024)
LeJEPA: Provable and Scalable Self-Supervised Learning Without the Heuristics
by: Balestriero, Randall, et al.
Published: (2025)
Text-to-CadQuery: A New Paradigm for CAD Generation with Scalable Large Model Capabilities
by: Xie, Haoyang, et al.
Published: (2025)
Self-supervised Vision Transformer are Scalable Generative Models for Domain Generalization
by: Doerrich, Sebastian, et al.
Published: (2024)
SITUATE: Indoor Human Trajectory Prediction through Geometric Features and Self-Supervised Vision Representation
by: Capogrosso, Luigi, et al.
Published: (2024)