:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Xie, Yucheng, Feng, Fu, Shi, Ruixiao, Wang, Jing, Rui, Yong, Geng, Xin
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition Machine Learning
Online Access:	https://arxiv.org/abs/2601.19694
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

DivControl: Knowledge Diversion for Controllable Image Generation
by: Xie, Yucheng, et al.
Published: (2025)

FINE: Factorizing Knowledge for Initialization of Variable-sized Diffusion Models
by: Xie, Yucheng, et al.
Published: (2024)

A Creative Agent is Worth a 64-Token Template
by: Shi, Ruixiao, et al.
Published: (2026)

KIND: Knowledge Integration and Diversion for Training Decomposable Models
by: Xie, Yucheng, et al.
Published: (2024)

Constraint-based Pre-training: From Structured Constraints to Scalable Model Initialization
by: Feng, Fu, et al.
Published: (2026)

WAVE: Weight Templates for Adaptive Initialization of Variable-sized Models
by: Feng, Fu, et al.
Published: (2024)

FAD: Frequency Adaptation and Diversion for Cross-domain Few-shot Learning
by: Shi, Ruixiao, et al.
Published: (2025)

Exploring Learngene via Stage-wise Weight Sharing for Initializing Variable-sized Models
by: Xia, Shi-Yu, et al.
Published: (2024)

Stratified Knowledge-Density Super-Network for Scalable Vision Transformers
by: Li, Longhua, et al.
Published: (2025)

Enriching Knowledge Distillation with Intra-Class Contrastive Learning
by: Yuan, Hua, et al.
Published: (2025)

Enhancing Cognition and Explainability of Multimodal Foundation Models with Self-Synthesized Data
by: Shi, Yucheng, et al.
Published: (2025)

Self-Supervised Vision Transformers for Writer Retrieval
by: Raven, Tim, et al.
Published: (2024)

Deconstructing Denoising Diffusion Models for Self-Supervised Learning
by: Chen, Xinlei, et al.
Published: (2024)

N-Tree Diffusion for Long-Horizon Wildfire Risk Forecasting
by: Xing, Yucheng, et al.
Published: (2026)

On Using Quasirandom Sequences in Machine Learning for Model Weight Initialization
by: Miranskyy, Andriy, et al.
Published: (2024)

Distribution-Conditional Generation: From Class Distribution to Creative Generation
by: Feng, Fu, et al.
Published: (2025)

Self-Supervised Scalable Deep Compressed Sensing
by: Chen, Bin, et al.
Published: (2023)

Efficient and Long-Tailed Generalization for Pre-trained Vision-Language Model
by: Shi, Jiang-Xin, et al.
Published: (2024)

Self-Improving Small Object Grounding in LVLMs
by: Yang, Tianze, et al.
Published: (2026)

Dataset Distillation for Pre-Trained Self-Supervised Vision Models
by: Cazenavette, George, et al.
Published: (2025)

Temporal Embeddings: Scalable Self-Supervised Temporal Representation Learning from Spatiotemporal Data for Multimodal Computer Vision
by: Cao, Yi, et al.
Published: (2023)

FlowDepth: Decoupling Optical Flow for Self-Supervised Monocular Depth Estimation
by: Sun, Yiyang, et al.
Published: (2024)

A Study on Self-Supervised Pretraining for Vision Problems in Gastrointestinal Endoscopy
by: Sanderson, Edward, et al.
Published: (2024)

MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero Data
by: Li, Zongxia, et al.
Published: (2026)

SupMAE: Supervised Masked Autoencoders Are Efficient Vision Learners
by: Liang, Feng, et al.
Published: (2022)

Input-Adaptive Generative Dynamics in Diffusion Models
by: Xing, Yucheng, et al.
Published: (2024)

Couple to Control: Joint Initial Noise Design in Diffusion Models
by: Jia, Jing, et al.
Published: (2026)

Self-Calibrated Tuning of Vision-Language Models for Out-of-Distribution Detection
by: Yu, Geng, et al.
Published: (2024)

Elastic Attention Cores for Scalable Vision Transformers
by: Song, Alan Z., et al.
Published: (2026)

Knowledge Diversion for Efficient Morphology Control and Policy Transfer
by: Feng, Fu, et al.
Published: (2025)

Redefining <Creative> in Dictionary: Towards an Enhanced Semantic Understanding of Creative Generation
by: Feng, Fu, et al.
Published: (2024)

Vision-Language Models are Strong Noisy Label Detectors
by: Wei, Tong, et al.
Published: (2024)

Stable Consistency Tuning: Understanding and Improving Consistency Models
by: Wang, Fu-Yun, et al.
Published: (2024)

Hyperspectral Anomaly Detection with Self-Supervised Anomaly Prior
by: Liu, Yidan, et al.
Published: (2024)

VLM-AD: End-to-End Autonomous Driving through Vision-Language Model Supervision
by: Xu, Yi, et al.
Published: (2024)

In-Context Symmetries: Self-Supervised Learning through Contextual World Models
by: Gupta, Sharut, et al.
Published: (2024)

LeJEPA: Provable and Scalable Self-Supervised Learning Without the Heuristics
by: Balestriero, Randall, et al.
Published: (2025)

Text-to-CadQuery: A New Paradigm for CAD Generation with Scalable Large Model Capabilities
by: Xie, Haoyang, et al.
Published: (2025)

Self-supervised Vision Transformer are Scalable Generative Models for Domain Generalization
by: Doerrich, Sebastian, et al.
Published: (2024)

SITUATE: Indoor Human Trajectory Prediction through Geometric Features and Self-Supervised Vision Representation
by: Capogrosso, Luigi, et al.
Published: (2024)