:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Ren, Peng, Yang, Hai
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition Machine Learning
Online Access:	https://arxiv.org/abs/2510.15392
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Stylized Text-to-Motion Generation via Hypernetwork-Driven Low-Rank Adaptation
by: Jeon, Junhyuk, et al.
Published: (2026)

REED-VAE: RE-Encode Decode Training for Iterative Image Editing with Diffusion Models
by: Almog, Gal, et al.
Published: (2025)

FissionVAE: Federated Non-IID Image Generation with Latent Space and Decoder Decomposition
by: Hu, Chen, et al.
Published: (2024)

Spatio-Temporal Branching for Motion Prediction using Motion Increments
by: Wang, Jiexin, et al.
Published: (2023)

Seeing Far and Clearly: Mitigating Hallucinations in MLLMs with Attention Causal Decoding
by: Tang, Feilong, et al.
Published: (2025)

Closed-Loop Unsupervised Representation Disentanglement with $β$-VAE Distillation and Diffusion Probabilistic Feedback
by: Jin, Xin, et al.
Published: (2024)

Balanced Image Stylization with Style Matching Score
by: Jiang, Yuxin, et al.
Published: (2025)

LiteVAE: Lightweight and Efficient Variational Autoencoders for Latent Diffusion Models
by: Sadat, Seyedmorteza, et al.
Published: (2024)

StreamDiffusionV2: A Streaming System for Dynamic and Interactive Video Generation
by: Feng, Tianrui, et al.
Published: (2025)

A Converting Autoencoder Toward Low-latency and Energy-efficient DNN Inference at the Edge
by: Mahmud, Hasanul, et al.
Published: (2024)

REPA-E: Unlocking VAE for End-to-End Tuning with Latent Diffusion Transformers
by: Leng, Xingjian, et al.
Published: (2025)

MotionCrafter: Dense Geometry and Motion Reconstruction with a 4D VAE
by: Zhu, Ruijie, et al.
Published: (2026)

SVGFusion: A VAE-Diffusion Transformer for Vector Graphic Generation
by: Xing, Ximing, et al.
Published: (2024)

SepVAE: a contrastive VAE to separate pathological patterns from healthy ones
by: Louiset, Robin, et al.
Published: (2023)

Elucidating the Design Space of Arbitrary-Noise-Based Diffusion Models
by: Qiu, Xingyu, et al.
Published: (2025)

Remembering by Reconstructing: Domain Incremental Learning With Test-Time Training on Video Streams
by: Swinnen, Jonathan, et al.
Published: (2026)

VFM-VAE: Vision Foundation Models Can Be Good Tokenizers for Latent Diffusion Models
by: Bi, Tianci, et al.
Published: (2025)

Streaming-dLLM: Accelerating Diffusion LLMs via Suffix Pruning and Dynamic Decoding
by: Xiao, Zhongyu, et al.
Published: (2026)

MotionStream: Real-Time Video Generation with Interactive Motion Controls
by: Shin, Joonghyuk, et al.
Published: (2025)

Large VLM-based Stylized Sports Captioning
by: Dhar, Sauptik, et al.
Published: (2025)

StyDeSty: Min-Max Stylization and Destylization for Single Domain Generalization
by: Liu, Songhua, et al.
Published: (2024)

Stylized Synthetic Augmentation further improves Corruption Robustness
by: Siedel, Georg, et al.
Published: (2025)

CASHG: Context-Aware Stylized Online Handwriting Generation
by: Shin, Jinsu, et al.
Published: (2026)

DS-AL: A Dual-Stream Analytic Learning for Exemplar-Free Class-Incremental Learning
by: Zhuang, Huiping, et al.
Published: (2024)

Diffusion Models Learn Low-Dimensional Distributions via Subspace Clustering
by: Wang, Peng, et al.
Published: (2024)

Adaptive Adapter Routing for Long-Tailed Class-Incremental Learning
by: Qi, Zhi-Hong, et al.
Published: (2024)

CoMusion: Towards Consistent Stochastic Human Motion Prediction via Motion Diffusion
by: Sun, Jiarui, et al.
Published: (2023)

Memory Storyboard: Leveraging Temporal Segmentation for Streaming Self-Supervised Learning from Egocentric Videos
by: Yang, Yanlai, et al.
Published: (2025)

Understanding Representation Dynamics of Diffusion Models via Low-Dimensional Modeling
by: Li, Xiao, et al.
Published: (2025)

Quantize-then-Rectify: Efficient VQ-VAE Training
by: Zhang, Borui, et al.
Published: (2025)

Bridging the Simulation-to-Reality Gap in Electron Microscope Calibration via VAE-EM Estimation
by: van Hulst, Jilles S., et al.
Published: (2026)

Multimodal ELBO with Diffusion Decoders
by: Wesego, Daniel, et al.
Published: (2024)

Flexible Motion In-betweening with Diffusion Models
by: Cohan, Setareh, et al.
Published: (2024)

MotionMatcher: Motion Customization of Text-to-Video Diffusion Models via Motion Feature Matching
by: Wu, Yen-Siang, et al.
Published: (2025)

SR$^2$-LoRA: Self-Rectifying Inter-layer Relations in Low-Rank Adaptation for Class-Incremental Learning
by: Wan, Fengqiang, et al.
Published: (2026)

C$^{2}$INet: Realizing Incremental Trajectory Prediction with Prior-Aware Continual Causal Intervention
by: Li, Xiaohe, et al.
Published: (2024)

Expandable Subspace Ensemble for Pre-Trained Model-Based Class-Incremental Learning
by: Zhou, Da-Wei, et al.
Published: (2024)

RefDecoder: Enhancing Visual Generation with Conditional Video Decoding
by: Fan, Xiang, et al.
Published: (2026)

InsTex: Indoor Scenes Stylized Texture Synthesis
by: Zhang, Yunfan, et al.
Published: (2025)

StyleMotif: Multi-Modal Motion Stylization using Style-Content Cross Fusion
by: Guo, Ziyu, et al.
Published: (2025)