| Main Authors: | Ren, Peng, Yang, Hai |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.15392 |
Similar Items
Stylized Text-to-Motion Generation via Hypernetwork-Driven Low-Rank Adaptation
by: Jeon, Junhyuk, et al.
Published: (2026)
REED-VAE: RE-Encode Decode Training for Iterative Image Editing with Diffusion Models
by: Almog, Gal, et al.
Published: (2025)
FissionVAE: Federated Non-IID Image Generation with Latent Space and Decoder Decomposition
by: Hu, Chen, et al.
Published: (2024)
Spatio-Temporal Branching for Motion Prediction using Motion Increments
by: Wang, Jiexin, et al.
Published: (2023)
Seeing Far and Clearly: Mitigating Hallucinations in MLLMs with Attention Causal Decoding
by: Tang, Feilong, et al.
Published: (2025)
Closed-Loop Unsupervised Representation Disentanglement with $β$-VAE Distillation and Diffusion Probabilistic Feedback
by: Jin, Xin, et al.
Published: (2024)
Balanced Image Stylization with Style Matching Score
by: Jiang, Yuxin, et al.
Published: (2025)
LiteVAE: Lightweight and Efficient Variational Autoencoders for Latent Diffusion Models
by: Sadat, Seyedmorteza, et al.
Published: (2024)
StreamDiffusionV2: A Streaming System for Dynamic and Interactive Video Generation
by: Feng, Tianrui, et al.
Published: (2025)
A Converting Autoencoder Toward Low-latency and Energy-efficient DNN Inference at the Edge
by: Mahmud, Hasanul, et al.
Published: (2024)
REPA-E: Unlocking VAE for End-to-End Tuning with Latent Diffusion Transformers
by: Leng, Xingjian, et al.
Published: (2025)
MotionCrafter: Dense Geometry and Motion Reconstruction with a 4D VAE
by: Zhu, Ruijie, et al.
Published: (2026)
SVGFusion: A VAE-Diffusion Transformer for Vector Graphic Generation
by: Xing, Ximing, et al.
Published: (2024)
SepVAE: a contrastive VAE to separate pathological patterns from healthy ones
by: Louiset, Robin, et al.
Published: (2023)
Elucidating the Design Space of Arbitrary-Noise-Based Diffusion Models
by: Qiu, Xingyu, et al.
Published: (2025)
Remembering by Reconstructing: Domain Incremental Learning With Test-Time Training on Video Streams
by: Swinnen, Jonathan, et al.
Published: (2026)
VFM-VAE: Vision Foundation Models Can Be Good Tokenizers for Latent Diffusion Models
by: Bi, Tianci, et al.
Published: (2025)
Streaming-dLLM: Accelerating Diffusion LLMs via Suffix Pruning and Dynamic Decoding
by: Xiao, Zhongyu, et al.
Published: (2026)
MotionStream: Real-Time Video Generation with Interactive Motion Controls
by: Shin, Joonghyuk, et al.
Published: (2025)
Large VLM-based Stylized Sports Captioning
by: Dhar, Sauptik, et al.
Published: (2025)
StyDeSty: Min-Max Stylization and Destylization for Single Domain Generalization
by: Liu, Songhua, et al.
Published: (2024)
Stylized Synthetic Augmentation further improves Corruption Robustness
by: Siedel, Georg, et al.
Published: (2025)
CASHG: Context-Aware Stylized Online Handwriting Generation
by: Shin, Jinsu, et al.
Published: (2026)
DS-AL: A Dual-Stream Analytic Learning for Exemplar-Free Class-Incremental Learning
by: Zhuang, Huiping, et al.
Published: (2024)
Diffusion Models Learn Low-Dimensional Distributions via Subspace Clustering
by: Wang, Peng, et al.
Published: (2024)
Adaptive Adapter Routing for Long-Tailed Class-Incremental Learning
by: Qi, Zhi-Hong, et al.
Published: (2024)
CoMusion: Towards Consistent Stochastic Human Motion Prediction via Motion Diffusion
by: Sun, Jiarui, et al.
Published: (2023)
Memory Storyboard: Leveraging Temporal Segmentation for Streaming Self-Supervised Learning from Egocentric Videos
by: Yang, Yanlai, et al.
Published: (2025)
Understanding Representation Dynamics of Diffusion Models via Low-Dimensional Modeling
by: Li, Xiao, et al.
Published: (2025)
Quantize-then-Rectify: Efficient VQ-VAE Training
by: Zhang, Borui, et al.
Published: (2025)
Bridging the Simulation-to-Reality Gap in Electron Microscope Calibration via VAE-EM Estimation
by: van Hulst, Jilles S., et al.
Published: (2026)
Multimodal ELBO with Diffusion Decoders
by: Wesego, Daniel, et al.
Published: (2024)
Flexible Motion In-betweening with Diffusion Models
by: Cohan, Setareh, et al.
Published: (2024)
MotionMatcher: Motion Customization of Text-to-Video Diffusion Models via Motion Feature Matching
by: Wu, Yen-Siang, et al.
Published: (2025)
SR$^2$-LoRA: Self-Rectifying Inter-layer Relations in Low-Rank Adaptation for Class-Incremental Learning
by: Wan, Fengqiang, et al.
Published: (2026)
C$^{2}$INet: Realizing Incremental Trajectory Prediction with Prior-Aware Continual Causal Intervention
by: Li, Xiaohe, et al.
Published: (2024)
Expandable Subspace Ensemble for Pre-Trained Model-Based Class-Incremental Learning
by: Zhou, Da-Wei, et al.
Published: (2024)
RefDecoder: Enhancing Visual Generation with Conditional Video Decoding
by: Fan, Xiang, et al.
Published: (2026)
InsTex: Indoor Scenes Stylized Texture Synthesis
by: Zhang, Yunfan, et al.
Published: (2025)
StyleMotif: Multi-Modal Motion Stylization using Style-Content Cross Fusion
by: Guo, Ziyu, et al.
Published: (2025)