Saved in:
| Main Authors: | Cai, Yuanhao, Zhang, He, Chen, Xi, Xing, Jinbo, Hu, Yiwei, Zhou, Yuqian, Zhang, Kai, Zhang, Zhifei, Kim, Soo Ye, Wang, Tianyu, Zhang, Yulun, Yang, Xiaokang, Lin, Zhe, Yuille, Alan |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.23361 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation and Reconstruction
by: Cai, Yuanhao, et al.
Published: (2024)
by: Cai, Yuanhao, et al.
Published: (2024)
DreamVideo-Omni: Omni-Motion Controlled Multi-Subject Video Customization with Latent Identity Reinforcement Learning
by: Wei, Yujie, et al.
Published: (2026)
by: Wei, Yujie, et al.
Published: (2026)
Radiative Gaussian Splatting for Efficient X-ray Novel View Synthesis
by: Cai, Yuanhao, et al.
Published: (2024)
by: Cai, Yuanhao, et al.
Published: (2024)
HDR-GS: Efficient High Dynamic Range Novel View Synthesis at 1000x Speed via Gaussian Splatting
by: Cai, Yuanhao, et al.
Published: (2024)
by: Cai, Yuanhao, et al.
Published: (2024)
Flow-Matching Guided Deep Unfolding for Hyperspectral Image Reconstruction
by: Ai, Yi, et al.
Published: (2025)
by: Ai, Yi, et al.
Published: (2025)
EditVerse: Unifying Image and Video Editing and Generation with In-Context Learning
by: Ju, Xuan, et al.
Published: (2025)
by: Ju, Xuan, et al.
Published: (2025)
FreeCus: Free Lunch Subject-driven Customization in Diffusion Transformers
by: Zhang, Yanbing, et al.
Published: (2025)
by: Zhang, Yanbing, et al.
Published: (2025)
Asymmetric VAE for One-Step Video Super-Resolution Acceleration
by: Li, Jianze, et al.
Published: (2025)
by: Li, Jianze, et al.
Published: (2025)
DenoiseGS: Gaussian Reconstruction Model for Burst Denoising
by: Cheng, Yongsen, et al.
Published: (2025)
by: Cheng, Yongsen, et al.
Published: (2025)
VDFP: Video Deflickering with Flicker-banding Priors
by: Zhou, Zhiyi, et al.
Published: (2026)
by: Zhou, Zhiyi, et al.
Published: (2026)
OmniCustom: Sync Audio-Video Customization Via Joint Audio-Video Generation Model
by: Li, Maomao, et al.
Published: (2026)
by: Li, Maomao, et al.
Published: (2026)
Omni-Customizer: End-to-End MultiModal Customization for Joint Audio-Video Generation
by: Chen, Yuheng, et al.
Published: (2026)
by: Chen, Yuheng, et al.
Published: (2026)
Tri-Prompting: Video Diffusion with Unified Control over Scene, Subject, and Motion
by: Zhou, Zhenghong, et al.
Published: (2026)
by: Zhou, Zhenghong, et al.
Published: (2026)
OmniVDiff: Omni Controllable Video Diffusion for Generation and Understanding
by: Xi, Dianbing, et al.
Published: (2025)
by: Xi, Dianbing, et al.
Published: (2025)
QuantCache: Adaptive Importance-Guided Quantization with Hierarchical Latent and Layer Caching for Video Generation
by: Wu, Junyi, et al.
Published: (2025)
by: Wu, Junyi, et al.
Published: (2025)
X-LRM: X-ray Large Reconstruction Model for Extremely Sparse-View Computed Tomography Recovery in One Second
by: Zhang, Guofeng, et al.
Published: (2025)
by: Zhang, Guofeng, et al.
Published: (2025)
UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics
by: Chen, Xi, et al.
Published: (2024)
by: Chen, Xi, et al.
Published: (2024)
TransPixeler: Advancing Text-to-Video Generation with Transparency
by: Wang, Luozhou, et al.
Published: (2025)
by: Wang, Luozhou, et al.
Published: (2025)
Structure-Aware Sparse-View X-ray 3D Reconstruction
by: Cai, Yuanhao, et al.
Published: (2023)
by: Cai, Yuanhao, et al.
Published: (2023)
Generative Video Propagation
by: Liu, Shaoteng, et al.
Published: (2024)
by: Liu, Shaoteng, et al.
Published: (2024)
Efficient Video Diffusion with Sparse Information Transmission for Video Compression
by: Zhou, Mingde, et al.
Published: (2026)
by: Zhou, Mingde, et al.
Published: (2026)
CustomVideo: Customizing Text-to-Video Generation with Multiple Subjects
by: Wang, Zhao, et al.
Published: (2024)
by: Wang, Zhao, et al.
Published: (2024)
Xformer: Hybrid X-Shaped Transformer for Image Denoising
by: Zhang, Jiale, et al.
Published: (2023)
by: Zhang, Jiale, et al.
Published: (2023)
SUGAR: Subject-Driven Video Customization in a Zero-Shot Manner
by: Zhou, Yufan, et al.
Published: (2024)
by: Zhou, Yufan, et al.
Published: (2024)
PRISM: Prior Rectification and Uncertainty-Aware Structure Modeling for Diffusion-Based Text Image Super-Resolution
by: Xu, Zihang, et al.
Published: (2026)
by: Xu, Zihang, et al.
Published: (2026)
FlashEdit: Decoupling Speed, Structure, and Semantics for Precise Image Editing
by: Wu, Junyi, et al.
Published: (2025)
by: Wu, Junyi, et al.
Published: (2025)
Recursive Generalization Transformer for Image Super-Resolution
by: Chen, Zheng, et al.
Published: (2023)
by: Chen, Zheng, et al.
Published: (2023)
“Store Strategy”: A New Omni‐Channel Strategy in Community Group Buying
by: Nana Zhang, et al.
Published: (2024)
by: Nana Zhang, et al.
Published: (2024)
Human Body Restoration with One-Step Diffusion Model and A New Benchmark
by: Gong, Jue, et al.
Published: (2025)
by: Gong, Jue, et al.
Published: (2025)
DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control
by: Wei, Yujie, et al.
Published: (2024)
by: Wei, Yujie, et al.
Published: (2024)
DreamSwapV: Mask-guided Subject Swapping for Any Customized Video Editing
by: Wang, Weitao, et al.
Published: (2025)
by: Wang, Weitao, et al.
Published: (2025)
Thinking with Spatial Code for Physical-World Video Reasoning
by: Chen, Jieneng, et al.
Published: (2026)
by: Chen, Jieneng, et al.
Published: (2026)
Binarized Low-light Raw Video Enhancement
by: Zhang, Gengchen, et al.
Published: (2024)
by: Zhang, Gengchen, et al.
Published: (2024)
Are Pixel-Wise Metrics Reliable for Sparse-View Computed Tomography Reconstruction?
by: Lin, Tianyu, et al.
Published: (2025)
by: Lin, Tianyu, et al.
Published: (2025)
Asymptotic linear stability of columnar vortices driven by Coriolis force
by: Miao, Shuang, et al.
Published: (2026)
by: Miao, Shuang, et al.
Published: (2026)
HBridge: H-Shape Bridging of Heterogeneous Experts for Unified Multimodal Understanding and Generation
by: Wang, Xiang, et al.
Published: (2025)
by: Wang, Xiang, et al.
Published: (2025)
OmniBind: Large-scale Omni Multimodal Representation via Binding Spaces
by: Wang, Zehan, et al.
Published: (2024)
by: Wang, Zehan, et al.
Published: (2024)
OmniSTVG: Toward Spatio-Temporal Omni-Object Video Grounding
by: Yao, Jiali, et al.
Published: (2025)
by: Yao, Jiali, et al.
Published: (2025)
DINeMo: Learning Neural Mesh Models with no 3D Annotations
by: Guo, Weijie, et al.
Published: (2025)
by: Guo, Weijie, et al.
Published: (2025)
Dictionary-based Framework for Interpretable and Consistent Object Parsing
by: Zhang, Tiezheng, et al.
Published: (2025)
by: Zhang, Tiezheng, et al.
Published: (2025)
Similar Items
-
Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation and Reconstruction
by: Cai, Yuanhao, et al.
Published: (2024) -
DreamVideo-Omni: Omni-Motion Controlled Multi-Subject Video Customization with Latent Identity Reinforcement Learning
by: Wei, Yujie, et al.
Published: (2026) -
Radiative Gaussian Splatting for Efficient X-ray Novel View Synthesis
by: Cai, Yuanhao, et al.
Published: (2024) -
HDR-GS: Efficient High Dynamic Range Novel View Synthesis at 1000x Speed via Gaussian Splatting
by: Cai, Yuanhao, et al.
Published: (2024) -
Flow-Matching Guided Deep Unfolding for Hyperspectral Image Reconstruction
by: Ai, Yi, et al.
Published: (2025)