Library Catalog

Bibliographic Details
Main Authors:	Cai, Yuanhao; Zhang, He; Chen, Xi; Xing, Jinbo; Hu, Yiwei; Zhou, Yuqian; Zhang, Kai; Zhang, Zhifei; Kim, Soo Ye; Wang, Tianyu; Zhang, Yulun; Yang, Xiaokang; Lin, Zhe; Yuille, Alan
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2506.23361

Similar Items

Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation and Reconstruction
by: Cai, Yuanhao, et al.
Published: (2024)

DreamVideo-Omni: Omni-Motion Controlled Multi-Subject Video Customization with Latent Identity Reinforcement Learning
by: Wei, Yujie, et al.
Published: (2026)

Radiative Gaussian Splatting for Efficient X-ray Novel View Synthesis
by: Cai, Yuanhao, et al.
Published: (2024)

HDR-GS: Efficient High Dynamic Range Novel View Synthesis at 1000x Speed via Gaussian Splatting
by: Cai, Yuanhao, et al.
Published: (2024)

Flow-Matching Guided Deep Unfolding for Hyperspectral Image Reconstruction
by: Ai, Yi, et al.
Published: (2025)

EditVerse: Unifying Image and Video Editing and Generation with In-Context Learning
by: Ju, Xuan, et al.
Published: (2025)

FreeCus: Free Lunch Subject-driven Customization in Diffusion Transformers
by: Zhang, Yanbing, et al.
Published: (2025)

Asymmetric VAE for One-Step Video Super-Resolution Acceleration
by: Li, Jianze, et al.
Published: (2025)

DenoiseGS: Gaussian Reconstruction Model for Burst Denoising
by: Cheng, Yongsen, et al.
Published: (2025)

VDFP: Video Deflickering with Flicker-banding Priors
by: Zhou, Zhiyi, et al.
Published: (2026)

OmniCustom: Sync Audio-Video Customization Via Joint Audio-Video Generation Model
by: Li, Maomao, et al.
Published: (2026)

Omni-Customizer: End-to-End MultiModal Customization for Joint Audio-Video Generation
by: Chen, Yuheng, et al.
Published: (2026)

Tri-Prompting: Video Diffusion with Unified Control over Scene, Subject, and Motion
by: Zhou, Zhenghong, et al.
Published: (2026)

OmniVDiff: Omni Controllable Video Diffusion for Generation and Understanding
by: Xi, Dianbing, et al.
Published: (2025)

QuantCache: Adaptive Importance-Guided Quantization with Hierarchical Latent and Layer Caching for Video Generation
by: Wu, Junyi, et al.
Published: (2025)

X-LRM: X-ray Large Reconstruction Model for Extremely Sparse-View Computed Tomography Recovery in One Second
by: Zhang, Guofeng, et al.
Published: (2025)

UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics
by: Chen, Xi, et al.
Published: (2024)

TransPixeler: Advancing Text-to-Video Generation with Transparency
by: Wang, Luozhou, et al.
Published: (2025)

Structure-Aware Sparse-View X-ray 3D Reconstruction
by: Cai, Yuanhao, et al.
Published: (2023)

Generative Video Propagation
by: Liu, Shaoteng, et al.
Published: (2024)

Efficient Video Diffusion with Sparse Information Transmission for Video Compression
by: Zhou, Mingde, et al.
Published: (2026)

CustomVideo: Customizing Text-to-Video Generation with Multiple Subjects
by: Wang, Zhao, et al.
Published: (2024)

Xformer: Hybrid X-Shaped Transformer for Image Denoising
by: Zhang, Jiale, et al.
Published: (2023)

SUGAR: Subject-Driven Video Customization in a Zero-Shot Manner
by: Zhou, Yufan, et al.
Published: (2024)

PRISM: Prior Rectification and Uncertainty-Aware Structure Modeling for Diffusion-Based Text Image Super-Resolution
by: Xu, Zihang, et al.
Published: (2026)

FlashEdit: Decoupling Speed, Structure, and Semantics for Precise Image Editing
by: Wu, Junyi, et al.
Published: (2025)

Recursive Generalization Transformer for Image Super-Resolution
by: Chen, Zheng, et al.
Published: (2023)

“Store Strategy”: A New Omni‐Channel Strategy in Community Group Buying
by: Nana Zhang, et al.
Published: (2024)

Human Body Restoration with One-Step Diffusion Model and A New Benchmark
by: Gong, Jue, et al.
Published: (2025)

DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control
by: Wei, Yujie, et al.
Published: (2024)

DreamSwapV: Mask-guided Subject Swapping for Any Customized Video Editing
by: Wang, Weitao, et al.
Published: (2025)

Thinking with Spatial Code for Physical-World Video Reasoning
by: Chen, Jieneng, et al.
Published: (2026)

Binarized Low-light Raw Video Enhancement
by: Zhang, Gengchen, et al.
Published: (2024)

Are Pixel-Wise Metrics Reliable for Sparse-View Computed Tomography Reconstruction?
by: Lin, Tianyu, et al.
Published: (2025)

Asymptotic linear stability of columnar vortices driven by Coriolis force
by: Miao, Shuang, et al.
Published: (2026)

HBridge: H-Shape Bridging of Heterogeneous Experts for Unified Multimodal Understanding and Generation
by: Wang, Xiang, et al.
Published: (2025)

OmniBind: Large-scale Omni Multimodal Representation via Binding Spaces
by: Wang, Zehan, et al.
Published: (2024)

OmniSTVG: Toward Spatio-Temporal Omni-Object Video Grounding
by: Yao, Jiali, et al.
Published: (2025)

DINeMo: Learning Neural Mesh Models with no 3D Annotations
by: Guo, Weijie, et al.
Published: (2025)

Dictionary-based Framework for Interpretable and Consistent Object Parsing
by: Zhang, Tiezheng, et al.
Published: (2025)