Saved in:
| Main Authors: | Liu, Zhiheng, Ouyang, Hao, Wang, Qiuyu, Cheng, Ka Leong, Xiao, Jie, Zhu, Kai, Xue, Nan, Liu, Yu, Shen, Yujun, Cao, Yang |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.11613 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
DepthLab: From Partial to Complete
by: Liu, Zhiheng, et al.
Published: (2024)
by: Liu, Zhiheng, et al.
Published: (2024)
MagicQuill: An Intelligent Interactive Image Editing System
by: Liu, Zichen, et al.
Published: (2024)
by: Liu, Zichen, et al.
Published: (2024)
AniDoc: Animation Creation Made Easier
by: Meng, Yihao, et al.
Published: (2024)
by: Meng, Yihao, et al.
Published: (2024)
MangaNinja: Line Art Colorization with Precise Reference Following
by: Liu, Zhiheng, et al.
Published: (2025)
by: Liu, Zhiheng, et al.
Published: (2025)
Edicho: Consistent Image Editing in the Wild
by: Bai, Qingyan, et al.
Published: (2024)
by: Bai, Qingyan, et al.
Published: (2024)
LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis
by: Wang, Hanlin, et al.
Published: (2024)
by: Wang, Hanlin, et al.
Published: (2024)
Learning Naturally Aggregated Appearance for Efficient 3D Editing
by: Cheng, Ka Leong, et al.
Published: (2023)
by: Cheng, Ka Leong, et al.
Published: (2023)
VanGogh: A Unified Multimodal Diffusion-based Framework for Video Colorization
by: Fang, Zixun, et al.
Published: (2025)
by: Fang, Zixun, et al.
Published: (2025)
Calligrapher: Freestyle Text Image Customization
by: Ma, Yue, et al.
Published: (2025)
by: Ma, Yue, et al.
Published: (2025)
Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation
by: Lu, Yunhong, et al.
Published: (2025)
by: Lu, Yunhong, et al.
Published: (2025)
HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives
by: Meng, Yihao, et al.
Published: (2025)
by: Meng, Yihao, et al.
Published: (2025)
CausalCine: Real-Time Autoregressive Generation for Multi-Shot Video Narratives
by: Meng, Yihao, et al.
Published: (2026)
by: Meng, Yihao, et al.
Published: (2026)
Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset
by: Bai, Qingyan, et al.
Published: (2025)
by: Bai, Qingyan, et al.
Published: (2025)
MagicQuillV2: Precise and Interactive Image Editing with Layered Visual Cues
by: Liu, Zichen, et al.
Published: (2025)
by: Liu, Zichen, et al.
Published: (2025)
The World is Your Canvas: Painting Promptable Events with Reference Images, Trajectories, and Text
by: Wang, Hanlin, et al.
Published: (2025)
by: Wang, Hanlin, et al.
Published: (2025)
Learning Temporally Consistent Video Depth from Video Diffusion Priors
by: Shao, Jiahao, et al.
Published: (2024)
by: Shao, Jiahao, et al.
Published: (2024)
NeRF Inpainting with Geometric Diffusion Prior and Balanced Score Distillation
by: Zhang, Menglin, et al.
Published: (2024)
by: Zhang, Menglin, et al.
Published: (2024)
P‐5.6: Synergistic Low‐Light Image Enhancement: A Fusion of Dark Channel Dehazing and K‐means Clustering
by: Nan Xue, et al.
Published: (2024)
by: Nan Xue, et al.
Published: (2024)
Rectified Diffusion Guidance for Conditional Generation
by: Xia, Mengfei, et al.
Published: (2024)
by: Xia, Mengfei, et al.
Published: (2024)
Geometric Context Transformer for Streaming 3D Reconstruction
by: Chen, Lin-Zhuo, et al.
Published: (2026)
by: Chen, Lin-Zhuo, et al.
Published: (2026)
RI3D: Few-Shot Gaussian Splatting With Repair and Inpainting Diffusion Priors
by: Paliwal, Avinash, et al.
Published: (2025)
by: Paliwal, Avinash, et al.
Published: (2025)
UCD: Unconditional Discriminator Promotes Nash Equilibrium in GANs
by: Xia, Mengfei, et al.
Published: (2025)
by: Xia, Mengfei, et al.
Published: (2025)
Framer: Interactive Frame Interpolation
by: Wang, Wen, et al.
Published: (2024)
by: Wang, Wen, et al.
Published: (2024)
EraserDiT: Fast Video Inpainting with Diffusion Transformer Model
by: Liu, Jie, et al.
Published: (2025)
by: Liu, Jie, et al.
Published: (2025)
CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
by: Ouyang, Hao, et al.
Published: (2023)
by: Ouyang, Hao, et al.
Published: (2023)
Gaussian Belief Propagation Network for Depth Completion
by: Tang, Jie, et al.
Published: (2026)
by: Tang, Jie, et al.
Published: (2026)
Advancing Open-source World Models
by: Robbyant Team, et al.
Published: (2026)
by: Robbyant Team, et al.
Published: (2026)
A Recovery Theory for Diffusion Priors: Deterministic Analysis of the Implicit Prior Algorithm
by: Leong, Oscar, et al.
Published: (2025)
by: Leong, Oscar, et al.
Published: (2025)
InpDiffusion: Image Inpainting Localization via Conditional Diffusion Models
by: Wang, Kai, et al.
Published: (2025)
by: Wang, Kai, et al.
Published: (2025)
Diffusion-Based Depth Inpainting for Transparent and Reflective Objects
by: Sun, Tianyu, et al.
Published: (2024)
by: Sun, Tianyu, et al.
Published: (2024)
Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models
by: Wang, Wen, et al.
Published: (2025)
by: Wang, Wen, et al.
Published: (2025)
Structured Diffusion Models with Mixture of Gaussians as Prior Distribution
by: Jia, Nanshan, et al.
Published: (2024)
by: Jia, Nanshan, et al.
Published: (2024)
RealmDreamer: Text-Driven 3D Scene Generation with Inpainting and Depth Diffusion
by: Shriram, Jaidev, et al.
Published: (2024)
by: Shriram, Jaidev, et al.
Published: (2024)
Stealing Stable Diffusion Prior for Robust Monocular Depth Estimation
by: Mao, Yifan, et al.
Published: (2024)
by: Mao, Yifan, et al.
Published: (2024)
Seeing through Satellite Images at Street Views
by: Qian, Ming, et al.
Published: (2025)
by: Qian, Ming, et al.
Published: (2025)
3D Gaussian Inpainting with Depth-Guided Cross-View Consistency
by: Huang, Sheng-Yu, et al.
Published: (2025)
by: Huang, Sheng-Yu, et al.
Published: (2025)
ViewPoint: Panoramic Video Generation with Pretrained Diffusion Models
by: Fang, Zixun, et al.
Published: (2025)
by: Fang, Zixun, et al.
Published: (2025)
DISCO: Language-Guided Manipulation with Diffusion Policies and Constrained Inpainting
by: Hao, Ce, et al.
Published: (2024)
by: Hao, Ce, et al.
Published: (2024)
Distilling Textual Priors from LLM to Efficient Image Fusion
by: Zhang, Ran, et al.
Published: (2025)
by: Zhang, Ran, et al.
Published: (2025)
Masked Depth Modeling for Spatial Perception
by: Tan, Bin, et al.
Published: (2026)
by: Tan, Bin, et al.
Published: (2026)
Similar Items
-
DepthLab: From Partial to Complete
by: Liu, Zhiheng, et al.
Published: (2024) -
MagicQuill: An Intelligent Interactive Image Editing System
by: Liu, Zichen, et al.
Published: (2024) -
AniDoc: Animation Creation Made Easier
by: Meng, Yihao, et al.
Published: (2024) -
MangaNinja: Line Art Colorization with Precise Reference Following
by: Liu, Zhiheng, et al.
Published: (2025) -
Edicho: Consistent Image Editing in the Wild
by: Bai, Qingyan, et al.
Published: (2024)