:: Library Catalog

Bibliographic Details
Main Authors:	Liu, Zhiheng; Ouyang, Hao; Wang, Qiuyu; Cheng, Ka Leong; Xiao, Jie; Zhu, Kai; Xue, Nan; Liu, Yu; Shen, Yujun; Cao, Yang
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2404.11613

Similar Items

DepthLab: From Partial to Complete
by: Liu, Zhiheng, et al.
Published: (2024)

MagicQuill: An Intelligent Interactive Image Editing System
by: Liu, Zichen, et al.
Published: (2024)

AniDoc: Animation Creation Made Easier
by: Meng, Yihao, et al.
Published: (2024)

MangaNinja: Line Art Colorization with Precise Reference Following
by: Liu, Zhiheng, et al.
Published: (2025)

Edicho: Consistent Image Editing in the Wild
by: Bai, Qingyan, et al.
Published: (2024)

LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis
by: Wang, Hanlin, et al.
Published: (2024)

Learning Naturally Aggregated Appearance for Efficient 3D Editing
by: Cheng, Ka Leong, et al.
Published: (2023)

VanGogh: A Unified Multimodal Diffusion-based Framework for Video Colorization
by: Fang, Zixun, et al.
Published: (2025)

Calligrapher: Freestyle Text Image Customization
by: Ma, Yue, et al.
Published: (2025)

Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation
by: Lu, Yunhong, et al.
Published: (2025)

HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives
by: Meng, Yihao, et al.
Published: (2025)

CausalCine: Real-Time Autoregressive Generation for Multi-Shot Video Narratives
by: Meng, Yihao, et al.
Published: (2026)

Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset
by: Bai, Qingyan, et al.
Published: (2025)

MagicQuillV2: Precise and Interactive Image Editing with Layered Visual Cues
by: Liu, Zichen, et al.
Published: (2025)

The World is Your Canvas: Painting Promptable Events with Reference Images, Trajectories, and Text
by: Wang, Hanlin, et al.
Published: (2025)

Learning Temporally Consistent Video Depth from Video Diffusion Priors
by: Shao, Jiahao, et al.
Published: (2024)

NeRF Inpainting with Geometric Diffusion Prior and Balanced Score Distillation
by: Zhang, Menglin, et al.
Published: (2024)

P‐5.6: Synergistic Low‐Light Image Enhancement: A Fusion of Dark Channel Dehazing and K‐means Clustering
by: Xue, Nan, et al.
Published: (2024)

Rectified Diffusion Guidance for Conditional Generation
by: Xia, Mengfei, et al.
Published: (2024)

Geometric Context Transformer for Streaming 3D Reconstruction
by: Chen, Lin-Zhuo, et al.
Published: (2026)

RI3D: Few-Shot Gaussian Splatting With Repair and Inpainting Diffusion Priors
by: Paliwal, Avinash, et al.
Published: (2025)

UCD: Unconditional Discriminator Promotes Nash Equilibrium in GANs
by: Xia, Mengfei, et al.
Published: (2025)

Framer: Interactive Frame Interpolation
by: Wang, Wen, et al.
Published: (2024)

EraserDiT: Fast Video Inpainting with Diffusion Transformer Model
by: Liu, Jie, et al.
Published: (2025)

CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
by: Ouyang, Hao, et al.
Published: (2023)

Gaussian Belief Propagation Network for Depth Completion
by: Tang, Jie, et al.
Published: (2026)

Advancing Open-source World Models
by: Robbyant Team, et al.
Published: (2026)

A Recovery Theory for Diffusion Priors: Deterministic Analysis of the Implicit Prior Algorithm
by: Leong, Oscar, et al.
Published: (2025)

InpDiffusion: Image Inpainting Localization via Conditional Diffusion Models
by: Wang, Kai, et al.
Published: (2025)

Diffusion-Based Depth Inpainting for Transparent and Reflective Objects
by: Sun, Tianyu, et al.
Published: (2024)

Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models
by: Wang, Wen, et al.
Published: (2025)

Structured Diffusion Models with Mixture of Gaussians as Prior Distribution
by: Jia, Nanshan, et al.
Published: (2024)

RealmDreamer: Text-Driven 3D Scene Generation with Inpainting and Depth Diffusion
by: Shriram, Jaidev, et al.
Published: (2024)

Stealing Stable Diffusion Prior for Robust Monocular Depth Estimation
by: Mao, Yifan, et al.
Published: (2024)

Seeing through Satellite Images at Street Views
by: Qian, Ming, et al.
Published: (2025)

3D Gaussian Inpainting with Depth-Guided Cross-View Consistency
by: Huang, Sheng-Yu, et al.
Published: (2025)

ViewPoint: Panoramic Video Generation with Pretrained Diffusion Models
by: Fang, Zixun, et al.
Published: (2025)

DISCO: Language-Guided Manipulation with Diffusion Policies and Constrained Inpainting
by: Hao, Ce, et al.
Published: (2024)

Distilling Textual Priors from LLM to Efficient Image Fusion
by: Zhang, Ran, et al.
Published: (2025)

Masked Depth Modeling for Spatial Perception
by: Tan, Bin, et al.
Published: (2026)