:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Wan, Zhen, Qi, Chenyang, Liu, Zhiheng, Gui, Tao, Ma, Yue
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2412.06340
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Unified Long Video Inpainting and Outpainting via Overlapping High-Order Co-Denoising
by: Lyu, Shuangquan, et al.
Published: (2025)

PrefPaint: Enhancing Medical Image Inpainting through Expert Human Feedback
by: Bui, Duy-Bao, et al.
Published: (2025)

UniVerse-1: Unified Audio-Video Generation via Stitching of Experts
by: Wang, Duomin, et al.
Published: (2025)

MTV-Inpaint: Multi-Task Long Video Inpainting
by: Yang, Shiyuan, et al.
Published: (2025)

Uni-MoE: Scaling Unified Multimodal LLMs with Mixture of Experts
by: Li, Yunxin, et al.
Published: (2024)

UniVideo: Unified Understanding, Generation, and Editing for Videos
by: Wei, Cong, et al.
Published: (2025)

UniRoute: Unified Routing Mixture-of-Experts for Modality-Adaptive Remote Sensing Change Detection
by: Shu, Qingling, et al.
Published: (2026)

Mixture-of-Attack-Experts with Class Regularization for Unified Physical-Digital Face Attack Detection
by: Chen, Shunxin, et al.
Published: (2025)

PrefPaint: Aligning Image Inpainting Diffusion Model with Human Preference
by: Liu, Kendong, et al.
Published: (2024)

HarmonPaint: Harmonized Training-Free Diffusion Inpainting
by: Li, Ying, et al.
Published: (2025)

MultiMotion: Multi Subject Video Motion Transfer via Video Diffusion Transformer
by: Liu, Penghui, et al.
Published: (2025)

Uni3D-MoE: Scalable Multimodal 3D Scene Understanding via Mixture of Experts
by: Zhang, Yue, et al.
Published: (2025)

UniEmo: Unifying Emotional Understanding and Generation with Learnable Expert Queries
by: Zhu, Yijie, et al.
Published: (2025)

OmniPaint: Mastering Object-Oriented Editing via Disentangled Insertion-Removal Inpainting
by: Yu, Yongsheng, et al.
Published: (2025)

UniMMVSR: A Unified Multi-Modal Framework for Cascaded Video Super-Resolution
by: Du, Shian, et al.
Published: (2025)

UniHOI: Unified Human-Object Interaction Understanding via Unified Token Space
by: Yang, Panqi, et al.
Published: (2025)

Towards Language-Driven Video Inpainting via Multimodal Large Language Models
by: Wu, Jianzong, et al.
Published: (2024)

UniMRSeg: Unified Modality-Relax Segmentation via Hierarchical Self-Supervised Compensation
by: Zhao, Xiaoqi, et al.
Published: (2025)

UniEditBench: A Unified and Cost-Effective Benchmark for Image and Video Editing via Distilled MLLMs
by: Jiang, Lifan, et al.
Published: (2026)

SafePaint: Anti-forensic Image Inpainting with Domain Adaptation
by: Chen, Dunyun, et al.
Published: (2024)

GuidPaint: Class-Guided Image Inpainting with Diffusion Models
by: Wang, Qimin, et al.
Published: (2025)

UniVid: Unifying Vision Tasks with Pre-trained Video Generation Models
by: Chen, Lan, et al.
Published: (2025)

TD-Paint: Faster Diffusion Inpainting Through Time Aware Pixel Conditioning
by: Mayet, Tsiry, et al.
Published: (2024)

UniMoD: Efficient Unified Multimodal Transformers with Mixture-of-Depths
by: Mao, Weijia, et al.
Published: (2025)

UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation
by: Sun, Yang-Tian, et al.
Published: (2025)

AdvPaint: Protecting Images from Inpainting Manipulation via Adversarial Attention Disruption
by: Jeon, Joonsung, et al.
Published: (2025)

Coherent and Multi-modality Image Inpainting via Latent Space Optimization
by: Pan, Lingzhi, et al.
Published: (2024)

UniVBench: Towards Unified Evaluation for Video Foundation Models
by: Wei, Jianhui, et al.
Published: (2026)

ARIN: Adaptive Resampling and Instance Normalization for Robust Blind Inpainting of Dunhuang Cave Paintings
by: Schmidt, Alexander, et al.
Published: (2024)

UniCP: A Unified Caching and Pruning Framework for Efficient Video Generation
by: Sun, Wenzhang, et al.
Published: (2025)

UniVid: The Open-Source Unified Video Model
by: Luo, Jiabin, et al.
Published: (2025)

UniComp: Rethinking Video Compression Through Informational Uniqueness
by: Yuan, Chao, et al.
Published: (2025)

Mixture of Scale Experts for Alignment-free RGBT Video Object Detection and A Unified Benchmark
by: Wang, Qishun, et al.
Published: (2024)

Paint by Inpaint: Learning to Add Image Objects by Removing Them First
by: Wasserman, Navve, et al.
Published: (2024)

Exposing and Defending the Achilles' Heel of Video Mixture-of-Experts
by: Wang, Songping, et al.
Published: (2026)

Unified Multimodal Visual Tracking with Dual Mixture-of-Experts
by: Hong, Lingyi, et al.
Published: (2026)

PaintFlow: A Unified Framework for Interactive Oil Paintings Editing and Generation
by: Hu, Zhangli, et al.
Published: (2025)

MagicStick: Controllable Video Editing via Control Handle Transformations
by: Ma, Yue, et al.
Published: (2023)

Follow-Your-Creation: Empowering 4D Creation through Video Inpainting
by: Ma, Yue, et al.
Published: (2025)

RSUniVLM: A Unified Vision Language Model for Remote Sensing via Granularity-oriented Mixture of Experts
by: Liu, Xu, et al.
Published: (2024)