Saved in:
| Main Authors: | Wan, Zhen, Qi, Chenyang, Liu, Zhiheng, Gui, Tao, Ma, Yue |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2412.06340 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Unified Long Video Inpainting and Outpainting via Overlapping High-Order Co-Denoising
by: Lyu, Shuangquan, et al.
Published: (2025)
by: Lyu, Shuangquan, et al.
Published: (2025)
PrefPaint: Enhancing Medical Image Inpainting through Expert Human Feedback
by: Bui, Duy-Bao, et al.
Published: (2025)
by: Bui, Duy-Bao, et al.
Published: (2025)
UniVerse-1: Unified Audio-Video Generation via Stitching of Experts
by: Wang, Duomin, et al.
Published: (2025)
by: Wang, Duomin, et al.
Published: (2025)
MTV-Inpaint: Multi-Task Long Video Inpainting
by: Yang, Shiyuan, et al.
Published: (2025)
by: Yang, Shiyuan, et al.
Published: (2025)
Uni-MoE: Scaling Unified Multimodal LLMs with Mixture of Experts
by: Li, Yunxin, et al.
Published: (2024)
by: Li, Yunxin, et al.
Published: (2024)
UniVideo: Unified Understanding, Generation, and Editing for Videos
by: Wei, Cong, et al.
Published: (2025)
by: Wei, Cong, et al.
Published: (2025)
UniRoute: Unified Routing Mixture-of-Experts for Modality-Adaptive Remote Sensing Change Detection
by: Shu, Qingling, et al.
Published: (2026)
by: Shu, Qingling, et al.
Published: (2026)
Mixture-of-Attack-Experts with Class Regularization for Unified Physical-Digital Face Attack Detection
by: Chen, Shunxin, et al.
Published: (2025)
by: Chen, Shunxin, et al.
Published: (2025)
PrefPaint: Aligning Image Inpainting Diffusion Model with Human Preference
by: Liu, Kendong, et al.
Published: (2024)
by: Liu, Kendong, et al.
Published: (2024)
HarmonPaint: Harmonized Training-Free Diffusion Inpainting
by: Li, Ying, et al.
Published: (2025)
by: Li, Ying, et al.
Published: (2025)
MultiMotion: Multi Subject Video Motion Transfer via Video Diffusion Transformer
by: Liu, Penghui, et al.
Published: (2025)
by: Liu, Penghui, et al.
Published: (2025)
Uni3D-MoE: Scalable Multimodal 3D Scene Understanding via Mixture of Experts
by: Zhang, Yue, et al.
Published: (2025)
by: Zhang, Yue, et al.
Published: (2025)
UniEmo: Unifying Emotional Understanding and Generation with Learnable Expert Queries
by: Zhu, Yijie, et al.
Published: (2025)
by: Zhu, Yijie, et al.
Published: (2025)
OmniPaint: Mastering Object-Oriented Editing via Disentangled Insertion-Removal Inpainting
by: Yu, Yongsheng, et al.
Published: (2025)
by: Yu, Yongsheng, et al.
Published: (2025)
UniMMVSR: A Unified Multi-Modal Framework for Cascaded Video Super-Resolution
by: Du, Shian, et al.
Published: (2025)
by: Du, Shian, et al.
Published: (2025)
UniHOI: Unified Human-Object Interaction Understanding via Unified Token Space
by: Yang, Panqi, et al.
Published: (2025)
by: Yang, Panqi, et al.
Published: (2025)
Towards Language-Driven Video Inpainting via Multimodal Large Language Models
by: Wu, Jianzong, et al.
Published: (2024)
by: Wu, Jianzong, et al.
Published: (2024)
UniMRSeg: Unified Modality-Relax Segmentation via Hierarchical Self-Supervised Compensation
by: Zhao, Xiaoqi, et al.
Published: (2025)
by: Zhao, Xiaoqi, et al.
Published: (2025)
UniEditBench: A Unified and Cost-Effective Benchmark for Image and Video Editing via Distilled MLLMs
by: Jiang, Lifan, et al.
Published: (2026)
by: Jiang, Lifan, et al.
Published: (2026)
SafePaint: Anti-forensic Image Inpainting with Domain Adaptation
by: Chen, Dunyun, et al.
Published: (2024)
by: Chen, Dunyun, et al.
Published: (2024)
GuidPaint: Class-Guided Image Inpainting with Diffusion Models
by: Wang, Qimin, et al.
Published: (2025)
by: Wang, Qimin, et al.
Published: (2025)
UniVid: Unifying Vision Tasks with Pre-trained Video Generation Models
by: Chen, Lan, et al.
Published: (2025)
by: Chen, Lan, et al.
Published: (2025)
TD-Paint: Faster Diffusion Inpainting Through Time Aware Pixel Conditioning
by: Mayet, Tsiry, et al.
Published: (2024)
by: Mayet, Tsiry, et al.
Published: (2024)
UniMoD: Efficient Unified Multimodal Transformers with Mixture-of-Depths
by: Mao, Weijia, et al.
Published: (2025)
by: Mao, Weijia, et al.
Published: (2025)
UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation
by: Sun, Yang-Tian, et al.
Published: (2025)
by: Sun, Yang-Tian, et al.
Published: (2025)
AdvPaint: Protecting Images from Inpainting Manipulation via Adversarial Attention Disruption
by: Jeon, Joonsung, et al.
Published: (2025)
by: Jeon, Joonsung, et al.
Published: (2025)
Coherent and Multi-modality Image Inpainting via Latent Space Optimization
by: Pan, Lingzhi, et al.
Published: (2024)
by: Pan, Lingzhi, et al.
Published: (2024)
UniVBench: Towards Unified Evaluation for Video Foundation Models
by: Wei, Jianhui, et al.
Published: (2026)
by: Wei, Jianhui, et al.
Published: (2026)
ARIN: Adaptive Resampling and Instance Normalization for Robust Blind Inpainting of Dunhuang Cave Paintings
by: Schmidt, Alexander, et al.
Published: (2024)
by: Schmidt, Alexander, et al.
Published: (2024)
UniCP: A Unified Caching and Pruning Framework for Efficient Video Generation
by: Sun, Wenzhang, et al.
Published: (2025)
by: Sun, Wenzhang, et al.
Published: (2025)
UniVid: The Open-Source Unified Video Model
by: Luo, Jiabin, et al.
Published: (2025)
by: Luo, Jiabin, et al.
Published: (2025)
UniComp: Rethinking Video Compression Through Informational Uniqueness
by: Yuan, Chao, et al.
Published: (2025)
by: Yuan, Chao, et al.
Published: (2025)
Mixture of Scale Experts for Alignment-free RGBT Video Object Detection and A Unified Benchmark
by: Wang, Qishun, et al.
Published: (2024)
by: Wang, Qishun, et al.
Published: (2024)
Paint by Inpaint: Learning to Add Image Objects by Removing Them First
by: Wasserman, Navve, et al.
Published: (2024)
by: Wasserman, Navve, et al.
Published: (2024)
Exposing and Defending the Achilles' Heel of Video Mixture-of-Experts
by: Wang, Songping, et al.
Published: (2026)
by: Wang, Songping, et al.
Published: (2026)
Unified Multimodal Visual Tracking with Dual Mixture-of-Experts
by: Hong, Lingyi, et al.
Published: (2026)
by: Hong, Lingyi, et al.
Published: (2026)
PaintFlow: A Unified Framework for Interactive Oil Paintings Editing and Generation
by: Hu, Zhangli, et al.
Published: (2025)
by: Hu, Zhangli, et al.
Published: (2025)
MagicStick: Controllable Video Editing via Control Handle Transformations
by: Ma, Yue, et al.
Published: (2023)
by: Ma, Yue, et al.
Published: (2023)
Follow-Your-Creation: Empowering 4D Creation through Video Inpainting
by: Ma, Yue, et al.
Published: (2025)
by: Ma, Yue, et al.
Published: (2025)
RSUniVLM: A Unified Vision Language Model for Remote Sensing via Granularity-oriented Mixture of Experts
by: Liu, Xu, et al.
Published: (2024)
by: Liu, Xu, et al.
Published: (2024)
Similar Items
-
Unified Long Video Inpainting and Outpainting via Overlapping High-Order Co-Denoising
by: Lyu, Shuangquan, et al.
Published: (2025) -
PrefPaint: Enhancing Medical Image Inpainting through Expert Human Feedback
by: Bui, Duy-Bao, et al.
Published: (2025) -
UniVerse-1: Unified Audio-Video Generation via Stitching of Experts
by: Wang, Duomin, et al.
Published: (2025) -
MTV-Inpaint: Multi-Task Long Video Inpainting
by: Yang, Shiyuan, et al.
Published: (2025) -
Uni-MoE: Scaling Unified Multimodal LLMs with Mixture of Experts
by: Li, Yunxin, et al.
Published: (2024)