:: Library Catalog

Bibliographic Details
Main Authors:	He, Xu; Zhang, Haoxian; Chen, Hejia; Zheng, Changyuan; Chen, Liyang; Tang, Songlin; Huang, Jiehui; Liu, Xiaoqiang; Wan, Pengfei; Wu, Zhiyong
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2512.25066

Similar Items

StableDub: Taming Diffusion Prior for Generalized and Efficient Visual Dubbing
by: Chen, Liyang, et al.
Published: (2025)

MIDAS: Multimodal Interactive Digital-humAn Synthesis via Real-time Autoregressive Video Generation
by: Chen, Ming, et al.
Published: (2025)

Semantic-Aware Prefix Learning for Token-Efficient Image Generation
by: Li, Qingfeng, et al.
Published: (2026)

3D-Aware Implicit Motion Control for View-Adaptive Human Video Generation
by: Fang, Zhixue, et al.
Published: (2026)

Cafe-Talk: Generating 3D Talking Face Animation with Multimodal Coarse- and Fine-grained Control
by: Chen, Hejia, et al.
Published: (2025)

DiffDub: Person-generic Visual Dubbing Using Inpainting Renderer with Diffusion Auto-encoder
by: Liu, Tao, et al.
Published: (2023)

OmniSync: Towards Universal Lip Synchronization via Diffusion Transformers
by: Peng, Ziqiao, et al.
Published: (2025)

Video Editing for Audio-Visual Dubbing
by: Manela, Binyamin, et al.
Published: (2025)

MTV-Inpaint: Multi-Task Long Video Inpainting
by: Yang, Shiyuan, et al.
Published: (2025)

VisualChef: Generating Visual Aids in Cooking via Mask Inpainting
by: Kuzyk, Oleh, et al.
Published: (2025)

Robust 3D Brain MRI Inpainting with Random Masking Augmentation
by: Zhang, Juexin, et al.
Published: (2025)

STSA: Spatial-Temporal Semantic Alignment for Visual Dubbing
by: Ding, Zijun, et al.
Published: (2025)

FreeInpaint: Tuning-free Prompt Alignment and Visual Rationality Enhancement in Image Inpainting
by: Gong, Chao, et al.
Published: (2025)

Dubbing for Everyone: Data-Efficient Visual Dubbing using Neural Rendering Priors
by: Saunders, Jack, et al.
Published: (2024)

GGTalker: Talking Head Systhesis with Generalizable Gaussian Priors and Identity-Specific Adaptation
by: Hu, Wentao, et al.
Published: (2025)

From Inpainting to Layer Decomposition: Repurposing Generative Inpainting Models for Image Layer Decomposition
by: Chen, Jingxi, et al.
Published: (2025)

BVINet: Unlocking Blind Video Inpainting with Zero Annotations
by: Wu, Zhiliang, et al.
Published: (2025)

HoSNN: Adversarially-Robust Homeostatic Spiking Neural Networks with Adaptive Firing Thresholds
by: Geng, Hejia, et al.
Published: (2023)

MEMLA: Enhancing Multilingual Knowledge Editing with Neuron-Masked Low-Rank Adaptation
by: Xie, Jiakuan, et al.
Published: (2024)

On the Robustness of Knowledge Editing for Detoxification
by: Dong, Ming, et al.
Published: (2026)

Token Painter: Training-Free Text-Guided Image Inpainting via Mask Autoregressive Models
by: Jiang, Longtao, et al.
Published: (2025)

JUST-DUB-IT: Video Dubbing via Joint Audio-Visual Diffusion
by: Chen, Anthony, et al.
Published: (2026)

Comment on ‘When Nurses Leave: A Critical Incident Study of Turnover Intentions’
by: Zilin Zhao, et al.
Published: (2025)

Multimorbidity patterns and disability risk in aging populations: Insights from machine learning
by: Zilin Zhao, et al.
Published: (2025)

Deconstructing the Burden of History: Gender Internalisation and the Epistemology of Illegitimate Tasks in Nursing Practice
by: Zilin Zhao, et al.
Published: (2025)

The Application of Digital Life Stories in Elderly Care: Methodological Limitations and Future Directions
by: Zilin Zhao, et al.
Published: (2025)

MaskedMimic: Unified Physics-Based Character Control Through Masked Motion Inpainting
by: Tessler, Chen, et al.
Published: (2024)

IM-Animation: An Implicit Motion Representation for Identity-decoupled Character Animation
by: Xu, Zhufeng, et al.
Published: (2026)

Pseudo-Bayesian Optimization
by: Chen, Haoxian, et al.
Published: (2023)

Robustness is Important: Limitations of LLMs for Data Fitting
by: Liu, Hejia, et al.
Published: (2025)

Kling-MotionControl Technical Report
by: Kling Team, et al.
Published: (2026)

VINO: A Unified Visual Generator with Interleaved OmniModal Context
by: Chen, Junyi, et al.
Published: (2026)

DualDub: Video-to-Soundtrack Generation via Joint Speech and Background Audio Synthesis
by: Tian, Wenjie, et al.
Published: (2025)

From Covert Hiding to Visual Editing: Robust Generative Video Steganography
by: Mao, Xueying, et al.
Published: (2024)

Dub-S2ST: Textless Speech-to-Speech Translation for Seamless Dubbing
by: Choi, Jeongsoo, et al.
Published: (2025)

MRT: Masked Region Transformer for Layered Image Generation and Editing at Scale
by: Tang, Zhicong, et al.
Published: (2026)

Enhancing Expressiveness in Dance Generation via Integrating Frequency and Music Style Information
by: Huang, Qiaochu, et al.
Published: (2024)

Edit Where You Mean: Region-Aware Adapter Injection for Mask-Free Local Image Editing
by: Cai, Honghao, et al.
Published: (2026)

VFXMaster: Unlocking Dynamic Visual Effect Generation via In-Context Learning
by: Li, Baolu, et al.
Published: (2025)

PainterNet: Adaptive Image Inpainting with Actual-Token Attention and Diverse Mask Control
by: Wang, Ruichen, et al.
Published: (2024)