| Main Authors: | He, Xu, Zhang, Haoxian, Chen, Hejia, Zheng, Changyuan, Chen, Liyang, Tang, Songlin, Huang, Jiehui, Liu, Xiaoqiang, Wan, Pengfei, Wu, Zhiyong |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2512.25066 |
Similar Items
StableDub: Taming Diffusion Prior for Generalized and Efficient Visual Dubbing
by: Chen, Liyang, et al.
Published: (2025)
MIDAS: Multimodal Interactive Digital-humAn Synthesis via Real-time Autoregressive Video Generation
by: Chen, Ming, et al.
Published: (2025)
Semantic-Aware Prefix Learning for Token-Efficient Image Generation
by: Li, Qingfeng, et al.
Published: (2026)
3D-Aware Implicit Motion Control for View-Adaptive Human Video Generation
by: Fang, Zhixue, et al.
Published: (2026)
Cafe-Talk: Generating 3D Talking Face Animation with Multimodal Coarse- and Fine-grained Control
by: Chen, Hejia, et al.
Published: (2025)
DiffDub: Person-generic Visual Dubbing Using Inpainting Renderer with Diffusion Auto-encoder
by: Liu, Tao, et al.
Published: (2023)
OmniSync: Towards Universal Lip Synchronization via Diffusion Transformers
by: Peng, Ziqiao, et al.
Published: (2025)
Video Editing for Audio-Visual Dubbing
by: Manela, Binyamin, et al.
Published: (2025)
MTV-Inpaint: Multi-Task Long Video Inpainting
by: Yang, Shiyuan, et al.
Published: (2025)
VisualChef: Generating Visual Aids in Cooking via Mask Inpainting
by: Kuzyk, Oleh, et al.
Published: (2025)
Robust 3D Brain MRI Inpainting with Random Masking Augmentation
by: Zhang, Juexin, et al.
Published: (2025)
STSA: Spatial-Temporal Semantic Alignment for Visual Dubbing
by: Ding, Zijun, et al.
Published: (2025)
FreeInpaint: Tuning-free Prompt Alignment and Visual Rationality Enhancement in Image Inpainting
by: Gong, Chao, et al.
Published: (2025)
Dubbing for Everyone: Data-Efficient Visual Dubbing using Neural Rendering Priors
by: Saunders, Jack, et al.
Published: (2024)
GGTalker: Talking Head Systhesis with Generalizable Gaussian Priors and Identity-Specific Adaptation
by: Hu, Wentao, et al.
Published: (2025)
From Inpainting to Layer Decomposition: Repurposing Generative Inpainting Models for Image Layer Decomposition
by: Chen, Jingxi, et al.
Published: (2025)
BVINet: Unlocking Blind Video Inpainting with Zero Annotations
by: Wu, Zhiliang, et al.
Published: (2025)
HoSNN: Adversarially-Robust Homeostatic Spiking Neural Networks with Adaptive Firing Thresholds
by: Geng, Hejia, et al.
Published: (2023)
MEMLA: Enhancing Multilingual Knowledge Editing with Neuron-Masked Low-Rank Adaptation
by: Xie, Jiakuan, et al.
Published: (2024)
On the Robustness of Knowledge Editing for Detoxification
by: Dong, Ming, et al.
Published: (2026)
Token Painter: Training-Free Text-Guided Image Inpainting via Mask Autoregressive Models
by: Jiang, Longtao, et al.
Published: (2025)
JUST-DUB-IT: Video Dubbing via Joint Audio-Visual Diffusion
by: Chen, Anthony, et al.
Published: (2026)
Comment on ‘When Nurses Leave: A Critical Incident Study of Turnover Intentions’
by: Zilin Zhao, et al.
Published: (2025)
Multimorbidity patterns and disability risk in aging populations: Insights from machine learning
by: Zilin Zhao, et al.
Published: (2025)
Deconstructing the Burden of History: Gender Internalisation and the Epistemology of Illegitimate Tasks in Nursing Practice
by: Zilin Zhao, et al.
Published: (2025)
The Application of Digital Life Stories in Elderly Care: Methodological Limitations and Future Directions
by: Zilin Zhao, et al.
Published: (2025)
MaskedMimic: Unified Physics-Based Character Control Through Masked Motion Inpainting
by: Tessler, Chen, et al.
Published: (2024)
IM-Animation: An Implicit Motion Representation for Identity-decoupled Character Animation
by: Xu, Zhufeng, et al.
Published: (2026)
Pseudo-Bayesian Optimization
by: Chen, Haoxian, et al.
Published: (2023)
Robustness is Important: Limitations of LLMs for Data Fitting
by: Liu, Hejia, et al.
Published: (2025)
Kling-MotionControl Technical Report
by: Kling Team, et al.
Published: (2026)
VINO: A Unified Visual Generator with Interleaved OmniModal Context
by: Chen, Junyi, et al.
Published: (2026)
DualDub: Video-to-Soundtrack Generation via Joint Speech and Background Audio Synthesis
by: Tian, Wenjie, et al.
Published: (2025)
From Covert Hiding to Visual Editing: Robust Generative Video Steganography
by: Mao, Xueying, et al.
Published: (2024)
Dub-S2ST: Textless Speech-to-Speech Translation for Seamless Dubbing
by: Choi, Jeongsoo, et al.
Published: (2025)
MRT: Masked Region Transformer for Layered Image Generation and Editing at Scale
by: Tang, Zhicong, et al.
Published: (2026)
Enhancing Expressiveness in Dance Generation via Integrating Frequency and Music Style Information
by: Huang, Qiaochu, et al.
Published: (2024)
Edit Where You Mean: Region-Aware Adapter Injection for Mask-Free Local Image Editing
by: Cai, Honghao, et al.
Published: (2026)
VFXMaster: Unlocking Dynamic Visual Effect Generation via In-Context Learning
by: Li, Baolu, et al.
Published: (2025)
PainterNet: Adaptive Image Inpainting with Actual-Token Attention and Diverse Mask Control
by: Wang, Ruichen, et al.
Published: (2024)