Saved in:
| Main Authors: | Brannon, William, Virkar, Yogesh, Thompson, Brian |
|---|---|
| Format: | Preprint |
| Published: |
2022
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2212.12137 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Dub-S2ST: Textless Speech-to-Speech Translation for Seamless Dubbing
by: Choi, Jeongsoo, et al.
Published: (2025)
by: Choi, Jeongsoo, et al.
Published: (2025)
StyleDubber: Towards Multi-Scale Style Learning for Movie Dubbing
by: Cong, Gaoxiang, et al.
Published: (2024)
by: Cong, Gaoxiang, et al.
Published: (2024)
ANIM-400K: A Large-Scale Dataset for Automated End-To-End Dubbing of Video
by: Cai, Kevin, et al.
Published: (2024)
by: Cai, Kevin, et al.
Published: (2024)
Length Aware Speech Translation for Video Dubbing
by: Chadha, Harveen Singh, et al.
Published: (2025)
by: Chadha, Harveen Singh, et al.
Published: (2025)
Towards Expressive Video Dubbing with Multiscale Multimodal Context Interaction
by: Zhao, Yuan, et al.
Published: (2024)
by: Zhao, Yuan, et al.
Published: (2024)
Towards Authentic Movie Dubbing with Retrieve-Augmented Director-Actor Interaction Learning
by: Liu, Rui, et al.
Published: (2025)
by: Liu, Rui, et al.
Published: (2025)
Fine-grained Video Dubbing Duration Alignment with Segment Supervised Preference Optimization
by: Cui, Chaoqun, et al.
Published: (2025)
by: Cui, Chaoqun, et al.
Published: (2025)
StableDub: Taming Diffusion Prior for Generalized and Efficient Visual Dubbing
by: Chen, Liyang, et al.
Published: (2025)
by: Chen, Liyang, et al.
Published: (2025)
Dubbing for Everyone: Data-Efficient Visual Dubbing using Neural Rendering Priors
by: Saunders, Jack, et al.
Published: (2024)
by: Saunders, Jack, et al.
Published: (2024)
VoiceCraft-Dub: Automated Video Dubbing with Neural Codec Language Models
by: Sung-Bin, Kim, et al.
Published: (2025)
by: Sung-Bin, Kim, et al.
Published: (2025)
SyncVoice: Towards Video Dubbing with Vision-Augmented Pretrained TTS Model
by: Wang, Kaidi, et al.
Published: (2025)
by: Wang, Kaidi, et al.
Published: (2025)
DiffDub: Person-generic Visual Dubbing Using Inpainting Renderer with Diffusion Auto-encoder
by: Liu, Tao, et al.
Published: (2023)
by: Liu, Tao, et al.
Published: (2023)
Video Editing for Audio-Visual Dubbing
by: Manela, Binyamin, et al.
Published: (2025)
by: Manela, Binyamin, et al.
Published: (2025)
Identity-Preserving Video Dubbing Using Motion Warping
by: Liu, Runzhen, et al.
Published: (2025)
by: Liu, Runzhen, et al.
Published: (2025)
STSA: Spatial-Temporal Semantic Alignment for Visual Dubbing
by: Ding, Zijun, et al.
Published: (2025)
by: Ding, Zijun, et al.
Published: (2025)
MCDubber: Multimodal Context-Aware Expressive Video Dubbing
by: Zhao, Yuan, et al.
Published: (2024)
by: Zhao, Yuan, et al.
Published: (2024)
PersonaTalk: Bring Attention to Your Persona in Visual Dubbing
by: Zhang, Longhao, et al.
Published: (2024)
by: Zhang, Longhao, et al.
Published: (2024)
RhythmTA: A Visual-Aided Interactive System for ESL Rhythm Training via Dubbing Practice
by: Chen, Chang, et al.
Published: (2025)
by: Chen, Chang, et al.
Published: (2025)
DubWise: Video-Guided Speech Duration Control in Multimodal LLM-based Text-to-Speech for Dubbing
by: Sahipjohn, Neha, et al.
Published: (2024)
by: Sahipjohn, Neha, et al.
Published: (2024)
LubDubDecoder: Bringing Micro-Mechanical Cardiac Monitoring to Hearables
by: Zhang, Siqi, et al.
Published: (2025)
by: Zhang, Siqi, et al.
Published: (2025)
Improving Statistical Significance in Human Evaluation of Automatic Metrics via Soft Pairwise Accuracy
by: Thompson, Brian, et al.
Published: (2024)
by: Thompson, Brian, et al.
Published: (2024)
JUST-DUB-IT: Video Dubbing via Joint Audio-Visual Diffusion
by: Chen, Anthony, et al.
Published: (2026)
by: Chen, Anthony, et al.
Published: (2026)
CoSyncDiT: Cognitive Synchronous Diffusion Transformer for Movie Dubbing
by: Cong, Gaoxiang, et al.
Published: (2026)
by: Cong, Gaoxiang, et al.
Published: (2026)
InfiniteTalk: Audio-driven Video Generation for Sparse-Frame Video Dubbing
by: Yang, Shaoshu, et al.
Published: (2025)
by: Yang, Shaoshu, et al.
Published: (2025)
Дуб Ісай Давидович [Dub Isai Davydovych]
by: Рікун, Інна Еміліївна
Published: (2008)
by: Рікун, Інна Еміліївна
Published: (2008)
Bridging Dictionary: AI-Generated Dictionary of Partisan Language Use
by: Jiang, Hang, et al.
Published: (2024)
by: Jiang, Hang, et al.
Published: (2024)
AudienceView: AI-Assisted Interpretation of Audience Feedback in Journalism
by: Brannon, William, et al.
Published: (2024)
by: Brannon, William, et al.
Published: (2024)
MM-MovieDubber: Towards Multi-Modal Learning for Multi-Modal Movie Dubbing
by: Zheng, Junjie, et al.
Published: (2025)
by: Zheng, Junjie, et al.
Published: (2025)
Prosody-Enhanced Acoustic Pre-training and Acoustic-Disentangled Prosody Adapting for Movie Dubbing
by: Zhang, Zhedong, et al.
Published: (2025)
by: Zhang, Zhedong, et al.
Published: (2025)
From Inpainting to Editing: Unlocking Robust Mask-Free Visual Dubbing via Generative Bootstrapping
by: He, Xu, et al.
Published: (2025)
by: He, Xu, et al.
Published: (2025)
MuseTalk: Real-Time High-Fidelity Video Dubbing via Spatio-Temporal Sampling
by: Zhang, Yue, et al.
Published: (2024)
by: Zhang, Yue, et al.
Published: (2024)
Rethinking Retrieval-Augmented Generation for Medicine: A Large-Scale, Systematic Expert Evaluation and Practical Insights
by: Kim, Hyunjae, et al.
Published: (2025)
by: Kim, Hyunjae, et al.
Published: (2025)
The Translation of Culture-bound References for Dubbing: A Model for the Analysis
by: Jurgita Astrauskienė
Published: (2022)
by: Jurgita Astrauskienė
Published: (2022)
Fostering EFL Learners' Speaking Skills and Flow Experience With Video‐Dubbing Tasks: A Flow Theory Perspective
by: Gwo‐Jen Hwang, et al.
Published: (2025)
by: Gwo‐Jen Hwang, et al.
Published: (2025)
ConGraT: Self-Supervised Contrastive Pretraining for Joint Graph and Text Embeddings
by: Brannon, William, et al.
Published: (2023)
by: Brannon, William, et al.
Published: (2023)
Can Hierarchical Cross-Modal Fusion Predict Human Perception of AI Dubbed Content?
by: Dasare, Ashwini, et al.
Published: (2026)
by: Dasare, Ashwini, et al.
Published: (2026)
Introducción a la poesía Dub: Linton Kwesi Johnson
by: Arnaldo Valero
Published: (2005)
by: Arnaldo Valero
Published: (2005)
Joint Multi-scale Cross-lingual Speaking Style Transfer with Bidirectional Attention Mechanism for Automatic Dubbing
by: Li, Jingbei, et al.
Published: (2023)
by: Li, Jingbei, et al.
Published: (2023)
Multilingual Machine Translation with Open Large Language Models at Practical Scale: An Empirical Study
by: Cui, Menglong, et al.
Published: (2025)
by: Cui, Menglong, et al.
Published: (2025)
Verifying Claims About Metaphors with Large-Scale Automatic Metaphor Identification
by: Aono, Kotaro, et al.
Published: (2024)
by: Aono, Kotaro, et al.
Published: (2024)
Similar Items
-
Dub-S2ST: Textless Speech-to-Speech Translation for Seamless Dubbing
by: Choi, Jeongsoo, et al.
Published: (2025) -
StyleDubber: Towards Multi-Scale Style Learning for Movie Dubbing
by: Cong, Gaoxiang, et al.
Published: (2024) -
ANIM-400K: A Large-Scale Dataset for Automated End-To-End Dubbing of Video
by: Cai, Kevin, et al.
Published: (2024) -
Length Aware Speech Translation for Video Dubbing
by: Chadha, Harveen Singh, et al.
Published: (2025) -
Towards Expressive Video Dubbing with Multiscale Multimodal Context Interaction
by: Zhao, Yuan, et al.
Published: (2024)