Saved in:
| Main Authors: | Liu, Rui, Zhao, Yuan, Jia, Zhenqi |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2511.14249 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Towards Expressive Video Dubbing with Multiscale Multimodal Context Interaction
by: Zhao, Yuan, et al.
Published: (2024)
by: Zhao, Yuan, et al.
Published: (2024)
StyleDubber: Towards Multi-Scale Style Learning for Movie Dubbing
by: Cong, Gaoxiang, et al.
Published: (2024)
by: Cong, Gaoxiang, et al.
Published: (2024)
Retrieval-Augmented Dialogue Knowledge Aggregation for Expressive Conversational Speech Synthesis
by: Liu, Rui, et al.
Published: (2025)
by: Liu, Rui, et al.
Published: (2025)
Intra- and Inter-modal Context Interaction Modeling for Conversational Speech Synthesis
by: Jia, Zhenqi, et al.
Published: (2024)
by: Jia, Zhenqi, et al.
Published: (2024)
MCDubber: Multimodal Context-Aware Expressive Video Dubbing
by: Zhao, Yuan, et al.
Published: (2024)
by: Zhao, Yuan, et al.
Published: (2024)
Multimodal Fine-grained Context Interaction Graph Modeling for Conversational Speech Synthesis
by: Jia, Zhenqi, et al.
Published: (2025)
by: Jia, Zhenqi, et al.
Published: (2025)
MM-MovieDubber: Towards Multi-Modal Learning for Multi-Modal Movie Dubbing
by: Zheng, Junjie, et al.
Published: (2025)
by: Zheng, Junjie, et al.
Published: (2025)
SyncVoice: Towards Video Dubbing with Vision-Augmented Pretrained TTS Model
by: Wang, Kaidi, et al.
Published: (2025)
by: Wang, Kaidi, et al.
Published: (2025)
IBSEN: Director-Actor Agent Collaboration for Controllable and Interactive Drama Script Generation
by: Han, Senyu, et al.
Published: (2024)
by: Han, Senyu, et al.
Published: (2024)
Dubbing in Practice: A Large Scale Study of Human Localization With Insights for Automatic Dubbing
by: Brannon, William, et al.
Published: (2022)
by: Brannon, William, et al.
Published: (2022)
Dub-S2ST: Textless Speech-to-Speech Translation for Seamless Dubbing
by: Choi, Jeongsoo, et al.
Published: (2025)
by: Choi, Jeongsoo, et al.
Published: (2025)
MovieCORE: COgnitive REasoning in Movies
by: Faure, Gueter Josmy, et al.
Published: (2025)
by: Faure, Gueter Josmy, et al.
Published: (2025)
Emphasis Rendering for Conversational Text-to-Speech with Multi-modal Multi-scale Context Modeling
by: Liu, Rui, et al.
Published: (2024)
by: Liu, Rui, et al.
Published: (2024)
Logic-Oriented Retriever Enhancement via Contrastive Learning
by: Zhang, Wenxuan, et al.
Published: (2026)
by: Zhang, Wenxuan, et al.
Published: (2026)
GIP-RAG: An Evidence-Grounded Retrieval-Augmented Framework for Interpretable Gene Interaction and Pathway Impact Analysis
by: Jia, Fujian, et al.
Published: (2026)
by: Jia, Fujian, et al.
Published: (2026)
Towards Visually-Guided Movie Subtitle Translation for Indic Languages
by: Chintada, Tarun, et al.
Published: (2026)
by: Chintada, Tarun, et al.
Published: (2026)
Corrective Retrieval Augmented Generation
by: Yan, Shi-Qi, et al.
Published: (2024)
by: Yan, Shi-Qi, et al.
Published: (2024)
MovieSum: An Abstractive Summarization Dataset for Movie Screenplays
by: Saxena, Rohit, et al.
Published: (2024)
by: Saxena, Rohit, et al.
Published: (2024)
On Retrieval Augmentation and the Limitations of Language Model Training
by: Chiang, Ting-Rui, et al.
Published: (2023)
by: Chiang, Ting-Rui, et al.
Published: (2023)
Curriculum Guided Reinforcement Learning for Efficient Multi Hop Retrieval Augmented Generation
by: Ji, Yuelyu, et al.
Published: (2025)
by: Ji, Yuelyu, et al.
Published: (2025)
Toward Faithful Retrieval-Augmented Generation with Sparse Autoencoders
by: Xiong, Guangzhi, et al.
Published: (2025)
by: Xiong, Guangzhi, et al.
Published: (2025)
Fine-grained Video Dubbing Duration Alignment with Segment Supervised Preference Optimization
by: Cui, Chaoqun, et al.
Published: (2025)
by: Cui, Chaoqun, et al.
Published: (2025)
Lighter And Better: Towards Flexible Context Adaptation For Retrieval Augmented Generation
by: Liu, Zheng, et al.
Published: (2024)
by: Liu, Zheng, et al.
Published: (2024)
Language-Coupled Reinforcement Learning for Multilingual Retrieval-Augmented Generation
by: Qi, Rui, et al.
Published: (2026)
by: Qi, Rui, et al.
Published: (2026)
NaviRAG: Towards Active Knowledge Navigation for Retrieval-Augmented Generation
by: Dai, Jihao, et al.
Published: (2026)
by: Dai, Jihao, et al.
Published: (2026)
RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation
by: Chan, Chi-Min, et al.
Published: (2024)
by: Chan, Chi-Min, et al.
Published: (2024)
Movie101v2: Improved Movie Narration Benchmark
by: Yue, Zihao, et al.
Published: (2024)
by: Yue, Zihao, et al.
Published: (2024)
Detecting Hallucinations in Authentic LLM-Human Interactions
by: Ren, Yujie, et al.
Published: (2025)
by: Ren, Yujie, et al.
Published: (2025)
Bridging Relevance and Reasoning: Rationale Distillation in Retrieval-Augmented Generation
by: Jia, Pengyue, et al.
Published: (2024)
by: Jia, Pengyue, et al.
Published: (2024)
Length Aware Speech Translation for Video Dubbing
by: Chadha, Harveen Singh, et al.
Published: (2025)
by: Chadha, Harveen Singh, et al.
Published: (2025)
Toward Robust RALMs: Revealing the Impact of Imperfect Retrieval on Retrieval-Augmented Language Models
by: Park, Seong-Il, et al.
Published: (2024)
by: Park, Seong-Il, et al.
Published: (2024)
CoSyncDiT: Cognitive Synchronous Diffusion Transformer for Movie Dubbing
by: Cong, Gaoxiang, et al.
Published: (2026)
by: Cong, Gaoxiang, et al.
Published: (2026)
Steering Over-refusals Towards Safety in Retrieval Augmented Generation
by: Maskey, Utsav, et al.
Published: (2025)
by: Maskey, Utsav, et al.
Published: (2025)
Learning to Extract Rational Evidence via Reinforcement Learning for Retrieval-Augmented Generation
by: Zhao, Xinping, et al.
Published: (2025)
by: Zhao, Xinping, et al.
Published: (2025)
Toward Structured Knowledge Reasoning: Contrastive Retrieval-Augmented Generation on Experience
by: Gu, Jiawei, et al.
Published: (2025)
by: Gu, Jiawei, et al.
Published: (2025)
DocCHA: Towards LLM-Augmented Interactive Online diagnosis System
by: Liu, Xinyi, et al.
Published: (2025)
by: Liu, Xinyi, et al.
Published: (2025)
Towards Trustworthy Retrieval Augmented Generation for Large Language Models: A Survey
by: Ni, Bo, et al.
Published: (2025)
by: Ni, Bo, et al.
Published: (2025)
An Empirical Study of Retrieval Augmented Generation with Chain-of-Thought
by: Zhao, Yuetong, et al.
Published: (2024)
by: Zhao, Yuetong, et al.
Published: (2024)
Enhancing Retrieval-Augmented LMs with a Two-stage Consistency Learning Compressor
by: Xu, Chuankai, et al.
Published: (2024)
by: Xu, Chuankai, et al.
Published: (2024)
MSRS: Evaluating Multi-Source Retrieval-Augmented Generation
by: Phanse, Rohan, et al.
Published: (2025)
by: Phanse, Rohan, et al.
Published: (2025)
Similar Items
-
Towards Expressive Video Dubbing with Multiscale Multimodal Context Interaction
by: Zhao, Yuan, et al.
Published: (2024) -
StyleDubber: Towards Multi-Scale Style Learning for Movie Dubbing
by: Cong, Gaoxiang, et al.
Published: (2024) -
Retrieval-Augmented Dialogue Knowledge Aggregation for Expressive Conversational Speech Synthesis
by: Liu, Rui, et al.
Published: (2025) -
Intra- and Inter-modal Context Interaction Modeling for Conversational Speech Synthesis
by: Jia, Zhenqi, et al.
Published: (2024) -
MCDubber: Multimodal Context-Aware Expressive Video Dubbing
by: Zhao, Yuan, et al.
Published: (2024)