Guan, K., Wang, X., Lai, Z., Cheng, X., Zhang, P., Liu, X., . . . Cao, M. (2025). Taming Text-to-Sounding Video Generation via Advanced Modality Condition and Interaction.
Chicago Style (17th ed.) Citation: Guan, Kaisi, Xihua Wang, Zhengfeng Lai, Xin Cheng, Peng Zhang, XiaoJiang Liu, Ruihua Song, and Meng Cao. Taming Text-to-Sounding Video Generation via Advanced Modality Condition and Interaction. 2025.
MLA (9th ed.) Citation: Guan, Kaisi, et al. Taming Text-to-Sounding Video Generation via Advanced Modality Condition and Interaction. 2025.