Ye, J., Cong, G., Wang, C., Wen, X., Li, Z., Cao, B., & Shan, H. (2026). Hierarchical Codec Diffusion for Video-to-Speech Generation.
Chicago Style (17th ed.) CitationYe, Jiaxin, Gaoxiang Cong, Chenhui Wang, Xin-Cheng Wen, Zhaoyang Li, Boyuan Cao, and Hongming Shan. Hierarchical Codec Diffusion for Video-to-Speech Generation. 2026.
MLA (9th ed.) CitationYe, Jiaxin, et al. Hierarchical Codec Diffusion for Video-to-Speech Generation. 2026.
Warning: These citations may not always be 100% accurate.