Mu, L., Liu, B., Zhang, R., Mo, G., Jin, J., Zhang, K., & Huang, H. (2025). FLAP: Fully-controllable Audio-driven Portrait Video Generation through 3D head conditioned diffusion model.
Chicago Style (17th ed.) CitationMu, Lingzhou, Baiji Liu, Ruonan Zhang, Guiming Mo, Jiawei Jin, Kai Zhang, and Haozhi Huang. FLAP: Fully-controllable Audio-driven Portrait Video Generation Through 3D Head Conditioned Diffusion Model. 2025.
MLA (9th ed.) CitationMu, Lingzhou, et al. FLAP: Fully-controllable Audio-driven Portrait Video Generation Through 3D Head Conditioned Diffusion Model. 2025.
Warning: These citations may not always be 100% accurate.