Dai, Y., Wu, Z., Zeng, B., Hua, D., Liu, J., Li, B., . . . Zhang, W. (2026). LatentOmni: Rethinking Omni-Modal Understanding via Unified Audio-Visual Latent Reasoning.
Citación estilo ChicagoDai, Yifan, et al. LatentOmni: Rethinking Omni-Modal Understanding via Unified Audio-Visual Latent Reasoning. 2026.
Cita MLADai, Yifan, et al. LatentOmni: Rethinking Omni-Modal Understanding via Unified Audio-Visual Latent Reasoning. 2026.
Warning: These citations may not always be 100% accurate.