Chowdhury, S., Yang, K. D., Liu, X., Faghri, F., Vasu, P. K. A., Tuzel, O., . . . Vemulapalli, R. (2025). AMUSE: Audio-Visual Benchmark and Alignment Framework for Agentic Multi-Speaker Understanding.
Chicago Style (17th ed.) CitationChowdhury, Sanjoy, Karren D. Yang, Xudong Liu, Fartash Faghri, Pavan Kumar Anasosalu Vasu, Oncel Tuzel, Dinesh Manocha, Chun-Liang Li, and Raviteja Vemulapalli. AMUSE: Audio-Visual Benchmark and Alignment Framework for Agentic Multi-Speaker Understanding. 2025.
MLA (9th ed.) CitationChowdhury, Sanjoy, et al. AMUSE: Audio-Visual Benchmark and Alignment Framework for Agentic Multi-Speaker Understanding. 2025.
Warning: These citations may not always be 100% accurate.