Xing, Z., Hu, X., Fu, C.-W., Wang, W., Dai, J., & Heng, P.-A. (2025). EchoInk-R1: Exploring Audio-Visual Reasoning in Multimodal LLMs via Reinforcement Learning.
Chicago Style (17th ed.) Citation: Xing, Zhenghao, Xiaowei Hu, Chi-Wing Fu, Wenhai Wang, Jifeng Dai, and Pheng-Ann Heng. EchoInk-R1: Exploring Audio-Visual Reasoning in Multimodal LLMs via Reinforcement Learning. 2025.
MLA (9th ed.) Citation: Xing, Zhenghao, et al. EchoInk-R1: Exploring Audio-Visual Reasoning in Multimodal LLMs via Reinforcement Learning. 2025.