Gong, C., Wang, D., Wei, Z., Guo, Y., Zhu, H., & Chen, J. (2025). EchoingPixels: Cross-Modal Adaptive Token Reduction for Efficient Audio-Visual LLMs.
Chicago Style (17th ed.) CitationGong, Chao, Depeng Wang, Zhipeng Wei, Ya Guo, Huijia Zhu, and Jingjing Chen. EchoingPixels: Cross-Modal Adaptive Token Reduction for Efficient Audio-Visual LLMs. 2025.
MLA (9th ed.) CitationGong, Chao, et al. EchoingPixels: Cross-Modal Adaptive Token Reduction for Efficient Audio-Visual LLMs. 2025.
Warning: These citations may not always be 100% accurate.