Cao, Y., Liu, Y., Chen, Z., Shi, G., Wang, W., Zhao, D., & Lu, T. (2024). MMFuser: Multimodal Multi-Layer Feature Fuser for Fine-Grained Vision-Language Understanding.
Chicago Style (17th ed.) CitationCao, Yue, Yangzhou Liu, Zhe Chen, Guangchen Shi, Wenhai Wang, Danhuai Zhao, and Tong Lu. MMFuser: Multimodal Multi-Layer Feature Fuser for Fine-Grained Vision-Language Understanding. 2024.
MLA (9th ed.) CitationCao, Yue, et al. MMFuser: Multimodal Multi-Layer Feature Fuser for Fine-Grained Vision-Language Understanding. 2024.
Warning: These citations may not always be 100% accurate.