Xi, S., Yang, C., Ding, H., Ni, Y., Liu, C. C., Liu, Y., & Zhang, C. (2025). Taming a Retrieval Framework to Read Images in Humanlike Manner for Augmenting Generation of MLLMs.
Chicago Style (17th ed.) Citation: Xi, Suyang, Chenxi Yang, Hong Ding, Yiqing Ni, Catherine C. Liu, Yunhao Liu, and Chengqi Zhang. Taming a Retrieval Framework to Read Images in Humanlike Manner for Augmenting Generation of MLLMs. 2025.
MLA (9th ed.) Citation: Xi, Suyang, et al. Taming a Retrieval Framework to Read Images in Humanlike Manner for Augmenting Generation of MLLMs. 2025.