Liu, Q., Feng, J., Wang, Y., Han, X., Cheng, Y., Zhu, Y., . . . Lu, H. (2026). VISTA-Bench: Do Vision-Language Models Really Understand Visualized Text as Well as Pure Text?
Chicago Style (17th ed.) CitationLiu, Qing'an, Juntong Feng, Yuhao Wang, Xinzhe Han, Yujie Cheng, Yue Zhu, Haiwen Diao, Yunzhi Zhuge, and Huchuan Lu. VISTA-Bench: Do Vision-Language Models Really Understand Visualized Text as Well as Pure Text? 2026.
MLA (9th ed.) CitationLiu, Qing'an, et al. VISTA-Bench: Do Vision-Language Models Really Understand Visualized Text as Well as Pure Text? 2026.
Warning: These citations may not always be 100% accurate.