Woo, B., Wang, Z., Pak, B., Mo, S., & Yu, S. X. (2026). Aligning Forest and Trees in Images & Long Captions for Visually Grounded Understanding.
Chicago Style (17th ed.) CitationWoo, Byeongju, Zilin Wang, Byeonghyun Pak, Sangwoo Mo, and Stella X. Yu. Aligning Forest and Trees in Images & Long Captions for Visually Grounded Understanding. 2026.
MLA (9th ed.) CitationWoo, Byeongju, et al. Aligning Forest and Trees in Images & Long Captions for Visually Grounded Understanding. 2026.
Warning: These citations may not always be 100% accurate.