Sun, Y., Cai, C., Zhang, J., Ye, Z., Yuan, X., & Liu, F. (2026). Let's Roll a BiFTA: Bi-refinement for Fine-grained Text-visual Alignment in Vision-Language Models.
Chicago Style (17th ed.) CitationSun, Yuhao, Chengyi Cai, Jiacheng Zhang, Zesheng Ye, Xingliang Yuan, and Feng Liu. Let's Roll a BiFTA: Bi-refinement for Fine-grained Text-visual Alignment in Vision-Language Models. 2026.
MLA (9th ed.) CitationSun, Yuhao, et al. Let's Roll a BiFTA: Bi-refinement for Fine-grained Text-visual Alignment in Vision-Language Models. 2026.
Warning: These citations may not always be 100% accurate.