Wang, H., Wei, X., He, J., Bai, C., Fan, C., Cao, J., . . . Zhang, S. (2026). VEGA: Visual Encoder Grounding Alignment for Spatially-Aware Vision-Language-Action Models.
Chicago Style (17th ed.) CitationWang, Hao, et al. VEGA: Visual Encoder Grounding Alignment for Spatially-Aware Vision-Language-Action Models. 2026.
MLA (9th ed.) CitationWang, Hao, et al. VEGA: Visual Encoder Grounding Alignment for Spatially-Aware Vision-Language-Action Models. 2026.
Warning: These citations may not always be 100% accurate.