Yang, L., Dai, Y., Yan, A., Prabhu, V., Xu, R., & Chen, Z. (2026). How Far Are Vision-Language Models from Constructing the Real World? A Benchmark for Physical Generative Reasoning.
Chicago Style (17th ed.) CitationYang, Luyu, Yutong Dai, An Yan, Viraj Prabhu, Ran Xu, and Zeyuan Chen. How Far Are Vision-Language Models from Constructing the Real World? A Benchmark for Physical Generative Reasoning. 2026.
MLA (9th ed.) CitationYang, Luyu, et al. How Far Are Vision-Language Models from Constructing the Real World? A Benchmark for Physical Generative Reasoning. 2026.
Warning: These citations may not always be 100% accurate.