Ko, D., Kim, S., Suh, Y., G, V. K. B., Yoon, M., Chandraker, M., & Kim, H. J. (2025). ST-VLM: Kinematic Instruction Tuning for Spatio-Temporal Reasoning in Vision-Language Models.
Chicago Style (17th ed.) Citation: Ko, Dohwan, Sihyeon Kim, Yumin Suh, Vijay Kumar B. G, Minseo Yoon, Manmohan Chandraker, and Hyunwoo J. Kim. ST-VLM: Kinematic Instruction Tuning for Spatio-Temporal Reasoning in Vision-Language Models. 2025.
MLA (9th ed.) Citation: Ko, Dohwan, et al. ST-VLM: Kinematic Instruction Tuning for Spatio-Temporal Reasoning in Vision-Language Models. 2025.