Ilaslan, M. F., Koksal, A., Lin, K. Q., Satar, B., Shou, M. Z., & Xu, Q. (2024). VG-TVP: Multimodal Procedural Planning via Visually Grounded Text-Video Prompting.
Chicago Style (17th ed.) CitationIlaslan, Muhammet Furkan, Ali Koksal, Kevin Qinhong Lin, Burak Satar, Mike Zheng Shou, and Qianli Xu. VG-TVP: Multimodal Procedural Planning via Visually Grounded Text-Video Prompting. 2024.
MLA (9th ed.) CitationIlaslan, Muhammet Furkan, et al. VG-TVP: Multimodal Procedural Planning via Visually Grounded Text-Video Prompting. 2024.
Warning: These citations may not always be 100% accurate.