APA (7th ed.) Citation

Pham, T.-H., Le, H.-N., Nguyen, P.-V., Ngo, C., & Hy, T.-S. (2024). SilVar: Speech driven multimodal model for reasoning visual question answering and object localization.

Chicago Style (17th ed.) Citation

Pham, Tan-Hanh, Hoang-Nam Le, Phu-Vinh Nguyen, Chris Ngo, and Truong-Son Hy. SilVar: Speech Driven Multimodal Model for Reasoning Visual Question Answering and Object Localization. 2024.

MLA (9th ed.) Citation

Pham, Tan-Hanh, et al. SilVar: Speech Driven Multimodal Model for Reasoning Visual Question Answering and Object Localization. 2024.
