Li, X., Hu, S., Feng, X., Zhang, D., Wu, M., Zhang, J., & Huang, K. (2024). DTVLT: A Multi-modal Diverse Text Benchmark for Visual Language Tracking Based on LLM.
Chicago Style (17th ed.) Citation
Li, Xuchen, Shiyu Hu, Xiaokun Feng, Dailing Zhang, Meiqi Wu, Jing Zhang, and Kaiqi Huang. DTVLT: A Multi-modal Diverse Text Benchmark for Visual Language Tracking Based on LLM. 2024.
MLA (9th ed.) Citation
Li, Xuchen, et al. DTVLT: A Multi-modal Diverse Text Benchmark for Visual Language Tracking Based on LLM. 2024.