Liu, X., Zhou, L., Zhou, Z., Chen, J., & He, Z. (2024). MambaVLT: Time-Evolving Multimodal State Space Model for Vision-Language Tracking.
Chicago Style (17th ed.) CitationLiu, Xinqi, Li Zhou, Zikun Zhou, Jianqiu Chen, and Zhenyu He. MambaVLT: Time-Evolving Multimodal State Space Model for Vision-Language Tracking. 2024.
MLA (9th ed.) CitationLiu, Xinqi, et al. MambaVLT: Time-Evolving Multimodal State Space Model for Vision-Language Tracking. 2024.
Warning: These citations may not always be 100% accurate.