Xie, Z., Liu, X., Zhang, B., Lin, Y., Cai, S., & Jin, T. (2026). HVD: Human Vision-Driven Video Representation Learning for Text-Video Retrieval.
Chicago Style (17th ed.) CitationXie, Zequn, Xin Liu, Boyun Zhang, Yuxiao Lin, Sihang Cai, and Tao Jin. HVD: Human Vision-Driven Video Representation Learning for Text-Video Retrieval. 2026.
MLA (9th ed.) CitationXie, Zequn, et al. HVD: Human Vision-Driven Video Representation Learning for Text-Video Retrieval. 2026.
Warning: These citations may not always be 100% accurate.