Jin, P., Takanobu, R., Zhang, W., Cao, X., & Yuan, L. (2023). Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding.
Chicago Style (17th ed.) Citation: Jin, Peng, Ryuichi Takanobu, Wancai Zhang, Xiaochun Cao, and Li Yuan. Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding. 2023.
MLA (9th ed.) Citation: Jin, Peng, et al. Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding. 2023.