Gao, X., Dai, Y., Qiu, B., Wang, L., Qiu, H., & Li, H. (2025). Modulating CNN Features with Pre-Trained ViT Representations for Open-Vocabulary Object Detection.
Chicago Style (17th ed.) Citation: Gao, Xiangyu, Yu Dai, Benliu Qiu, Lanxiao Wang, Heqian Qiu, and Hongliang Li. Modulating CNN Features with Pre-Trained ViT Representations for Open-Vocabulary Object Detection. 2025.
MLA (9th ed.) Citation: Gao, Xiangyu, et al. Modulating CNN Features with Pre-Trained ViT Representations for Open-Vocabulary Object Detection. 2025.