:: Library Catalog

Image de couverture de livre

Enregistré dans:

Détails bibliographiques
Auteurs principaux:	Zhang, Tianyi, Simoulin, Antoine, Li, Kai, Lakdawala, Sana, Yu, Shiqing, Mittal, Arpit, Fu, Hongyu, Lin, Yu
Format:	Preprint
Publié:	2026
Sujets:	Computer Vision and Pattern Recognition
Accès en ligne:	https://arxiv.org/abs/2602.00531
Tags:	Ajouter un tag Pas de tags, Soyez le premier à ajouter un tag!

Documents similaires

Decomposed Vision-Language Alignment for Fine-Grained Open-Vocabulary Segmentation
par: Wang, Chenhao, et autres
Publié: (2026)

Fine-Grained Open-Vocabulary Object Detection with Fined-Grained Prompts: Task, Dataset and Benchmark
par: Liu, Ying, et autres
Publié: (2025)

OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding
par: Deng, Yinan, et autres
Publié: (2024)

Memory-Efficient Fine-Tuning of Transformers via Token Selection
par: Simoulin, Antoine, et autres
Publié: (2025)

Open-Vocabulary Object Detection via Language Hierarchy
par: Huang, Jiaxing, et autres
Publié: (2024)

GUIDED: Granular Understanding via Identification, Detection, and Discrimination for Fine-Grained Open-Vocabulary Object Detection
par: Li, Jiaming, et autres
Publié: (2026)

FGAseg: Fine-Grained Pixel-Text Alignment for Open-Vocabulary Semantic Segmentation
par: Li, Bingyu, et autres
Publié: (2025)

Fine-Grained Open-Vocabulary Object Recognition via User-Guided Segmentation
par: Ahn, Jinwoo, et autres
Publié: (2024)

Dynamic-DINO: Fine-Grained Mixture of Experts Tuning for Real-time Open-Vocabulary Object Detection
par: Lu, Yehao, et autres
Publié: (2025)

Open-Vocabulary Object Detection via Neighboring Region Attention Alignment
par: Qiang, Sunyuan, et autres
Publié: (2024)

Learning Fine-Grained Correspondence with Cross-Perspective Perception for Open-Vocabulary 6D Object Pose Estimation
par: Qin, Yu, et autres
Publié: (2026)

Unlocking Textual and Visual Wisdom: Open-Vocabulary 3D Object Detection Enhanced by Comprehensive Guidance from Text and Image
par: Jiao, Pengkun, et autres
Publié: (2024)

WeDetect: Fast Open-Vocabulary Object Detection as Retrieval
par: Fu, Shenghao, et autres
Publié: (2025)

Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation
par: Zheng, Yanhao, et autres
Publié: (2024)

Hierarchical Cross-Modal Alignment for Open-Vocabulary 3D Object Detection
par: Zhao, Youjun, et autres
Publié: (2025)

Scaling Open-Vocabulary Object Detection
par: Minderer, Matthias, et autres
Publié: (2023)

FOR: Finetuning for Object Level Open Vocabulary Image Retrieval
par: Levi, Hila, et autres
Publié: (2024)

OpenM3D: Open Vocabulary Multi-view Indoor 3D Object Detection without Human Annotations
par: Hsu, Peng-Hao, et autres
Publié: (2025)

LLMs Meet VLMs: Boost Open Vocabulary Object Detection with Fine-grained Descriptors
par: Jin, Sheng, et autres
Publié: (2024)

ABRA: Teleporting Fine-Tuned Knowledge Across Domains for Open-Vocabulary Object Detection
par: Bernardi, Mattia, et autres
Publié: (2026)

Parameter-Efficient Semantic Augmentation for Enhancing Open-Vocabulary Object Detection
par: Cao, Weihao, et autres
Publié: (2026)

RTGen: Generating Region-Text Pairs for Open-Vocabulary Object Detection
par: Chen, Fangyi, et autres
Publié: (2024)

A Hierarchical Semantic Distillation Framework for Open-Vocabulary Object Detection
par: Fu, Shenghao, et autres
Publié: (2025)

Retrieval-Augmented Open-Vocabulary Object Detection
par: Kim, Jooyeon, et autres
Publié: (2024)

Learning to Detect and Segment for Open Vocabulary Object Detection
par: Wang, Tao, et autres
Publié: (2022)

OV-SCAN: Semantically Consistent Alignment for Novel Object Discovery in Open-Vocabulary 3D Object Detection
par: Chow, Adrian, et autres
Publié: (2025)

ODOV: Benchmark the Open-Domain Open-Vocabulary Object Detection
par: Zhang, Yupeng, et autres
Publié: (2025)

Open-Vocabulary Camouflaged Object Segmentation with Cascaded Vision Language Models
par: Zhao, Kai, et autres
Publié: (2025)

Exploring Open-Vocabulary Object Recognition in Images using CLIP
par: Chen, Wei Yu, et autres
Publié: (2026)

Prototype-Aware Multimodal Alignment for Open-Vocabulary Visual Grounding
par: Xie, Jiangnan, et autres
Publié: (2025)

State and Scene Enhanced Prototypes for Weakly Supervised Open-Vocabulary Object Detection
par: Zhou, Jiaying, et autres
Publié: (2025)

Adapting Vision-Language Model with Fine-grained Semantics for Open-Vocabulary Segmentation
par: Chng, Yong Xien, et autres
Publié: (2024)

Exploiting VLM Localizability and Semantics for Open Vocabulary Action Detection
par: Bao, Wentao, et autres
Publié: (2024)

Collaborative Novel Object Discovery and Box-Guided Cross-Modal Alignment for Open-Vocabulary 3D Object Detection
par: Cao, Yang, et autres
Publié: (2024)

Decompose and Transfer: CoT-Prompting Enhanced Alignment for Open-Vocabulary Temporal Action Detection
par: Zhu, Sa, et autres
Publié: (2026)

Taming Self-Training for Open-Vocabulary Object Detection
par: Zhao, Shiyu, et autres
Publié: (2023)

Sampling Bag of Views for Open-Vocabulary Object Detection
par: Choi, Hojun, et autres
Publié: (2024)

Streamlined Open-Vocabulary Human-Object Interaction Detection
par: Sun, Chang, et autres
Publié: (2026)

Open Vocabulary Monocular 3D Object Detection
par: Yao, Jin, et autres
Publié: (2024)

From Open Vocabulary to Open World: Teaching Vision Language Models to Detect Novel Objects
par: Li, Zizhao, et autres
Publié: (2024)