:: Library Catalog

Bibliographic Details
Main Authors:	Liu, Mingxuan; Hayes, Tyler L.; Ricci, Elisa; Csurka, Gabriela; Volpi, Riccardo
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2405.10053

Similar Items

Test-time Vocabulary Adaptation for Language-driven Object Detection
by: Liu, Mingxuan, et al.
Published: (2025)

What could go wrong? Discovering and describing failure modes in computer vision
by: Csurka, Gabriela, et al.
Published: (2024)

Compositional Caching for Training-free Open-vocabulary Attribute Detection
by: Garosi, Marco, et al.
Published: (2025)

Can we make NeRF-based visual localization privacy-preserving?
by: Pietrantoni, Maxime, et al.
Published: (2025)

PANDAS: Prototype-based Novel Class Discovery and Detection
by: Hayes, Tyler L., et al.
Published: (2024)

Superpowering Open-Vocabulary Object Detectors for X-ray Vision
by: Garcia-Fernandez, Pablo, et al.
Published: (2025)

Gaussian Splatting Feature Fields for Privacy-Preserving Visual Localization
by: Pietrantoni, Maxime, et al.
Published: (2025)

Semantic-CD: Remote Sensing Image Semantic Change Detection towards Open-vocabulary Setting
by: Zhu, Yongshuo, et al.
Published: (2025)

OV-NeRF: Open-vocabulary Neural Radiance Fields with Vision and Language Foundation Models for 3D Semantic Understanding
by: Liao, Guibiao, et al.
Published: (2024)

Dynamic Scoring with Enhanced Semantics for Training-Free Human-Object Interaction Detection
by: Tonini, Francesco, et al.
Published: (2025)

DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection
by: Yao, Lewei, et al.
Published: (2024)

Incremental Object-Based Novelty Detection with Feedback Loop
by: Caldarella, Simone, et al.
Published: (2023)

Self-supervised Learning of Neural Implicit Feature Fields for Camera Pose Refinement
by: Pietrantoni, Maxime, et al.
Published: (2024)

Training-Free Semantic Multi-Object Tracking with Vision-Language Models
by: Bonat, Laurence, et al.
Published: (2026)

IGLOSS: Image Generation for Lidar Open-vocabulary Semantic Segmentation
by: Samet, Nermin, et al.
Published: (2026)

Open-RGBT: Open-vocabulary RGB-T Zero-shot Semantic Segmentation in Open-world Environments
by: Yu, Meng, et al.
Published: (2024)

FLOSS: Free Lunch in Open-vocabulary Semantic Segmentation
by: Benigmim, Yasser, et al.
Published: (2025)

DiSCO-3D : Discovering and segmenting Sub-Concepts from Open-vocabulary queries in NeRF
by: Petit, Doriand, et al.
Published: (2025)

3D Object Detection from Images for Autonomous Driving: A Survey
by: Ma, Xinzhu, et al.
Published: (2022)

OpenVIS: Open-vocabulary Video Instance Segmentation
by: Guo, Pinxue, et al.
Published: (2023)

ARM: A Learnable, Plug-and-Play Module for CLIP-based Open-vocabulary Semantic Segmentation
by: Liu, Ziquan, et al.
Published: (2025)

Semantic Consistent Language Gaussian Splatting for Point-Level Open-vocabulary Querying
by: Yin, Hairong, et al.
Published: (2025)

Parameter-efficient Fine-tuning in Hyperspherical Space for Open-vocabulary Semantic Segmentation
by: Peng, Zelin, et al.
Published: (2024)

Open-Vocabulary Object Detection via Language Hierarchy
by: Huang, Jiaxing, et al.
Published: (2024)

Multi-Grained Cross-modal Alignment for Learning Open-vocabulary Semantic Segmentation from Text Supervision
by: Liu, Yajie, et al.
Published: (2024)

Open-vocabulary vs. Closed-set: Best Practice for Few-shot Object Detection Considering Text Describability
by: Hosoya, Yusuke, et al.
Published: (2024)

METOR: A Unified Framework for Mutual Enhancement of Objects and Relationships in Open-vocabulary Video Visual Relationship Detection
by: Wang, Yongqi, et al.
Published: (2025)

From Semantics, Scene to Instance-awareness: Distilling Foundation Model for Grounded Open-vocabulary Situation Recognition
by: Cai, Chen, et al.
Published: (2025)

Placing Objects in Context via Inpainting for Out-of-distribution Segmentation
by: de Jorge, Pau, et al.
Published: (2024)

Democratizing Fine-grained Visual Recognition with Large Language Models
by: Liu, Mingxuan, et al.
Published: (2024)

Organizing Unstructured Image Collections using Natural Language
by: Liu, Mingxuan, et al.
Published: (2024)

Large-scale Pre-trained Models are Surprisingly Strong in Incremental Novel Class Discovery
by: Liu, Mingxuan, et al.
Published: (2023)

OpenTie: Open-vocabulary Sequential Rebar Tying System
by: Liu, Mingze, et al.
Published: (2025)

Weakly Supervised 3D Open-vocabulary Segmentation
by: Liu, Kunhao, et al.
Published: (2023)

GOI: Find 3D Gaussians of Interest with an Optimizable Open-vocabulary Semantic-space Hyperplane
by: Qu, Yansong, et al.
Published: (2024)

UrbanVerse: Scaling Urban Simulation by Watching City-Tour Videos
by: Liu, Mingxuan, et al.
Published: (2025)

LPOSS: Label Propagation Over Patches and Pixels for Open-vocabulary Semantic Segmentation
by: Stojnić, Vladan, et al.
Published: (2025)

Weatherproofing Retrieval for Localization with Generative AI and Geometric Consistency
by: Kalantidis, Yannis, et al.
Published: (2024)

FlowOVD: Learning Generative Latent Flows for Zero-shot Open-vocabulary Detection
by: Wei, Yao, et al.
Published: (2026)

Open-vocabulary object 6D pose estimation
by: Corsetti, Jaime, et al.
Published: (2023)