Saved in:
| Main Authors: | Liu, Mingxuan, Hayes, Tyler L., Ricci, Elisa, Csurka, Gabriela, Volpi, Riccardo |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2405.10053 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Test-time Vocabulary Adaptation for Language-driven Object Detection
by: Liu, Mingxuan, et al.
Published: (2025)
by: Liu, Mingxuan, et al.
Published: (2025)
What could go wrong? Discovering and describing failure modes in computer vision
by: Csurka, Gabriela, et al.
Published: (2024)
by: Csurka, Gabriela, et al.
Published: (2024)
Compositional Caching for Training-free Open-vocabulary Attribute Detection
by: Garosi, Marco, et al.
Published: (2025)
by: Garosi, Marco, et al.
Published: (2025)
Can we make NeRF-based visual localization privacy-preserving?
by: Pietrantoni, Maxime, et al.
Published: (2025)
by: Pietrantoni, Maxime, et al.
Published: (2025)
PANDAS: Prototype-based Novel Class Discovery and Detection
by: Hayes, Tyler L., et al.
Published: (2024)
by: Hayes, Tyler L., et al.
Published: (2024)
Superpowering Open-Vocabulary Object Detectors for X-ray Vision
by: Garcia-Fernandez, Pablo, et al.
Published: (2025)
by: Garcia-Fernandez, Pablo, et al.
Published: (2025)
Gaussian Splatting Feature Fields for Privacy-Preserving Visual Localization
by: Pietrantoni, Maxime, et al.
Published: (2025)
by: Pietrantoni, Maxime, et al.
Published: (2025)
Semantic-CD: Remote Sensing Image Semantic Change Detection towards Open-vocabulary Setting
by: Zhu, Yongshuo, et al.
Published: (2025)
by: Zhu, Yongshuo, et al.
Published: (2025)
OV-NeRF: Open-vocabulary Neural Radiance Fields with Vision and Language Foundation Models for 3D Semantic Understanding
by: Liao, Guibiao, et al.
Published: (2024)
by: Liao, Guibiao, et al.
Published: (2024)
Dynamic Scoring with Enhanced Semantics for Training-Free Human-Object Interaction Detection
by: Tonini, Francesco, et al.
Published: (2025)
by: Tonini, Francesco, et al.
Published: (2025)
DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection
by: Yao, Lewei, et al.
Published: (2024)
by: Yao, Lewei, et al.
Published: (2024)
Incremental Object-Based Novelty Detection with Feedback Loop
by: Caldarella, Simone, et al.
Published: (2023)
by: Caldarella, Simone, et al.
Published: (2023)
Self-supervised Learning of Neural Implicit Feature Fields for Camera Pose Refinement
by: Pietrantoni, Maxime, et al.
Published: (2024)
by: Pietrantoni, Maxime, et al.
Published: (2024)
Training-Free Semantic Multi-Object Tracking with Vision-Language Models
by: Bonat, Laurence, et al.
Published: (2026)
by: Bonat, Laurence, et al.
Published: (2026)
IGLOSS: Image Generation for Lidar Open-vocabulary Semantic Segmentation
by: Samet, Nermin, et al.
Published: (2026)
by: Samet, Nermin, et al.
Published: (2026)
Open-RGBT: Open-vocabulary RGB-T Zero-shot Semantic Segmentation in Open-world Environments
by: Yu, Meng, et al.
Published: (2024)
by: Yu, Meng, et al.
Published: (2024)
FLOSS: Free Lunch in Open-vocabulary Semantic Segmentation
by: Benigmim, Yasser, et al.
Published: (2025)
by: Benigmim, Yasser, et al.
Published: (2025)
DiSCO-3D : Discovering and segmenting Sub-Concepts from Open-vocabulary queries in NeRF
by: Petit, Doriand, et al.
Published: (2025)
by: Petit, Doriand, et al.
Published: (2025)
3D Object Detection from Images for Autonomous Driving: A Survey
by: Ma, Xinzhu, et al.
Published: (2022)
by: Ma, Xinzhu, et al.
Published: (2022)
OpenVIS: Open-vocabulary Video Instance Segmentation
by: Guo, Pinxue, et al.
Published: (2023)
by: Guo, Pinxue, et al.
Published: (2023)
ARM: A Learnable, Plug-and-Play Module for CLIP-based Open-vocabulary Semantic Segmentation
by: Liu, Ziquan, et al.
Published: (2025)
by: Liu, Ziquan, et al.
Published: (2025)
Semantic Consistent Language Gaussian Splatting for Point-Level Open-vocabulary Querying
by: Yin, Hairong, et al.
Published: (2025)
by: Yin, Hairong, et al.
Published: (2025)
Parameter-efficient Fine-tuning in Hyperspherical Space for Open-vocabulary Semantic Segmentation
by: Peng, Zelin, et al.
Published: (2024)
by: Peng, Zelin, et al.
Published: (2024)
Open-Vocabulary Object Detection via Language Hierarchy
by: Huang, Jiaxing, et al.
Published: (2024)
by: Huang, Jiaxing, et al.
Published: (2024)
Multi-Grained Cross-modal Alignment for Learning Open-vocabulary Semantic Segmentation from Text Supervision
by: Liu, Yajie, et al.
Published: (2024)
by: Liu, Yajie, et al.
Published: (2024)
Open-vocabulary vs. Closed-set: Best Practice for Few-shot Object Detection Considering Text Describability
by: Hosoya, Yusuke, et al.
Published: (2024)
by: Hosoya, Yusuke, et al.
Published: (2024)
METOR: A Unified Framework for Mutual Enhancement of Objects and Relationships in Open-vocabulary Video Visual Relationship Detection
by: Wang, Yongqi, et al.
Published: (2025)
by: Wang, Yongqi, et al.
Published: (2025)
From Semantics, Scene to Instance-awareness: Distilling Foundation Model for Grounded Open-vocabulary Situation Recognition
by: Cai, Chen, et al.
Published: (2025)
by: Cai, Chen, et al.
Published: (2025)
Placing Objects in Context via Inpainting for Out-of-distribution Segmentation
by: de Jorge, Pau, et al.
Published: (2024)
by: de Jorge, Pau, et al.
Published: (2024)
Democratizing Fine-grained Visual Recognition with Large Language Models
by: Liu, Mingxuan, et al.
Published: (2024)
by: Liu, Mingxuan, et al.
Published: (2024)
Organizing Unstructured Image Collections using Natural Language
by: Liu, Mingxuan, et al.
Published: (2024)
by: Liu, Mingxuan, et al.
Published: (2024)
Large-scale Pre-trained Models are Surprisingly Strong in Incremental Novel Class Discovery
by: Liu, Mingxuan, et al.
Published: (2023)
by: Liu, Mingxuan, et al.
Published: (2023)
OpenTie: Open-vocabulary Sequential Rebar Tying System
by: Liu, Mingze, et al.
Published: (2025)
by: Liu, Mingze, et al.
Published: (2025)
Weakly Supervised 3D Open-vocabulary Segmentation
by: Liu, Kunhao, et al.
Published: (2023)
by: Liu, Kunhao, et al.
Published: (2023)
GOI: Find 3D Gaussians of Interest with an Optimizable Open-vocabulary Semantic-space Hyperplane
by: Qu, Yansong, et al.
Published: (2024)
by: Qu, Yansong, et al.
Published: (2024)
UrbanVerse: Scaling Urban Simulation by Watching City-Tour Videos
by: Liu, Mingxuan, et al.
Published: (2025)
by: Liu, Mingxuan, et al.
Published: (2025)
LPOSS: Label Propagation Over Patches and Pixels for Open-vocabulary Semantic Segmentation
by: Stojnić, Vladan, et al.
Published: (2025)
by: Stojnić, Vladan, et al.
Published: (2025)
Weatherproofing Retrieval for Localization with Generative AI and Geometric Consistency
by: Kalantidis, Yannis, et al.
Published: (2024)
by: Kalantidis, Yannis, et al.
Published: (2024)
FlowOVD: Learning Generative Latent Flows for Zero-shot Open-vocabulary Detection
by: Wei, Yao, et al.
Published: (2026)
by: Wei, Yao, et al.
Published: (2026)
Open-vocabulary object 6D pose estimation
by: Corsetti, Jaime, et al.
Published: (2023)
by: Corsetti, Jaime, et al.
Published: (2023)
Similar Items
-
Test-time Vocabulary Adaptation for Language-driven Object Detection
by: Liu, Mingxuan, et al.
Published: (2025) -
What could go wrong? Discovering and describing failure modes in computer vision
by: Csurka, Gabriela, et al.
Published: (2024) -
Compositional Caching for Training-free Open-vocabulary Attribute Detection
by: Garosi, Marco, et al.
Published: (2025) -
Can we make NeRF-based visual localization privacy-preserving?
by: Pietrantoni, Maxime, et al.
Published: (2025) -
PANDAS: Prototype-based Novel Class Discovery and Detection
by: Hayes, Tyler L., et al.
Published: (2024)