:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Yoshida, Mitsuki, Yamamoto, Ryogo, Iwata, Daiki, Tanaka, Kanji
Format:	Preprint
Published:	2021
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2109.04569
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Training Self-localization Models for Unseen Unfamiliar Places via Teacher-to-Student Data-Free Knowledge Transfer
by: Tsukahara, Kenta, et al.
Published: (2024)

CON: Continual Object Navigation via Data-Free Inter-Agent Knowledge Transfer in Unseen and Unfamiliar Places
by: Terashima, Kouki, et al.
Published: (2024)

Recursive Distillation for Open-Set Distributed Robot Localization
by: Tsukahara, Kenta, et al.
Published: (2023)

Dynamic-Dark SLAM: RGB-Thermal Cooperative Robot Vision Strategy for Multi-Person Tracking in Both Well-Lit and Low-Light Scenes
by: Sakai, Tatsuro, et al.
Published: (2025)

From Reactive to Map-Based AI: Tuned Local LLMs for Semantic Zone Inference in Object-Goal Navigation
by: Noda, Yudai, et al.
Published: (2026)

FlatVPR: Plug-and-play Geo-linear Residual Adapter for Geometric Rectification of Foundation Model Feature Manifolds
by: Hisada, Rai, et al.
Published: (2026)

DRIP: Discriminative Rotation-Invariant Pole Landmark Descriptor for 3D LiDAR Localization
by: Li, Dingrui, et al.
Published: (2024)

A Dual-Stream Transformer Architecture for Illumination-Invariant TIR-LiDAR Person Tracking
by: Minase, Yuki, et al.
Published: (2026)

Towards Open-Vocabulary Audio-Visual Event Localization
by: Zhou, Jinxing, et al.
Published: (2024)

Open-Vocabulary Action Localization with Iterative Visual Prompting
by: Wake, Naoki, et al.
Published: (2024)

LoGoSeg: Integrating Local and Global Features for Open-Vocabulary Semantic Segmentation
by: Chen, Junyang, et al.
Published: (2026)

Unsupervised Open-Vocabulary Object Localization in Videos
by: Fan, Ke, et al.
Published: (2023)

Exploring Scalability of Self-Training for Open-Vocabulary Temporal Action Localization
by: Hyun, Jeongseok, et al.
Published: (2024)

WoMAP: World Models For Embodied Open-Vocabulary Object Localization
by: Yin, Tenny, et al.
Published: (2025)

OpenVidVRD: Open-Vocabulary Video Visual Relation Detection via Prompt-Driven Semantic Space Alignment
by: Liu, Qi, et al.
Published: (2025)

OpenGaFF: Open-Vocabulary Gaussian Feature Field with Codebook Attention
by: Li, Kunyi, et al.
Published: (2026)

Zero-shot Degree of Ill-posedness Estimation for Active Small Object Change Detection
by: Takeda, Koji, et al.
Published: (2024)

Incorporating Feature Pyramid Tokenization and Open Vocabulary Semantic Segmentation
by: Zhang, Jianyu, et al.
Published: (2024)

Object-Centric Open-Vocabulary Image-Retrieval with Aggregated Features
by: Levi, Hila, et al.
Published: (2023)

RelWitness: Open-Vocabulary 3D Scene Graph Generation with Visual-Geometric Relation Witnesses
by: Nguyen, Minh Anh, et al.
Published: (2026)

Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection
by: Wang, Haoxuan, et al.
Published: (2024)

Taming Self-Training for Open-Vocabulary Object Detection
by: Zhao, Shiyu, et al.
Published: (2023)

YOLO-World: Real-Time Open-Vocabulary Object Detection
by: Cheng, Tianheng, et al.
Published: (2024)

ReasonGrounder: LVLM-Guided Hierarchical Feature Splatting for Open-Vocabulary 3D Visual Grounding and Reasoning
by: Liu, Zhenyang, et al.
Published: (2025)

Open-Vocabulary Temporal Action Localization using Multimodal Guidance
by: Gupta, Akshita, et al.
Published: (2024)

Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding
by: Yuan, Zhihao, et al.
Published: (2023)

In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation
by: Kang, Dahyun, et al.
Published: (2024)

Prototype-Aware Multimodal Alignment for Open-Vocabulary Visual Grounding
by: Xie, Jiangnan, et al.
Published: (2025)

Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation
by: Bai, Sule, et al.
Published: (2024)

HomeRobot: Open-Vocabulary Mobile Manipulation
by: Yenamandra, Sriram, et al.
Published: (2023)

Towards Open-World Mobile Manipulation in Homes: Lessons from the Neurips 2023 HomeRobot Open Vocabulary Mobile Manipulation Challenge
by: Yenamandra, Sriram, et al.
Published: (2024)

Pix2Key: Controllable Open-Vocabulary Retrieval with Semantic Decomposition and Self-Supervised Visual Dictionary Learning
by: Wei, Guoyizhe, et al.
Published: (2026)

Interaction-Centric Knowledge Infusion and Transfer for Open-Vocabulary Scene Graph Generation
by: Li, Lin, et al.
Published: (2025)

DART: Dual Adaptive Refinement Transfer for Open-Vocabulary Multi-Label Recognition
by: Liu, Haijing, et al.
Published: (2025)

Enhancing Quantum-ready QUBO-based Suppression for Object Detection with Appearance and Confidence Features
by: Yamamura, Keiichiro, et al.
Published: (2025)

FoodLogAthl-218: Constructing a Real-World Food Image Dataset Using Dietary Management Applications
by: Watanabe, Mitsuki, et al.
Published: (2025)

From Open-Vocabulary to Vocabulary-Free Semantic Segmentation
by: Reichard, Klara, et al.
Published: (2025)

Open-Vocabulary Object Detectors: Robustness Challenges under Distribution Shifts
by: Chhipa, Prakash Chandra, et al.
Published: (2024)

OpenSeg-R: Improving Open-Vocabulary Segmentation via Step-by-Step Visual Reasoning
by: Han, Zongyan, et al.
Published: (2025)

Objective, Absolute and Hue-aware Metrics for Intrinsic Image Decomposition on Real-World Scenes: A Proof of Concept
by: Sato, Shogo, et al.
Published: (2025)