| Main Authors: | Yoshida, Mitsuki; Yamamoto, Ryogo; Iwata, Daiki; Tanaka, Kanji |
|---|---|
| Format: | Preprint |
| Published: | 2021 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2109.04569 |
Similar Items
Training Self-localization Models for Unseen Unfamiliar Places via Teacher-to-Student Data-Free Knowledge Transfer
by: Tsukahara, Kenta, et al.
Published: (2024)
CON: Continual Object Navigation via Data-Free Inter-Agent Knowledge Transfer in Unseen and Unfamiliar Places
by: Terashima, Kouki, et al.
Published: (2024)
Recursive Distillation for Open-Set Distributed Robot Localization
by: Tsukahara, Kenta, et al.
Published: (2023)
Dynamic-Dark SLAM: RGB-Thermal Cooperative Robot Vision Strategy for Multi-Person Tracking in Both Well-Lit and Low-Light Scenes
by: Sakai, Tatsuro, et al.
Published: (2025)
From Reactive to Map-Based AI: Tuned Local LLMs for Semantic Zone Inference in Object-Goal Navigation
by: Noda, Yudai, et al.
Published: (2026)
FlatVPR: Plug-and-play Geo-linear Residual Adapter for Geometric Rectification of Foundation Model Feature Manifolds
by: Hisada, Rai, et al.
Published: (2026)
DRIP: Discriminative Rotation-Invariant Pole Landmark Descriptor for 3D LiDAR Localization
by: Li, Dingrui, et al.
Published: (2024)
A Dual-Stream Transformer Architecture for Illumination-Invariant TIR-LiDAR Person Tracking
by: Minase, Yuki, et al.
Published: (2026)
Towards Open-Vocabulary Audio-Visual Event Localization
by: Zhou, Jinxing, et al.
Published: (2024)
Open-Vocabulary Action Localization with Iterative Visual Prompting
by: Wake, Naoki, et al.
Published: (2024)
LoGoSeg: Integrating Local and Global Features for Open-Vocabulary Semantic Segmentation
by: Chen, Junyang, et al.
Published: (2026)
Unsupervised Open-Vocabulary Object Localization in Videos
by: Fan, Ke, et al.
Published: (2023)
Exploring Scalability of Self-Training for Open-Vocabulary Temporal Action Localization
by: Hyun, Jeongseok, et al.
Published: (2024)
WoMAP: World Models For Embodied Open-Vocabulary Object Localization
by: Yin, Tenny, et al.
Published: (2025)
OpenVidVRD: Open-Vocabulary Video Visual Relation Detection via Prompt-Driven Semantic Space Alignment
by: Liu, Qi, et al.
Published: (2025)
OpenGaFF: Open-Vocabulary Gaussian Feature Field with Codebook Attention
by: Li, Kunyi, et al.
Published: (2026)
Zero-shot Degree of Ill-posedness Estimation for Active Small Object Change Detection
by: Takeda, Koji, et al.
Published: (2024)
Incorporating Feature Pyramid Tokenization and Open Vocabulary Semantic Segmentation
by: Zhang, Jianyu, et al.
Published: (2024)
Object-Centric Open-Vocabulary Image-Retrieval with Aggregated Features
by: Levi, Hila, et al.
Published: (2023)
RelWitness: Open-Vocabulary 3D Scene Graph Generation with Visual-Geometric Relation Witnesses
by: Nguyen, Minh Anh, et al.
Published: (2026)
Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection
by: Wang, Haoxuan, et al.
Published: (2024)
Taming Self-Training for Open-Vocabulary Object Detection
by: Zhao, Shiyu, et al.
Published: (2023)
YOLO-World: Real-Time Open-Vocabulary Object Detection
by: Cheng, Tianheng, et al.
Published: (2024)
ReasonGrounder: LVLM-Guided Hierarchical Feature Splatting for Open-Vocabulary 3D Visual Grounding and Reasoning
by: Liu, Zhenyang, et al.
Published: (2025)
Open-Vocabulary Temporal Action Localization using Multimodal Guidance
by: Gupta, Akshita, et al.
Published: (2024)
Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding
by: Yuan, Zhihao, et al.
Published: (2023)
In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation
by: Kang, Dahyun, et al.
Published: (2024)
Prototype-Aware Multimodal Alignment for Open-Vocabulary Visual Grounding
by: Xie, Jiangnan, et al.
Published: (2025)
Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation
by: Bai, Sule, et al.
Published: (2024)
HomeRobot: Open-Vocabulary Mobile Manipulation
by: Yenamandra, Sriram, et al.
Published: (2023)
Towards Open-World Mobile Manipulation in Homes: Lessons from the Neurips 2023 HomeRobot Open Vocabulary Mobile Manipulation Challenge
by: Yenamandra, Sriram, et al.
Published: (2024)
Pix2Key: Controllable Open-Vocabulary Retrieval with Semantic Decomposition and Self-Supervised Visual Dictionary Learning
by: Wei, Guoyizhe, et al.
Published: (2026)
Interaction-Centric Knowledge Infusion and Transfer for Open-Vocabulary Scene Graph Generation
by: Li, Lin, et al.
Published: (2025)
DART: Dual Adaptive Refinement Transfer for Open-Vocabulary Multi-Label Recognition
by: Liu, Haijing, et al.
Published: (2025)
Enhancing Quantum-ready QUBO-based Suppression for Object Detection with Appearance and Confidence Features
by: Yamamura, Keiichiro, et al.
Published: (2025)
FoodLogAthl-218: Constructing a Real-World Food Image Dataset Using Dietary Management Applications
by: Watanabe, Mitsuki, et al.
Published: (2025)
From Open-Vocabulary to Vocabulary-Free Semantic Segmentation
by: Reichard, Klara, et al.
Published: (2025)
Open-Vocabulary Object Detectors: Robustness Challenges under Distribution Shifts
by: Chhipa, Prakash Chandra, et al.
Published: (2024)
OpenSeg-R: Improving Open-Vocabulary Segmentation via Step-by-Step Visual Reasoning
by: Han, Zongyan, et al.
Published: (2025)
Objective, Absolute and Hue-aware Metrics for Intrinsic Image Decomposition on Real-World Scenes: A Proof of Concept
by: Sato, Shogo, et al.
Published: (2025)