:: Library Catalog

Bibliographic Details
Main Authors:	Linok, Sergey; Zemskova, Tatiana; Ladanova, Svetlana; Titkov, Roman; Yudin, Dmitry; Monastyrny, Maxim; Valenkov, Aleksei
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2406.07113

Similar Items

OVSegDT: Segmenting Transformer for Open-Vocabulary Object Goal Navigation
by: Zemskova, Tatiana, et al.
Published: (2025)

Open-Vocabulary Indoor Object Grounding with 3D Hierarchical Scene Graph
by: Linok, Sergey, et al.
Published: (2025)

3DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D Scene Understanding
by: Zemskova, Tatiana, et al.
Published: (2024)

DyGEnc: Encoding a Sequence of Textual Scene Graphs to Reason and Answer Questions in Dynamic Scenes
by: Linok, Sergey, et al.
Published: (2025)

FocusGraph: Graph-Structured Frame Selection for Embodied Long Video Question Answering
by: Zemskova, Tatiana, et al.
Published: (2026)

LEG-SLAM: Real-Time Language-Enhanced Gaussian Splatting for SLAM
by: Titkov, Roman, et al.
Published: (2025)

The Bare Necessities: Designing Simple, Effective Open-Vocabulary Scene Graphs
by: Kassab, Christina, et al.
Published: (2024)

M3DMap: Object-aware Multimodal 3D Mapping for Dynamic Environments
by: Yudin, Dmitry
Published: (2025)

SceneGraphVLM: Dynamic Scene Graph Generation from Video with Vision-Language Models
by: Makarov, Vladislav, et al.
Published: (2026)

RCDINO: Enhancing Radar-Camera 3D Object Detection with DINOv2 Semantic Features
by: Matykina, Olga, et al.
Published: (2025)

Hierarchical Open-Vocabulary 3D Scene Graphs for Language-Grounded Robot Navigation
by: Werby, Abdelrhman, et al.
Published: (2024)

SGN-CIRL: Scene Graph-based Navigation with Curriculum, Imitation, and Reinforcement Learning
by: Oskolkov, Nikita, et al.
Published: (2025)

Open-Vocabulary Octree-Graph for 3D Scene Understanding
by: Wang, Zhigang, et al.
Published: (2024)

Talk2SAM: Text-Guided Semantic Enhancement for Complex-Shaped Object Segmentation
by: Vetoshkin, Luka, et al.
Published: (2025)

Open3DSG: Open-Vocabulary 3D Scene Graphs from Point Clouds with Queryable Objects and Open-Set Relationships
by: Koch, Sebastian, et al.
Published: (2024)

Collaborative Dynamic 3D Scene Graphs for Open-Vocabulary Urban Scene Understanding
by: Steinke, Tim, et al.
Published: (2025)

Query3D: LLM-Powered Open-Vocabulary Scene Segmentation with Language Embedded 3D Gaussian
by: Chahe, Amirhosein, et al.
Published: (2024)

OGScene3D: Incremental Open-Vocabulary 3D Gaussian Scene Graph Mapping for Scene Understanding
by: Zhu, Siting, et al.
Published: (2026)

OpenObject-NAV: Open-Vocabulary Object-Oriented Navigation Based on Dynamic Carrier-Relationship Scene Graph
by: Tang, Yujie, et al.
Published: (2024)

Open-Vocabulary SAM3D: Towards Training-free Open-Vocabulary 3D Scene Understanding
by: Tai, Hanchen, et al.
Published: (2024)

Retrieving Objects from 3D Scenes with Box-Guided Open-Vocabulary Instance Segmentation
by: Nguyen, Khanh, et al.
Published: (2025)

Beyond Averages: Open-Vocabulary 3D Scene Understanding with Gaussian Splatting and Bag of Embeddings
by: Arafa, Abdalla, et al.
Published: (2025)

Hierarchical and Holistic Open-Vocabulary Functional 3D Scene Graphs for Indoor Spaces
by: Hu, Xinggang, et al.
Published: (2026)

Open-Vocabulary Functional 3D Scene Graphs for Real-World Indoor Spaces
by: Zhang, Chenyangguang, et al.
Published: (2025)

GREAT: Geometry-Intention Collaborative Inference for Open-Vocabulary 3D Object Affordance Grounding
by: Shao, Yawen, et al.
Published: (2024)

Open Vocabulary Monocular 3D Object Detection
by: Yao, Jin, et al.
Published: (2024)

On the Potential of Open-Vocabulary Models for Object Detection in Unusual Street Scenes
by: Ilyas, Sadia, et al.
Published: (2024)

Taking A Closer Look at Interacting Objects: Interaction-Aware Open Vocabulary Scene Graph Generation
by: Li, Lin, et al.
Published: (2025)

GHOST: Grounded Human Motion Generation with Open Vocabulary Scene-and-Text Contexts
by: Milacski, Zoltán Á., et al.
Published: (2024)

Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding
by: Li, Ruihuang, et al.
Published: (2024)

Open-Vocabulary Functional 3D Human-Scene Interaction Generation
by: Liu, Jie, et al.
Published: (2026)

Dynamic Open-Vocabulary 3D Scene Graphs for Long-term Language-Guided Mobile Manipulation
by: Yan, Zhijie, et al.
Published: (2024)

RelWitness: Open-Vocabulary 3D Scene Graph Generation with Visual-Geometric Relation Witnesses
by: Nguyen, Minh Anh, et al.
Published: (2026)

SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding
by: Li, Rong, et al.
Published: (2024)

OpenScan: A Benchmark for Generalized Open-Vocabulary 3D Scene Understanding
by: Zhao, Youjun, et al.
Published: (2024)

OpenOcc: Open Vocabulary 3D Scene Reconstruction via Occupancy Representation
by: Jiang, Haochen, et al.
Published: (2024)

LOST-3DSG: Lightweight Open-Vocabulary 3D Scene Graphs with Semantic Tracking in Dynamic Environments
by: Ferraina, Sara Micol, et al.
Published: (2026)

OFMPNet: Deep End-to-End Model for Occupancy and Flow Prediction in Urban Environment
by: Murhij, Youshaa, et al.
Published: (2024)

SceneAssistant: A Visual Feedback Agent for Open-Vocabulary 3D Scene Generation
by: Luo, Jun, et al.
Published: (2026)

State and Scene Enhanced Prototypes for Weakly Supervised Open-Vocabulary Object Detection
by: Zhou, Jiaying, et al.
Published: (2025)