Saved in:
| Main Authors: | Linok, Sergey, Zemskova, Tatiana, Ladanova, Svetlana, Titkov, Roman, Yudin, Dmitry, Monastyrny, Maxim, Valenkov, Aleksei |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2406.07113 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
OVSegDT: Segmenting Transformer for Open-Vocabulary Object Goal Navigation
by: Zemskova, Tatiana, et al.
Published: (2025)
by: Zemskova, Tatiana, et al.
Published: (2025)
Open-Vocabulary Indoor Object Grounding with 3D Hierarchical Scene Graph
by: Linok, Sergey, et al.
Published: (2025)
by: Linok, Sergey, et al.
Published: (2025)
3DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D Scene Understanding
by: Zemskova, Tatiana, et al.
Published: (2024)
by: Zemskova, Tatiana, et al.
Published: (2024)
DyGEnc: Encoding a Sequence of Textual Scene Graphs to Reason and Answer Questions in Dynamic Scenes
by: Linok, Sergey, et al.
Published: (2025)
by: Linok, Sergey, et al.
Published: (2025)
FocusGraph: Graph-Structured Frame Selection for Embodied Long Video Question Answering
by: Zemskova, Tatiana, et al.
Published: (2026)
by: Zemskova, Tatiana, et al.
Published: (2026)
LEG-SLAM: Real-Time Language-Enhanced Gaussian Splatting for SLAM
by: Titkov, Roman, et al.
Published: (2025)
by: Titkov, Roman, et al.
Published: (2025)
The Bare Necessities: Designing Simple, Effective Open-Vocabulary Scene Graphs
by: Kassab, Christina, et al.
Published: (2024)
by: Kassab, Christina, et al.
Published: (2024)
M3DMap: Object-aware Multimodal 3D Mapping for Dynamic Environments
by: Yudin, Dmitry
Published: (2025)
by: Yudin, Dmitry
Published: (2025)
SceneGraphVLM: Dynamic Scene Graph Generation from Video with Vision-Language Models
by: Makarov, Vladislav, et al.
Published: (2026)
by: Makarov, Vladislav, et al.
Published: (2026)
RCDINO: Enhancing Radar-Camera 3D Object Detection with DINOv2 Semantic Features
by: Matykina, Olga, et al.
Published: (2025)
by: Matykina, Olga, et al.
Published: (2025)
Hierarchical Open-Vocabulary 3D Scene Graphs for Language-Grounded Robot Navigation
by: Werby, Abdelrhman, et al.
Published: (2024)
by: Werby, Abdelrhman, et al.
Published: (2024)
SGN-CIRL: Scene Graph-based Navigation with Curriculum, Imitation, and Reinforcement Learning
by: Oskolkov, Nikita, et al.
Published: (2025)
by: Oskolkov, Nikita, et al.
Published: (2025)
Open-Vocabulary Octree-Graph for 3D Scene Understanding
by: Wang, Zhigang, et al.
Published: (2024)
by: Wang, Zhigang, et al.
Published: (2024)
Talk2SAM: Text-Guided Semantic Enhancement for Complex-Shaped Object Segmentation
by: Vetoshkin, Luka, et al.
Published: (2025)
by: Vetoshkin, Luka, et al.
Published: (2025)
Open3DSG: Open-Vocabulary 3D Scene Graphs from Point Clouds with Queryable Objects and Open-Set Relationships
by: Koch, Sebastian, et al.
Published: (2024)
by: Koch, Sebastian, et al.
Published: (2024)
Collaborative Dynamic 3D Scene Graphs for Open-Vocabulary Urban Scene Understanding
by: Steinke, Tim, et al.
Published: (2025)
by: Steinke, Tim, et al.
Published: (2025)
Query3D: LLM-Powered Open-Vocabulary Scene Segmentation with Language Embedded 3D Gaussian
by: Chahe, Amirhosein, et al.
Published: (2024)
by: Chahe, Amirhosein, et al.
Published: (2024)
OGScene3D: Incremental Open-Vocabulary 3D Gaussian Scene Graph Mapping for Scene Understanding
by: Zhu, Siting, et al.
Published: (2026)
by: Zhu, Siting, et al.
Published: (2026)
OpenObject-NAV: Open-Vocabulary Object-Oriented Navigation Based on Dynamic Carrier-Relationship Scene Graph
by: Tang, Yujie, et al.
Published: (2024)
by: Tang, Yujie, et al.
Published: (2024)
Open-Vocabulary SAM3D: Towards Training-free Open-Vocabulary 3D Scene Understanding
by: Tai, Hanchen, et al.
Published: (2024)
by: Tai, Hanchen, et al.
Published: (2024)
Retrieving Objects from 3D Scenes with Box-Guided Open-Vocabulary Instance Segmentation
by: Nguyen, Khanh, et al.
Published: (2025)
by: Nguyen, Khanh, et al.
Published: (2025)
Beyond Averages: Open-Vocabulary 3D Scene Understanding with Gaussian Splatting and Bag of Embeddings
by: Arafa, Abdalla, et al.
Published: (2025)
by: Arafa, Abdalla, et al.
Published: (2025)
Hierarchical and Holistic Open-Vocabulary Functional 3D Scene Graphs for Indoor Spaces
by: Hu, Xinggang, et al.
Published: (2026)
by: Hu, Xinggang, et al.
Published: (2026)
Open-Vocabulary Functional 3D Scene Graphs for Real-World Indoor Spaces
by: Zhang, Chenyangguang, et al.
Published: (2025)
by: Zhang, Chenyangguang, et al.
Published: (2025)
GREAT: Geometry-Intention Collaborative Inference for Open-Vocabulary 3D Object Affordance Grounding
by: Shao, Yawen, et al.
Published: (2024)
by: Shao, Yawen, et al.
Published: (2024)
Open Vocabulary Monocular 3D Object Detection
by: Yao, Jin, et al.
Published: (2024)
by: Yao, Jin, et al.
Published: (2024)
On the Potential of Open-Vocabulary Models for Object Detection in Unusual Street Scenes
by: Ilyas, Sadia, et al.
Published: (2024)
by: Ilyas, Sadia, et al.
Published: (2024)
Taking A Closer Look at Interacting Objects: Interaction-Aware Open Vocabulary Scene Graph Generation
by: Li, Lin, et al.
Published: (2025)
by: Li, Lin, et al.
Published: (2025)
GHOST: Grounded Human Motion Generation with Open Vocabulary Scene-and-Text Contexts
by: Milacski, Zoltán Á., et al.
Published: (2024)
by: Milacski, Zoltán Á., et al.
Published: (2024)
Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding
by: Li, Ruihuang, et al.
Published: (2024)
by: Li, Ruihuang, et al.
Published: (2024)
Open-Vocabulary Functional 3D Human-Scene Interaction Generation
by: Liu, Jie, et al.
Published: (2026)
by: Liu, Jie, et al.
Published: (2026)
Dynamic Open-Vocabulary 3D Scene Graphs for Long-term Language-Guided Mobile Manipulation
by: Yan, Zhijie, et al.
Published: (2024)
by: Yan, Zhijie, et al.
Published: (2024)
RelWitness: Open-Vocabulary 3D Scene Graph Generation with Visual-Geometric Relation Witnesses
by: Nguyen, Minh Anh, et al.
Published: (2026)
by: Nguyen, Minh Anh, et al.
Published: (2026)
SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding
by: Li, Rong, et al.
Published: (2024)
by: Li, Rong, et al.
Published: (2024)
OpenScan: A Benchmark for Generalized Open-Vocabulary 3D Scene Understanding
by: Zhao, Youjun, et al.
Published: (2024)
by: Zhao, Youjun, et al.
Published: (2024)
OpenOcc: Open Vocabulary 3D Scene Reconstruction via Occupancy Representation
by: Jiang, Haochen, et al.
Published: (2024)
by: Jiang, Haochen, et al.
Published: (2024)
LOST-3DSG: Lightweight Open-Vocabulary 3D Scene Graphs with Semantic Tracking in Dynamic Environments
by: Ferraina, Sara Micol, et al.
Published: (2026)
by: Ferraina, Sara Micol, et al.
Published: (2026)
OFMPNet: Deep End-to-End Model for Occupancy and Flow Prediction in Urban Environment
by: Murhij, Youshaa, et al.
Published: (2024)
by: Murhij, Youshaa, et al.
Published: (2024)
SceneAssistant: A Visual Feedback Agent for Open-Vocabulary 3D Scene Generation
by: Luo, Jun, et al.
Published: (2026)
by: Luo, Jun, et al.
Published: (2026)
State and Scene Enhanced Prototypes for Weakly Supervised Open-Vocabulary Object Detection
by: Zhou, Jiaying, et al.
Published: (2025)
by: Zhou, Jiaying, et al.
Published: (2025)
Similar Items
-
OVSegDT: Segmenting Transformer for Open-Vocabulary Object Goal Navigation
by: Zemskova, Tatiana, et al.
Published: (2025) -
Open-Vocabulary Indoor Object Grounding with 3D Hierarchical Scene Graph
by: Linok, Sergey, et al.
Published: (2025) -
3DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D Scene Understanding
by: Zemskova, Tatiana, et al.
Published: (2024) -
DyGEnc: Encoding a Sequence of Textual Scene Graphs to Reason and Answer Questions in Dynamic Scenes
by: Linok, Sergey, et al.
Published: (2025) -
FocusGraph: Graph-Structured Frame Selection for Embodied Long Video Question Answering
by: Zemskova, Tatiana, et al.
Published: (2026)