| Main Authors: | Mirzaei, Mohamad Amin, Amoie, Pantea, Ekhterachian, Ali, Mirzababaei, Matin, Khalaj, Babak |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.24528 |
Similar Items
Relation-aware Hierarchical Prompt for Open-vocabulary Scene Graph Generation
by: Liu, Tao, et al.
Published: (2024)
Open-vocabulary Mobile Manipulation in Unseen Dynamic Environments with 3D Semantic Maps
by: Qiu, Dicong, et al.
Published: (2024)
Track, Inpaint, Resplat: Subject-driven 3D and 4D Generation with Progressive Texture Infilling
by: Zheng, Shuhong, et al.
Published: (2025)
OmniView: An All-Seeing Diffusion Model for 3D and 4D View Synthesis
by: Fan, Xiang, et al.
Published: (2025)
FMGS: Foundation Model Embedded 3D Gaussian Splatting for Holistic 3D Scene Understanding
by: Zuo, Xingxing, et al.
Published: (2024)
Bridge 2D-3D: Uncertainty-aware Hierarchical Registration Network with Domain Alignment
by: Cheng, Zhixin, et al.
Published: (2025)
Cross-Axis Transformer with 3D Rotary Positional Embeddings
by: Erickson, Lily
Published: (2023)
XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation
by: Wang, Ziyi, et al.
Published: (2024)
MALLVI: A Multi-Agent Framework for Integrated Generalized Robotics Manipulation
by: Taji, Mehrshad, et al.
Published: (2026)
CA-W3D: Leveraging Context-Aware Knowledge for Weakly Supervised Monocular 3D Detection
by: Liu, Chupeng, et al.
Published: (2025)
ManipDreamer3D : Synthesizing Plausible Robotic Manipulation Video with Occupancy-aware 3D Trajectory
by: Li, Ying, et al.
Published: (2025)
ContextGS: Compact 3D Gaussian Splatting with Anchor Level Context Model
by: Wang, Yufei, et al.
Published: (2024)
Hyperbolic Contrastive Learning for Hierarchical 3D Point Cloud Embedding
by: Liu, Yingjie, et al.
Published: (2025)
Watch Video, Catch Keyword: Context-aware Keyword Attention for Moment Retrieval and Highlight Detection
by: Um, Sung Jin, et al.
Published: (2025)
OE3DIS: Open-Ended 3D Point Cloud Instance Segmentation
by: Nguyen, Phuc D. A., et al.
Published: (2024)
OpenSUN3D: 1st Workshop Challenge on Open-Vocabulary 3D Scene Understanding
by: Engelmann, Francis, et al.
Published: (2024)
OpenGround: Active Cognition-based Reasoning for Open-World 3D Visual Grounding
by: Huang, Wenyuan, et al.
Published: (2025)
GeoSAM-3D: Geodesic Prompt Propagation for Open-Vocabulary 3D Scene Segmentation from Monocular Video
by: Sharma, Arun
Published: (2026)
Semantic Compression of 3D Objects for Open and Collaborative Virtual Worlds
by: Dotzel, Jordan, et al.
Published: (2025)
Open-Vocabulary Functional 3D Human-Scene Interaction Generation
by: Liu, Jie, et al.
Published: (2026)
Thickness-aware E(3)-Equivariant 3D Mesh Neural Networks
by: Kim, Sungwon, et al.
Published: (2025)
3D Occupancy Prediction with Low-Resolution Queries via Prototype-aware View Transformation
by: Oh, Gyeongrok, et al.
Published: (2025)
MimicParts: Part-aware Style Injection for Speech-Driven 3D Motion Generation
by: Liu, Lianlian, et al.
Published: (2025)
Depth-aware Fusion Method based on Image and 4D Radar Spectrum for 3D Object Detection
by: Sun, Yue, et al.
Published: (2025)
Weakly Supervised 3D Open-vocabulary Segmentation
by: Liu, Kunhao, et al.
Published: (2023)
Can Shape-Infused Joint Embeddings Improve Image-Conditioned 3D Diffusion?
by: Sbrolli, Cristian, et al.
Published: (2024)
SpatialForge: Bootstrapping 3D-Aware Spatial Reasoning from Open-World 2D Images
by: Liu, Zishan, et al.
Published: (2026)
Ilov3Splat: Instance-Level Open-Vocabulary 3D Scene Understanding in Gaussian Splatting
by: Nguyen, Binh Long, et al.
Published: (2026)
Self Pre-training with Topology- and Spatiality-aware Masked Autoencoders for 3D Medical Image Segmentation
by: Gu, Pengfei, et al.
Published: (2024)
OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation
by: Huang, Zhening, et al.
Published: (2023)
Large-Scale 3D Medical Image Pre-training with Geometric Context Priors
by: Wu, Linshan, et al.
Published: (2024)
3D Cloud reconstruction through geospatially-aware Masked Autoencoders
by: Girtsou, Stella, et al.
Published: (2025)
Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control
by: Gu, Zekai, et al.
Published: (2025)
Beyond Bare Queries: Open-Vocabulary Object Grounding with 3D Scene Graph
by: Linok, Sergey, et al.
Published: (2024)
GeoGuide: Hierarchical Geometric Guidance for Open-Vocabulary 3D Semantic Segmentation
by: Tao, Xujing, et al.
Published: (2026)
Find n' Propagate: Open-Vocabulary 3D Object Detection in Urban Environments
by: Etchegaray, Djamahl, et al.
Published: (2024)
Towards Camera Open-set 3D Object Detection for Autonomous Driving Scenarios
by: He, Zhuolin, et al.
Published: (2024)
Multi-agent Long-term 3D Human Pose Forecasting via Interaction-aware Trajectory Conditioning
by: Jeong, Jaewoo, et al.
Published: (2024)
MovieCORE: COgnitive REasoning in Movies
by: Faure, Gueter Josmy, et al.
Published: (2025)
MetaFind: Scene-Aware 3D Asset Retrieval for Coherent Metaverse Scene Generation
by: Pan, Zhenyu, et al.
Published: (2025)