:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Mirzaei, Mohamad Amin, Amoie, Pantea, Ekhterachian, Ali, Mirzababaei, Matin, Khalaj, Babak
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence
Online Access:	https://arxiv.org/abs/2509.24528
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Relation-aware Hierarchical Prompt for Open-vocabulary Scene Graph Generation
by: Liu, Tao, et al.
Published: (2024)

Open-vocabulary Mobile Manipulation in Unseen Dynamic Environments with 3D Semantic Maps
by: Qiu, Dicong, et al.
Published: (2024)

Track, Inpaint, Resplat: Subject-driven 3D and 4D Generation with Progressive Texture Infilling
by: Zheng, Shuhong, et al.
Published: (2025)

OmniView: An All-Seeing Diffusion Model for 3D and 4D View Synthesis
by: Fan, Xiang, et al.
Published: (2025)

FMGS: Foundation Model Embedded 3D Gaussian Splatting for Holistic 3D Scene Understanding
by: Zuo, Xingxing, et al.
Published: (2024)

Bridge 2D-3D: Uncertainty-aware Hierarchical Registration Network with Domain Alignment
by: Cheng, Zhixin, et al.
Published: (2025)

Cross-Axis Transformer with 3D Rotary Positional Embeddings
by: Erickson, Lily
Published: (2023)

XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation
by: Wang, Ziyi, et al.
Published: (2024)

MALLVI: A Multi-Agent Framework for Integrated Generalized Robotics Manipulation
by: Taji, Mehrshad, et al.
Published: (2026)

CA-W3D: Leveraging Context-Aware Knowledge for Weakly Supervised Monocular 3D Detection
by: Liu, Chupeng, et al.
Published: (2025)

ManipDreamer3D : Synthesizing Plausible Robotic Manipulation Video with Occupancy-aware 3D Trajectory
by: Li, Ying, et al.
Published: (2025)

ContextGS: Compact 3D Gaussian Splatting with Anchor Level Context Model
by: Wang, Yufei, et al.
Published: (2024)

Hyperbolic Contrastive Learning for Hierarchical 3D Point Cloud Embedding
by: Liu, Yingjie, et al.
Published: (2025)

Watch Video, Catch Keyword: Context-aware Keyword Attention for Moment Retrieval and Highlight Detection
by: Um, Sung Jin, et al.
Published: (2025)

OE3DIS: Open-Ended 3D Point Cloud Instance Segmentation
by: Nguyen, Phuc D. A., et al.
Published: (2024)

OpenSUN3D: 1st Workshop Challenge on Open-Vocabulary 3D Scene Understanding
by: Engelmann, Francis, et al.
Published: (2024)

OpenGround: Active Cognition-based Reasoning for Open-World 3D Visual Grounding
by: Huang, Wenyuan, et al.
Published: (2025)

GeoSAM-3D: Geodesic Prompt Propagation for Open-Vocabulary 3D Scene Segmentation from Monocular Video
by: Sharma, Arun
Published: (2026)

Semantic Compression of 3D Objects for Open and Collaborative Virtual Worlds
by: Dotzel, Jordan, et al.
Published: (2025)

Open-Vocabulary Functional 3D Human-Scene Interaction Generation
by: Liu, Jie, et al.
Published: (2026)

Thickness-aware E(3)-Equivariant 3D Mesh Neural Networks
by: Kim, Sungwon, et al.
Published: (2025)

3D Occupancy Prediction with Low-Resolution Queries via Prototype-aware View Transformation
by: Oh, Gyeongrok, et al.
Published: (2025)

MimicParts: Part-aware Style Injection for Speech-Driven 3D Motion Generation
by: Liu, Lianlian, et al.
Published: (2025)

Depth-aware Fusion Method based on Image and 4D Radar Spectrum for 3D Object Detection
by: Sun, Yue, et al.
Published: (2025)

Weakly Supervised 3D Open-vocabulary Segmentation
by: Liu, Kunhao, et al.
Published: (2023)

Can Shape-Infused Joint Embeddings Improve Image-Conditioned 3D Diffusion?
by: Sbrolli, Cristian, et al.
Published: (2024)

SpatialForge: Bootstrapping 3D-Aware Spatial Reasoning from Open-World 2D Images
by: Liu, Zishan, et al.
Published: (2026)

Ilov3Splat: Instance-Level Open-Vocabulary 3D Scene Understanding in Gaussian Splatting
by: Nguyen, Binh Long, et al.
Published: (2026)

Self Pre-training with Topology- and Spatiality-aware Masked Autoencoders for 3D Medical Image Segmentation
by: Gu, Pengfei, et al.
Published: (2024)

OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation
by: Huang, Zhening, et al.
Published: (2023)

Large-Scale 3D Medical Image Pre-training with Geometric Context Priors
by: Wu, Linshan, et al.
Published: (2024)

3D Cloud reconstruction through geospatially-aware Masked Autoencoders
by: Girtsou, Stella, et al.
Published: (2025)

Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control
by: Gu, Zekai, et al.
Published: (2025)

Beyond Bare Queries: Open-Vocabulary Object Grounding with 3D Scene Graph
by: Linok, Sergey, et al.
Published: (2024)

GeoGuide: Hierarchical Geometric Guidance for Open-Vocabulary 3D Semantic Segmentation
by: Tao, Xujing, et al.
Published: (2026)

Find n' Propagate: Open-Vocabulary 3D Object Detection in Urban Environments
by: Etchegaray, Djamahl, et al.
Published: (2024)

Towards Camera Open-set 3D Object Detection for Autonomous Driving Scenarios
by: He, Zhuolin, et al.
Published: (2024)

Multi-agent Long-term 3D Human Pose Forecasting via Interaction-aware Trajectory Conditioning
by: Jeong, Jaewoo, et al.
Published: (2024)

MovieCORE: COgnitive REasoning in Movies
by: Faure, Gueter Josmy, et al.
Published: (2025)

MetaFind: Scene-Aware 3D Asset Retrieval for Coherent Metaverse Scene Generation
by: Pan, Zhenyu, et al.
Published: (2025)