Saved in:
| Main Authors: | Yan, Chi, Xu, Dan |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.04759 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
3D Occupancy Prediction with Low-Resolution Queries via Prototype-aware View Transformation
by: Oh, Gyeongrok, et al.
Published: (2025)
by: Oh, Gyeongrok, et al.
Published: (2025)
Sampling Bag of Views for Open-Vocabulary Object Detection
by: Choi, Hojun, et al.
Published: (2024)
by: Choi, Hojun, et al.
Published: (2024)
GaussianFormer: Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction
by: Huang, Yuanhui, et al.
Published: (2024)
by: Huang, Yuanhui, et al.
Published: (2024)
Attention to Trajectory: Trajectory-Aware Open-Vocabulary Tracking
by: Li, Yunhao, et al.
Published: (2025)
by: Li, Yunhao, et al.
Published: (2025)
GraphGSOcc: Semantic-Geometric Graph Transformer with Dynamic-Static Decoupling for 3D Gaussian Splatting-based Occupancy Prediction
by: Song, Ke, et al.
Published: (2025)
by: Song, Ke, et al.
Published: (2025)
OV-DQUO: Open-Vocabulary DETR with Denoising Text Query Training and Open-World Unknown Objects Supervision
by: Wang, Junjie, et al.
Published: (2024)
by: Wang, Junjie, et al.
Published: (2024)
OG-Gaussian: Occupancy Based Street Gaussians for Autonomous Driving
by: Shen, Yedong, et al.
Published: (2025)
by: Shen, Yedong, et al.
Published: (2025)
GSRender: Deduplicated Occupancy Prediction via Weakly Supervised 3D Gaussian Splatting
by: Sun, Qianpu, et al.
Published: (2024)
by: Sun, Qianpu, et al.
Published: (2024)
SWA-SOP: Spatially-aware Window Attention for Semantic Occupancy Prediction in Autonomous Driving
by: Cao, Helin, et al.
Published: (2025)
by: Cao, Helin, et al.
Published: (2025)
GaussianWorld: Gaussian World Model for Streaming 3D Occupancy Prediction
by: Zuo, Sicheng, et al.
Published: (2024)
by: Zuo, Sicheng, et al.
Published: (2024)
SDGOCC: Semantic and Depth-Guided Bird's-Eye View Transformation for 3D Multimodal Occupancy Prediction
by: Duan, Zaipeng, et al.
Published: (2025)
by: Duan, Zaipeng, et al.
Published: (2025)
ExtrinSplat: Decoupling Geometry and Semantics for Open-Vocabulary Understanding in 3D Gaussian Splatting
by: Ding, Jiayu, et al.
Published: (2025)
by: Ding, Jiayu, et al.
Published: (2025)
GaussianFormer-2: Probabilistic Gaussian Superposition for Efficient 3D Occupancy Prediction
by: Huang, Yuanhui, et al.
Published: (2024)
by: Huang, Yuanhui, et al.
Published: (2024)
Towards Open Vocabulary Learning: A Survey
by: Wu, Jianzong, et al.
Published: (2023)
by: Wu, Jianzong, et al.
Published: (2023)
Mitigating Open-Vocabulary Caption Hallucinations
by: Ben-Kish, Assaf, et al.
Published: (2023)
by: Ben-Kish, Assaf, et al.
Published: (2023)
Ilov3Splat: Instance-Level Open-Vocabulary 3D Scene Understanding in Gaussian Splatting
by: Nguyen, Binh Long, et al.
Published: (2026)
by: Nguyen, Binh Long, et al.
Published: (2026)
Open-Vocabulary Functional 3D Human-Scene Interaction Generation
by: Liu, Jie, et al.
Published: (2026)
by: Liu, Jie, et al.
Published: (2026)
ManipDreamer3D : Synthesizing Plausible Robotic Manipulation Video with Occupancy-aware 3D Trajectory
by: Li, Ying, et al.
Published: (2025)
by: Li, Ying, et al.
Published: (2025)
CVT-Occ: Cost Volume Temporal Fusion for 3D Occupancy Prediction
by: Ye, Zhangchen, et al.
Published: (2024)
by: Ye, Zhangchen, et al.
Published: (2024)
OmniOVCD: Streamlining Open-Vocabulary Change Detection with SAM 3
by: Zhang, Xu, et al.
Published: (2026)
by: Zhang, Xu, et al.
Published: (2026)
Exploring Efficient Open-Vocabulary Segmentation in the Remote Sensing
by: Li, Bingyu, et al.
Published: (2025)
by: Li, Bingyu, et al.
Published: (2025)
SHOE: Semantic HOI Open-Vocabulary Evaluation Metric
by: Noack, Maja, et al.
Published: (2026)
by: Noack, Maja, et al.
Published: (2026)
Open-Vocabulary Segmentation with Unpaired Mask-Text Supervision
by: Wang, Zhaoqing, et al.
Published: (2024)
by: Wang, Zhaoqing, et al.
Published: (2024)
Open-Vocabulary Remote Sensing Image Semantic Segmentation
by: Cao, Qinglong, et al.
Published: (2024)
by: Cao, Qinglong, et al.
Published: (2024)
SD-OVON: A Semantics-aware Dataset and Benchmark Generation Pipeline for Open-Vocabulary Object Navigation in Dynamic Scenes
by: Qiu, Dicong, et al.
Published: (2025)
by: Qiu, Dicong, et al.
Published: (2025)
Latent Gaussian Splatting for 4D Panoptic Occupancy Tracking
by: Luz, Maximilian, et al.
Published: (2026)
by: Luz, Maximilian, et al.
Published: (2026)
Monocular Open Vocabulary Occupancy Prediction for Indoor Scenes
by: Zhou, Changqing, et al.
Published: (2026)
by: Zhou, Changqing, et al.
Published: (2026)
Generalized Decoupled Learning for Enhancing Open-Vocabulary Dense Perception
by: Wang, Junjie, et al.
Published: (2025)
by: Wang, Junjie, et al.
Published: (2025)
VOVTrack: Exploring the Potentiality in Videos for Open-Vocabulary Object Tracking
by: Qian, Zekun, et al.
Published: (2024)
by: Qian, Zekun, et al.
Published: (2024)
SANEval: Open-Vocabulary Compositional Benchmarks with Failure-mode Diagnosis
by: Pramanik, Rishav, et al.
Published: (2026)
by: Pramanik, Rishav, et al.
Published: (2026)
SOccDPT: Semi-Supervised 3D Semantic Occupancy from Dense Prediction Transformers trained under memory constraints
by: Ganesh, Aditya Nalgunda
Published: (2023)
by: Ganesh, Aditya Nalgunda
Published: (2023)
HomeRobot: Open-Vocabulary Mobile Manipulation
by: Yenamandra, Sriram, et al.
Published: (2023)
by: Yenamandra, Sriram, et al.
Published: (2023)
Semantic Causality-Aware Vision-Based 3D Occupancy Prediction
by: Chen, Dubing, et al.
Published: (2025)
by: Chen, Dubing, et al.
Published: (2025)
Social LSTM with Dynamic Occupancy Modeling for Realistic Pedestrian Trajectory Prediction
by: Alia, Ahmed, et al.
Published: (2025)
by: Alia, Ahmed, et al.
Published: (2025)
Seeking Consensus: Geometric-Semantic On-the-Fly Recalibration for Open-Vocabulary Remote Sensing Semantic Segmentation
by: Wang, Guanchun, et al.
Published: (2026)
by: Wang, Guanchun, et al.
Published: (2026)
From Open Vocabulary to Open World: Teaching Vision Language Models to Detect Novel Objects
by: Li, Zizhao, et al.
Published: (2024)
by: Li, Zizhao, et al.
Published: (2024)
Interpretable Open-Vocabulary Referring Object Detection with Reverse Contrast Attention
by: Juanico, Drandreb Earl O., et al.
Published: (2025)
by: Juanico, Drandreb Earl O., et al.
Published: (2025)
dinov3.seg: Open-Vocabulary Semantic Segmentation with DINOv3
by: Dutta, Saikat, et al.
Published: (2026)
by: Dutta, Saikat, et al.
Published: (2026)
Decomposed Vision-Language Alignment for Fine-Grained Open-Vocabulary Segmentation
by: Wang, Chenhao, et al.
Published: (2026)
by: Wang, Chenhao, et al.
Published: (2026)
Exploring Scalability of Self-Training for Open-Vocabulary Temporal Action Localization
by: Hyun, Jeongseok, et al.
Published: (2024)
by: Hyun, Jeongseok, et al.
Published: (2024)
Similar Items
-
3D Occupancy Prediction with Low-Resolution Queries via Prototype-aware View Transformation
by: Oh, Gyeongrok, et al.
Published: (2025) -
Sampling Bag of Views for Open-Vocabulary Object Detection
by: Choi, Hojun, et al.
Published: (2024) -
GaussianFormer: Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction
by: Huang, Yuanhui, et al.
Published: (2024) -
Attention to Trajectory: Trajectory-Aware Open-Vocabulary Tracking
by: Li, Yunhao, et al.
Published: (2025) -
GraphGSOcc: Semantic-Geometric Graph Transformer with Dynamic-Static Decoupling for 3D Gaussian Splatting-based Occupancy Prediction
by: Song, Ke, et al.
Published: (2025)