| Main Authors: | Xiao, Zihao; Jing, Longlong; Wu, Shangxuan; Zhu, Alex Zihao; Ji, Jingwei; Jiang, Chiyu Max; Hung, Wei-Chih; Funkhouser, Thomas; Kuo, Weicheng; Angelova, Anelia; Zhou, Yin; Sheng, Shiwei |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2401.02402 |
Similar Items
Region-centric Image-Language Pretraining for Open-Vocabulary Detection
by: Kim, Dahun, et al.
Published: (2023)
Context-Adaptive Multi-Prompt Embedding with Large Language Models for Vision-Language Alignment
by: Kim, Dahun, et al.
Published: (2025)
PanoGS: Gaussian-based Panoptic Segmentation for 3D Open Vocabulary Scene Understanding
by: Zhai, Hongjia, et al.
Published: (2025)
Cues3D: Unleashing the Power of Sole NeRF for Consistent and Unique Instances in Open-Vocabulary 3D Panoptic Segmentation
by: Xue, Feng, et al.
Published: (2025)
OnlinePG: Online Open-Vocabulary Panoptic Mapping with 3D Gaussian Splatting
by: Zhai, Hongjia, et al.
Published: (2026)
Open Vocabulary Panoptic Segmentation With Retrieval Augmentation
by: Sadeq, Nafis, et al.
Published: (2026)
COS3D: Collaborative Open-Vocabulary 3D Segmentation
by: Zhu, Runsong, et al.
Published: (2025)
Search3D: Hierarchical Open-Vocabulary 3D Segmentation
by: Takmaz, Ayca, et al.
Published: (2024)
Unlocking Multi-Spectral Data for Multi-Modal Models with Guided Inputs and Chain-of-Thought Reasoning
by: Kim, Dahun, et al.
Published: (2026)
EOV-Seg: Efficient Open-Vocabulary Panoptic Segmentation
by: Niu, Hongwei, et al.
Published: (2024)
FutureHuman3D: Forecasting Complex Long-Term 3D Human Behavior from Video Observations
by: Diller, Christian, et al.
Published: (2022)
3D Annotation-Free Learning by Distilling 2D Open-Vocabulary Segmentation Models for Autonomous Driving
by: Sun, Boyi, et al.
Published: (2024)
OpenDAS: Open-Vocabulary Domain Adaptation for 2D and 3D Segmentation
by: Yilmaz, Gonca, et al.
Published: (2024)
VideoComp: Advancing Fine-Grained Compositional and Temporal Alignment in Video-Text Models
by: Kim, Dahun, et al.
Published: (2025)
Mosaic3D: Foundation Dataset and Model for Open-Vocabulary 3D Segmentation
by: Lee, Junha, et al.
Published: (2025)
Multimodal Panoptic Segmentation of 3D Point Clouds
by: Dürr, Fabian
Published: (2023)
PartDistill: 3D Shape Part Segmentation by Vision-Language Model Distillation
by: Umam, Ardian, et al.
Published: (2023)
OpenTrack3D: Towards Accurate and Generalizable Open-Vocabulary 3D Instance Segmentation
by: Zhou, Zhishan, et al.
Published: (2025)
Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation
by: Boudjoghra, Mohamed El Amine, et al.
Published: (2024)
OpenSplat3D: Open-Vocabulary 3D Instance Segmentation using Gaussian Splatting
by: Piekenbrinck, Jens, et al.
Published: (2025)
Open3DIS: Open-Vocabulary 3D Instance Segmentation with 2D Mask Guidance
by: Nguyen, Phuc D. A., et al.
Published: (2023)
RAZER: Robust Accelerated Zero-Shot 3D Open-Vocabulary Panoptic Reconstruction with Spatio-Temporal Aggregation
by: Patel, Naman, et al.
Published: (2025)
OVGaussian: Generalizable 3D Gaussian Segmentation with Open Vocabularies
by: Chen, Runnan, et al.
Published: (2024)
Open-Vocabulary Semantic Part Segmentation of 3D Human
by: Suzuki, Keito, et al.
Published: (2025)
FOLK: Fast Open-Vocabulary 3D Instance Segmentation via Label-guided Knowledge Distillation
by: Wu, Hongrui, et al.
Published: (2025)
GeoPurify: A Data-Efficient Geometric Distillation Framework for Open-Vocabulary 3D Segmentation
by: Dou, Weijia, et al.
Published: (2025)
PGOV3D: Open-Vocabulary 3D Semantic Segmentation with Partial-to-Global Curriculum
by: Zhang, Shiqi, et al.
Published: (2025)
Reasoning3D -- Grounding and Reasoning in 3D: Fine-Grained Zero-Shot Open-Vocabulary 3D Reasoning Part Segmentation via Large Vision-Language Models
by: Chen, Tianrun, et al.
Published: (2024)
Global dynamics of Kato's solutions for the 3D incompressible micropolar system
by: Song, Zihao
Published: (2024)
Scalable 3D Panoptic Segmentation As Superpoint Graph Clustering
by: Robert, Damien, et al.
Published: (2024)
Time-Scaling State-Space Models for Dense Video Captioning
by: Piergiovanni, AJ, et al.
Published: (2025)
Open-Vocabulary Panoptic Segmentation Using BERT Pre-Training of Vision-Language Multiway Transformer Model
by: Chen, Yi-Chia, et al.
Published: (2024)
Mirasol3B: A Multimodal Autoregressive model for time-aligned and contextual modalities
by: Piergiovanni, AJ, et al.
Published: (2023)
XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation
by: Wang, Ziyi, et al.
Published: (2024)
Open-Vocabulary High-Resolution 3D (OVHR3D) Data Segmentation and Annotation Framework
by: Xu, Jiuyi, et al.
Published: (2024)
Rethinking Open-Vocabulary Segmentation of Radiance Fields in 3D Space
by: Lee, Hyunjee, et al.
Published: (2024)
Open-Vocabulary SAM3D: Towards Training-free Open-Vocabulary 3D Scene Understanding
by: Tai, Hanchen, et al.
Published: (2024)
Vocabulary-Free 3D Instance Segmentation with Vision and Language Assistant
by: Mei, Guofeng, et al.
Published: (2024)
Zero-Shot Dual-Path Integration Framework for Open-Vocabulary 3D Instance Segmentation
by: Ton, Tri, et al.
Published: (2024)
Mitigating Objectness Bias and Region-to-Text Misalignment for Open-Vocabulary Panoptic Segmentation
by: Kormushev, Nikolay, et al.
Published: (2026)