:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Xiao, Zihao, Jing, Longlong, Wu, Shangxuan, Zhu, Alex Zihao, Ji, Jingwei, Jiang, Chiyu Max, Hung, Wei-Chih, Funkhouser, Thomas, Kuo, Weicheng, Angelova, Anelia, Zhou, Yin, Sheng, Shiwei
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2401.02402
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Region-centric Image-Language Pretraining for Open-Vocabulary Detection
by: Kim, Dahun, et al.
Published: (2023)

Context-Adaptive Multi-Prompt Embedding with Large Language Models for Vision-Language Alignment
by: Kim, Dahun, et al.
Published: (2025)

PanoGS: Gaussian-based Panoptic Segmentation for 3D Open Vocabulary Scene Understanding
by: Zhai, Hongjia, et al.
Published: (2025)

Cues3D: Unleashing the Power of Sole NeRF for Consistent and Unique Instances in Open-Vocabulary 3D Panoptic Segmentation
by: Xue, Feng, et al.
Published: (2025)

OnlinePG: Online Open-Vocabulary Panoptic Mapping with 3D Gaussian Splatting
by: Zhai, Hongjia, et al.
Published: (2026)

Open Vocabulary Panoptic Segmentation With Retrieval Augmentation
by: Sadeq, Nafis, et al.
Published: (2026)

COS3D: Collaborative Open-Vocabulary 3D Segmentation
by: Zhu, Runsong, et al.
Published: (2025)

Search3D: Hierarchical Open-Vocabulary 3D Segmentation
by: Takmaz, Ayca, et al.
Published: (2024)

Unlocking Multi-Spectral Data for Multi-Modal Models with Guided Inputs and Chain-of-Thought Reasoning
by: Kim, Dahun, et al.
Published: (2026)

EOV-Seg: Efficient Open-Vocabulary Panoptic Segmentation
by: Niu, Hongwei, et al.
Published: (2024)

FutureHuman3D: Forecasting Complex Long-Term 3D Human Behavior from Video Observations
by: Diller, Christian, et al.
Published: (2022)

3D Annotation-Free Learning by Distilling 2D Open-Vocabulary Segmentation Models for Autonomous Driving
by: Sun, Boyi, et al.
Published: (2024)

OpenDAS: Open-Vocabulary Domain Adaptation for 2D and 3D Segmentation
by: Yilmaz, Gonca, et al.
Published: (2024)

VideoComp: Advancing Fine-Grained Compositional and Temporal Alignment in Video-Text Models
by: Kim, Dahun, et al.
Published: (2025)

Mosaic3D: Foundation Dataset and Model for Open-Vocabulary 3D Segmentation
by: Lee, Junha, et al.
Published: (2025)

Multimodal Panoptic Segmentation of 3D Point Clouds
by: Dürr, Fabian
Published: (2023)

PartDistill: 3D Shape Part Segmentation by Vision-Language Model Distillation
by: Umam, Ardian, et al.
Published: (2023)

OpenTrack3D: Towards Accurate and Generalizable Open-Vocabulary 3D Instance Segmentation
by: Zhou, Zhishan, et al.
Published: (2025)

Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation
by: Boudjoghra, Mohamed El Amine, et al.
Published: (2024)

OpenSplat3D: Open-Vocabulary 3D Instance Segmentation using Gaussian Splatting
by: Piekenbrinck, Jens, et al.
Published: (2025)

Open3DIS: Open-Vocabulary 3D Instance Segmentation with 2D Mask Guidance
by: Nguyen, Phuc D. A., et al.
Published: (2023)

RAZER: Robust Accelerated Zero-Shot 3D Open-Vocabulary Panoptic Reconstruction with Spatio-Temporal Aggregation
by: Patel, Naman, et al.
Published: (2025)

OVGaussian: Generalizable 3D Gaussian Segmentation with Open Vocabularies
by: Chen, Runnan, et al.
Published: (2024)

Open-Vocabulary Semantic Part Segmentation of 3D Human
by: Suzuki, Keito, et al.
Published: (2025)

FOLK: Fast Open-Vocabulary 3D Instance Segmentation via Label-guided Knowledge Distillation
by: Wu, Hongrui, et al.
Published: (2025)

GeoPurify: A Data-Efficient Geometric Distillation Framework for Open-Vocabulary 3D Segmentation
by: Dou, Weijia, et al.
Published: (2025)

PGOV3D: Open-Vocabulary 3D Semantic Segmentation with Partial-to-Global Curriculum
by: Zhang, Shiqi, et al.
Published: (2025)

Reasoning3D -- Grounding and Reasoning in 3D: Fine-Grained Zero-Shot Open-Vocabulary 3D Reasoning Part Segmentation via Large Vision-Language Models
by: Chen, Tianrun, et al.
Published: (2024)

Global dynamics of Kato's solutions for the 3D incompressible micropolar system
by: Song, Zihao
Published: (2024)

Scalable 3D Panoptic Segmentation As Superpoint Graph Clustering
by: Robert, Damien, et al.
Published: (2024)

Time-Scaling State-Space Models for Dense Video Captioning
by: Piergiovanni, AJ, et al.
Published: (2025)

Open-Vocabulary Panoptic Segmentation Using BERT Pre-Training of Vision-Language Multiway Transformer Model
by: Chen, Yi-Chia, et al.
Published: (2024)

Mirasol3B: A Multimodal Autoregressive model for time-aligned and contextual modalities
by: Piergiovanni, AJ, et al.
Published: (2023)

XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation
by: Wang, Ziyi, et al.
Published: (2024)

Open-Vocabulary High-Resolution 3D (OVHR3D) Data Segmentation and Annotation Framework
by: Xu, Jiuyi, et al.
Published: (2024)

Rethinking Open-Vocabulary Segmentation of Radiance Fields in 3D Space
by: Lee, Hyunjee, et al.
Published: (2024)

Open-Vocabulary SAM3D: Towards Training-free Open-Vocabulary 3D Scene Understanding
by: Tai, Hanchen, et al.
Published: (2024)

Vocabulary-Free 3D Instance Segmentation with Vision and Language Assistant
by: Mei, Guofeng, et al.
Published: (2024)

Zero-Shot Dual-Path Integration Framework for Open-Vocabulary 3D Instance Segmentation
by: Ton, Tri, et al.
Published: (2024)

Mitigating Objectness Bias and Region-to-Text Misalignment for Open-Vocabulary Panoptic Segmentation
by: Kormushev, Nikolay, et al.
Published: (2026)