Library Catalog

Bibliographic Details
Main Authors:	Huang, Yu; Peng, Zelin; Wen, Changsong; Yang, Xiaokang; Shen, Wei
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2510.08316

Similar Items

MedSeg-R: Reasoning Segmentation in Medical Images with Multimodal Large Language Models
by: Huang, Yu, et al.
Published: (2025)

Learning 2D Invariant Affordance Knowledge for 3D Affordance Grounding
by: Gao, Xianqiang, et al.
Published: (2024)

Tackling View-Dependent Semantics in 3D Language Gaussian Splatting
by: Cen, Jiazhong, et al.
Published: (2025)

HyperET: Efficient Training in Hyperbolic Space for Multi-modal Large Language Models
by: Peng, Zelin, et al.
Published: (2025)

Tendency-driven Mutual Exclusivity for Weakly Supervised Incremental Semantic Segmentation
by: Si, Chongjie, et al.
Published: (2024)

Parameter-efficient Fine-tuning in Hyperspherical Space for Open-vocabulary Semantic Segmentation
by: Peng, Zelin, et al.
Published: (2024)

NEARL-CLIP: Interacted Query Adaptation with Orthogonal Regularization for Medical Vision-Language Understanding
by: Peng, Zelin, et al.
Published: (2025)

AffordanceSAM: Segment Anything Once More in Affordance Grounding
by: Jiang, Dengyang, et al.
Published: (2025)

3DAffordSplat: Efficient Affordance Reasoning with 3D Gaussians
by: Wei, Zeming, et al.
Published: (2025)

Gradient-Driven 3D Segmentation and Affordance Transfer in Gaussian Splatting Using 2D Masks
by: Joseph, Joji, et al.
Published: (2024)

FOLK: Fast Open-Vocabulary 3D Instance Segmentation via Label-guided Knowledge Distillation
by: Wu, Hongrui, et al.
Published: (2025)

HumanCrafter: Synergizing Generalizable Human Reconstruction and Semantic 3D Segmentation
by: Pan, Panwang, et al.
Published: (2025)

TCATSeg: A Tooth Center-Wise Attention Network for 3D Dental Model Semantic Segmentation
by: He, Qiang, et al.
Published: (2026)

A Unified Framework with Multimodal Fine-tuning for Remote Sensing Semantic Segmentation
by: Ma, Xianping, et al.
Published: (2024)

Task-Aware 3D Affordance Segmentation via 2D Guidance and Geometric Refinement
by: He, Lian, et al.
Published: (2025)

Adaptive Margin Contrastive Learning for Ambiguity-aware 3D Semantic Segmentation
by: Chen, Yang, et al.
Published: (2025)

Grounding 3D Scene Affordance From Egocentric Interactions
by: Liu, Cuiyu, et al.
Published: (2024)

RS3Mamba: Visual State Space Model for Remote Sensing Images Semantic Segmentation
by: Ma, Xianping, et al.
Published: (2024)

MMRad-22K: A Structured Multimodal Evidence Dataset for Chest X-ray Report Generation
by: Zhao, Yichen, et al.
Published: (2026)

MFSeg: Efficient Multi-frame 3D Semantic Segmentation
by: Huang, Chengjie, et al.
Published: (2025)

GEAL: Generalizable 3D Affordance Learning with Cross-Modal Consistency
by: Lu, Dongyue, et al.
Published: (2024)

PGOV3D: Open-Vocabulary 3D Semantic Segmentation with Partial-to-Global Curriculum
by: Zhang, Shiqi, et al.
Published: (2025)

EGFormer: Towards Efficient and Generalizable Multimodal Semantic Segmentation
by: Zhang, Zelin, et al.
Published: (2025)

Segment Any 3D Gaussians
by: Cen, Jiazhong, et al.
Published: (2023)

3D-AffordanceLLM: Harnessing Large Language Models for Open-Vocabulary Affordance Detection in 3D Worlds
by: Chu, Hengshuo, et al.
Published: (2025)

CutS3D: Cutting Semantics in 3D for 2D Unsupervised Instance Segmentation
by: Sick, Leon, et al.
Published: (2024)

Parameter Efficient Fine-tuning via Cross Block Orchestration for Segment Anything Model
by: Peng, Zelin, et al.
Published: (2023)

Part-Aware Open-Vocabulary 3D Affordance Grounding via Prototypical Semantic and Geometric Alignment
by: Gou, Dongqiang, et al.
Published: (2026)

Affostruction: 3D Affordance Grounding with Generative Reconstruction
by: Park, Chunghyun, et al.
Published: (2026)

AdaCo: Overcoming Visual Foundation Model Noise in 3D Semantic Segmentation via Adaptive Label Correction
by: Zou, Pufan, et al.
Published: (2024)

Open-Vocabulary Remote Sensing Image Semantic Segmentation
by: Cao, Qinglong, et al.
Published: (2024)

VAGNet: Grounding 3D Affordance from Human-Object Interactions in Videos
by: Mao, Aihua, et al.
Published: (2026)

Domain Adaptation-Based Crossmodal Knowledge Distillation for 3D Semantic Segmentation
by: Kang, Jialiang, et al.
Published: (2025)

3x2: 3D Object Part Segmentation by 2D Semantic Correspondences
by: Thai, Anh, et al.
Published: (2024)

SPHERE: Semantic-PHysical Engaged REpresentation for 3D Semantic Scene Completion
by: Yang, Zhiwen, et al.
Published: (2025)

Grounding by Remembering: Cross-Scene and In-Scene Memory for 3D Functional Affordances
by: Wang, Qirui, et al.
Published: (2026)

The More You See in 2D, the More You Perceive in 3D
by: Han, Xinyang, et al.
Published: (2024)

Segment Anything in 3D with Radiance Fields
by: Cen, Jiazhong, et al.
Published: (2023)

Geospatial-Reasoning-Driven Vocabulary-Agnostic Remote Sensing Semantic Segmentation
by: Zhou, Chufeng, et al.
Published: (2026)

Affordance-Guided Diffusion Prior for 3D Hand Reconstruction
by: Suzuki, Naru, et al.
Published: (2025)