Saved in:
| Main Authors: | Huang, Yu, Peng, Zelin, Wen, Changsong, Yang, Xiaokang, Shen, Wei |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.08316 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
MedSeg-R: Reasoning Segmentation in Medical Images with Multimodal Large Language Models
by: Huang, Yu, et al.
Published: (2025)
by: Huang, Yu, et al.
Published: (2025)
Learning 2D Invariant Affordance Knowledge for 3D Affordance Grounding
by: Gao, Xianqiang, et al.
Published: (2024)
by: Gao, Xianqiang, et al.
Published: (2024)
Tackling View-Dependent Semantics in 3D Language Gaussian Splatting
by: Cen, Jiazhong, et al.
Published: (2025)
by: Cen, Jiazhong, et al.
Published: (2025)
HyperET: Efficient Training in Hyperbolic Space for Multi-modal Large Language Models
by: Peng, Zelin, et al.
Published: (2025)
by: Peng, Zelin, et al.
Published: (2025)
Tendency-driven Mutual Exclusivity for Weakly Supervised Incremental Semantic Segmentation
by: Si, Chongjie, et al.
Published: (2024)
by: Si, Chongjie, et al.
Published: (2024)
Parameter-efficient Fine-tuning in Hyperspherical Space for Open-vocabulary Semantic Segmentation
by: Peng, Zelin, et al.
Published: (2024)
by: Peng, Zelin, et al.
Published: (2024)
NEARL-CLIP: Interacted Query Adaptation with Orthogonal Regularization for Medical Vision-Language Understanding
by: Peng, Zelin, et al.
Published: (2025)
by: Peng, Zelin, et al.
Published: (2025)
AffordanceSAM: Segment Anything Once More in Affordance Grounding
by: Jiang, Dengyang, et al.
Published: (2025)
by: Jiang, Dengyang, et al.
Published: (2025)
3DAffordSplat: Efficient Affordance Reasoning with 3D Gaussians
by: Wei, Zeming, et al.
Published: (2025)
by: Wei, Zeming, et al.
Published: (2025)
Gradient-Driven 3D Segmentation and Affordance Transfer in Gaussian Splatting Using 2D Masks
by: Joseph, Joji, et al.
Published: (2024)
by: Joseph, Joji, et al.
Published: (2024)
FOLK: Fast Open-Vocabulary 3D Instance Segmentation via Label-guided Knowledge Distillation
by: Wu, Hongrui, et al.
Published: (2025)
by: Wu, Hongrui, et al.
Published: (2025)
HumanCrafter: Synergizing Generalizable Human Reconstruction and Semantic 3D Segmentation
by: Pan, Panwang, et al.
Published: (2025)
by: Pan, Panwang, et al.
Published: (2025)
TCATSeg: A Tooth Center-Wise Attention Network for 3D Dental Model Semantic Segmentation
by: He, Qiang, et al.
Published: (2026)
by: He, Qiang, et al.
Published: (2026)
A Unified Framework with Multimodal Fine-tuning for Remote Sensing Semantic Segmentation
by: Ma, Xianping, et al.
Published: (2024)
by: Ma, Xianping, et al.
Published: (2024)
Task-Aware 3D Affordance Segmentation via 2D Guidance and Geometric Refinement
by: He, Lian, et al.
Published: (2025)
by: He, Lian, et al.
Published: (2025)
Adaptive Margin Contrastive Learning for Ambiguity-aware 3D Semantic Segmentation
by: Chen, Yang, et al.
Published: (2025)
by: Chen, Yang, et al.
Published: (2025)
Grounding 3D Scene Affordance From Egocentric Interactions
by: Liu, Cuiyu, et al.
Published: (2024)
by: Liu, Cuiyu, et al.
Published: (2024)
RS3Mamba: Visual State Space Model for Remote Sensing Images Semantic Segmentation
by: Ma, Xianping, et al.
Published: (2024)
by: Ma, Xianping, et al.
Published: (2024)
MMRad-22K: A Structured Multimodal Evidence Dataset for Chest X-ray Report Generation
by: Zhao, Yichen, et al.
Published: (2026)
by: Zhao, Yichen, et al.
Published: (2026)
MFSeg: Efficient Multi-frame 3D Semantic Segmentation
by: Huang, Chengjie, et al.
Published: (2025)
by: Huang, Chengjie, et al.
Published: (2025)
GEAL: Generalizable 3D Affordance Learning with Cross-Modal Consistency
by: Lu, Dongyue, et al.
Published: (2024)
by: Lu, Dongyue, et al.
Published: (2024)
PGOV3D: Open-Vocabulary 3D Semantic Segmentation with Partial-to-Global Curriculum
by: Zhang, Shiqi, et al.
Published: (2025)
by: Zhang, Shiqi, et al.
Published: (2025)
EGFormer: Towards Efficient and Generalizable Multimodal Semantic Segmentation
by: Zhang, Zelin, et al.
Published: (2025)
by: Zhang, Zelin, et al.
Published: (2025)
Segment Any 3D Gaussians
by: Cen, Jiazhong, et al.
Published: (2023)
by: Cen, Jiazhong, et al.
Published: (2023)
3D-AffordanceLLM: Harnessing Large Language Models for Open-Vocabulary Affordance Detection in 3D Worlds
by: Chu, Hengshuo, et al.
Published: (2025)
by: Chu, Hengshuo, et al.
Published: (2025)
CutS3D: Cutting Semantics in 3D for 2D Unsupervised Instance Segmentation
by: Sick, Leon, et al.
Published: (2024)
by: Sick, Leon, et al.
Published: (2024)
Parameter Efficient Fine-tuning via Cross Block Orchestration for Segment Anything Model
by: Peng, Zelin, et al.
Published: (2023)
by: Peng, Zelin, et al.
Published: (2023)
Part-Aware Open-Vocabulary 3D Affordance Grounding via Prototypical Semantic and Geometric Alignment
by: Gou, Dongqiang, et al.
Published: (2026)
by: Gou, Dongqiang, et al.
Published: (2026)
Affostruction: 3D Affordance Grounding with Generative Reconstruction
by: Park, Chunghyun, et al.
Published: (2026)
by: Park, Chunghyun, et al.
Published: (2026)
AdaCo: Overcoming Visual Foundation Model Noise in 3D Semantic Segmentation via Adaptive Label Correction
by: Zou, Pufan, et al.
Published: (2024)
by: Zou, Pufan, et al.
Published: (2024)
Open-Vocabulary Remote Sensing Image Semantic Segmentation
by: Cao, Qinglong, et al.
Published: (2024)
by: Cao, Qinglong, et al.
Published: (2024)
VAGNet: Grounding 3D Affordance from Human-Object Interactions in Videos
by: Mao, Aihua, et al.
Published: (2026)
by: Mao, Aihua, et al.
Published: (2026)
Domain Adaptation-Based Crossmodal Knowledge Distillation for 3D Semantic Segmentation
by: Kang, Jialiang, et al.
Published: (2025)
by: Kang, Jialiang, et al.
Published: (2025)
3x2: 3D Object Part Segmentation by 2D Semantic Correspondences
by: Thai, Anh, et al.
Published: (2024)
by: Thai, Anh, et al.
Published: (2024)
SPHERE: Semantic-PHysical Engaged REpresentation for 3D Semantic Scene Completion
by: Yang, Zhiwen, et al.
Published: (2025)
by: Yang, Zhiwen, et al.
Published: (2025)
Grounding by Remembering: Cross-Scene and In-Scene Memory for 3D Functional Affordances
by: Wang, Qirui, et al.
Published: (2026)
by: Wang, Qirui, et al.
Published: (2026)
The More You See in 2D, the More You Perceive in 3D
by: Han, Xinyang, et al.
Published: (2024)
by: Han, Xinyang, et al.
Published: (2024)
Segment Anything in 3D with Radiance Fields
by: Cen, Jiazhong, et al.
Published: (2023)
by: Cen, Jiazhong, et al.
Published: (2023)
Geospatial-Reasoning-Driven Vocabulary-Agnostic Remote Sensing Semantic Segmentation
by: Zhou, Chufeng, et al.
Published: (2026)
by: Zhou, Chufeng, et al.
Published: (2026)
Affordance-Guided Diffusion Prior for 3D Hand Reconstruction
by: Suzuki, Naru, et al.
Published: (2025)
by: Suzuki, Naru, et al.
Published: (2025)
Similar Items
-
MedSeg-R: Reasoning Segmentation in Medical Images with Multimodal Large Language Models
by: Huang, Yu, et al.
Published: (2025) -
Learning 2D Invariant Affordance Knowledge for 3D Affordance Grounding
by: Gao, Xianqiang, et al.
Published: (2024) -
Tackling View-Dependent Semantics in 3D Language Gaussian Splatting
by: Cen, Jiazhong, et al.
Published: (2025) -
HyperET: Efficient Training in Hyperbolic Space for Multi-modal Large Language Models
by: Peng, Zelin, et al.
Published: (2025) -
Tendency-driven Mutual Exclusivity for Weakly Supervised Incremental Semantic Segmentation
by: Si, Chongjie, et al.
Published: (2024)