:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Wan, Zishuo, Gao, Yu, Pang, Wanyuan, Ding, Dawei
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2501.03482
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

TARDis: Time Attenuated Representation Disentanglement for Incomplete Multi-Modal Tumor Segmentation and Classification
by: Wan, Zishuo, et al.
Published: (2025)

VOILA: Evaluation of MLLMs For Perceptual Understanding and Analogical Reasoning
by: Yilmaz, Nilay, et al.
Published: (2025)

VOILA: Value-of-Information Guided Fidelity Selection for Cost-Aware Multimodal Question Answering
by: Bhope, Rahul Atul, et al.
Published: (2026)

Sparsity-Aware Voxel Attention and Foreground Modulation for 3D Semantic Scene Completion
by: Xue, Yu, et al.
Published: (2026)

Interactive Test-Time Adaptation with Reliable Spatial-Temporal Voxels for Multi-Modal Segmentation
by: Cao, Haozhi, et al.
Published: (2024)

DynamicTree: Interactive Real Tree Animation via Sparse Voxel Spectrum
by: Li, Yaokun, et al.
Published: (2025)

Context and Geometry Aware Voxel Transformer for Semantic Scene Completion
by: Yu, Zhu, et al.
Published: (2024)

DivAS: Interactive 3D Segmentation of NeRFs via Depth-Weighted Voxel Aggregation
by: Pande, Ayush
Published: (2026)

Adapting Vision-Language Model with Fine-grained Semantics for Open-Vocabulary Segmentation
by: Chng, Yong Xien, et al.
Published: (2024)

Towards Universal Text-driven CT Image Segmentation
by: Li, Yuheng, et al.
Published: (2025)

TiFRe: Text-guided Video Frame Reduction for Efficient Video Multi-modal Large Language Models
by: Zheng, Xiangtian, et al.
Published: (2026)

OpenVoxel: Training-Free Grouping and Captioning Voxels for Open-Vocabulary 3D Scene Understanding
by: Huang, Sheng-Yu, et al.
Published: (2026)

Taming Mambas for Voxel Level 3D Medical Image Segmentation
by: Lumetti, Luca, et al.
Published: (2024)

Not All Voxels Are Equal: Hardness-Aware Semantic Scene Completion with Self-Distillation
by: Wang, Song, et al.
Published: (2024)

LiteVoxel: Low-memory Intelligent Thresholding for Efficient Voxel Rasterization
by: Lee, Jee Won, et al.
Published: (2025)

UniVoxel: Fast Inverse Rendering by Unified Voxelization of Scene Representation
by: Wu, Shuang, et al.
Published: (2024)

Learning Trajectory-Aware Multimodal Large Language Models for Video Reasoning Segmentation
by: Luo, Jingnan, et al.
Published: (2026)

Towards Interactive Lesion Segmentation in Whole-Body PET/CT with Promptable Models
by: Rokuss, Maximilian, et al.
Published: (2025)

Interactive Segmentation and Report Generation for CT Images
by: Gu, Yannian, et al.
Published: (2025)

Global Position Aware Group Choreography using Large Language Model
by: Pang, Haozhou, et al.
Published: (2025)

Geometrical Cross-Attention and Nonvoid Voxelization for Efficient 3D Medical Image Segmentation
by: Yuan, Chenxin, et al.
Published: (2026)

GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmentation
by: Lin, Lang, et al.
Published: (2025)

Devil is in Details: Locality-Aware 3D Abdominal CT Volume Generation for Self-Supervised Organ Segmentation
by: Wang, Yuran, et al.
Published: (2024)

Voxel Densification for Serialized 3D Object Detection: Mitigating Sparsity via Pre-serialization Expansion
by: Liu, Qifeng, et al.
Published: (2025)

Open-set Anomaly Segmentation in Complex Scenarios
by: Xia, Song, et al.
Published: (2025)

4D Neural Voxel Splatting: Dynamic Scene Rendering with Voxelized Guassian Splatting
by: Wu, Chun-Tin, et al.
Published: (2025)

VoxelTrack: Exploring Voxel Representation for 3D Point Cloud Object Tracking
by: Lu, Yuxuan, et al.
Published: (2024)

SVRecon: Sparse Voxel Rasterization for Surface Reconstruction
by: Oh, Seunghun, et al.
Published: (2025)

PVNet: Point-Voxel Interaction LiDAR Scene Upsampling Via Diffusion Models
by: Cheng, Xianjing, et al.
Published: (2025)

ASSR-NeRF: Arbitrary-Scale Super-Resolution on Voxel Grid for High-Quality Radiance Fields Reconstruction
by: Huang, Ding-Jiun, et al.
Published: (2024)

Context-Aware Interaction Network for RGB-T Semantic Segmentation
by: Lv, Ying, et al.
Published: (2024)

Advancing Structured Priors for Sparse-Voxel Surface Reconstruction
by: Chi, Ting-Hsun, et al.
Published: (2026)

PointVoxelFormer -- Reviving point cloud networks for 3D medical imaging
by: Heinrich, Mattias Paul
Published: (2024)

Language and Geometry Grounded Sparse Voxel Representations for Holistic Scene Understanding
by: Wu, Guile, et al.
Published: (2026)

Dynamic Scene Understanding through Object-Centric Voxelization and Neural Rendering
by: Zhao, Yanpeng, et al.
Published: (2024)

Interactive Segmentation Model for Placenta Segmentation from 3D Ultrasound images
by: Li, Hao, et al.
Published: (2024)

VoxelOpt: Voxel-Adaptive Message Passing for Discrete Optimization in Deformable Abdominal CT Registration
by: Zhang, Hang, et al.
Published: (2025)

Irregularity Inspection using Neural Radiance Field
by: Ding, Tianqi, et al.
Published: (2024)

PIVOT-Net: Heterogeneous Point-Voxel-Tree-based Framework for Point Cloud Compression
by: Pang, Jiahao, et al.
Published: (2024)

CAR-SAM: Cross-Attention Reconstruction for Post-Training Quantization of the Segment Anything Model
by: Wen, Houji, et al.
Published: (2026)