Saved in:
| Main Authors: | Wan, Zishuo, Gao, Yu, Pang, Wanyuan, Ding, Dawei |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2501.03482 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
TARDis: Time Attenuated Representation Disentanglement for Incomplete Multi-Modal Tumor Segmentation and Classification
by: Wan, Zishuo, et al.
Published: (2025)
by: Wan, Zishuo, et al.
Published: (2025)
VOILA: Evaluation of MLLMs For Perceptual Understanding and Analogical Reasoning
by: Yilmaz, Nilay, et al.
Published: (2025)
by: Yilmaz, Nilay, et al.
Published: (2025)
VOILA: Value-of-Information Guided Fidelity Selection for Cost-Aware Multimodal Question Answering
by: Bhope, Rahul Atul, et al.
Published: (2026)
by: Bhope, Rahul Atul, et al.
Published: (2026)
Sparsity-Aware Voxel Attention and Foreground Modulation for 3D Semantic Scene Completion
by: Xue, Yu, et al.
Published: (2026)
by: Xue, Yu, et al.
Published: (2026)
Interactive Test-Time Adaptation with Reliable Spatial-Temporal Voxels for Multi-Modal Segmentation
by: Cao, Haozhi, et al.
Published: (2024)
by: Cao, Haozhi, et al.
Published: (2024)
DynamicTree: Interactive Real Tree Animation via Sparse Voxel Spectrum
by: Li, Yaokun, et al.
Published: (2025)
by: Li, Yaokun, et al.
Published: (2025)
Context and Geometry Aware Voxel Transformer for Semantic Scene Completion
by: Yu, Zhu, et al.
Published: (2024)
by: Yu, Zhu, et al.
Published: (2024)
DivAS: Interactive 3D Segmentation of NeRFs via Depth-Weighted Voxel Aggregation
by: Pande, Ayush
Published: (2026)
by: Pande, Ayush
Published: (2026)
Adapting Vision-Language Model with Fine-grained Semantics for Open-Vocabulary Segmentation
by: Chng, Yong Xien, et al.
Published: (2024)
by: Chng, Yong Xien, et al.
Published: (2024)
Towards Universal Text-driven CT Image Segmentation
by: Li, Yuheng, et al.
Published: (2025)
by: Li, Yuheng, et al.
Published: (2025)
TiFRe: Text-guided Video Frame Reduction for Efficient Video Multi-modal Large Language Models
by: Zheng, Xiangtian, et al.
Published: (2026)
by: Zheng, Xiangtian, et al.
Published: (2026)
OpenVoxel: Training-Free Grouping and Captioning Voxels for Open-Vocabulary 3D Scene Understanding
by: Huang, Sheng-Yu, et al.
Published: (2026)
by: Huang, Sheng-Yu, et al.
Published: (2026)
Taming Mambas for Voxel Level 3D Medical Image Segmentation
by: Lumetti, Luca, et al.
Published: (2024)
by: Lumetti, Luca, et al.
Published: (2024)
Not All Voxels Are Equal: Hardness-Aware Semantic Scene Completion with Self-Distillation
by: Wang, Song, et al.
Published: (2024)
by: Wang, Song, et al.
Published: (2024)
LiteVoxel: Low-memory Intelligent Thresholding for Efficient Voxel Rasterization
by: Lee, Jee Won, et al.
Published: (2025)
by: Lee, Jee Won, et al.
Published: (2025)
UniVoxel: Fast Inverse Rendering by Unified Voxelization of Scene Representation
by: Wu, Shuang, et al.
Published: (2024)
by: Wu, Shuang, et al.
Published: (2024)
Learning Trajectory-Aware Multimodal Large Language Models for Video Reasoning Segmentation
by: Luo, Jingnan, et al.
Published: (2026)
by: Luo, Jingnan, et al.
Published: (2026)
Towards Interactive Lesion Segmentation in Whole-Body PET/CT with Promptable Models
by: Rokuss, Maximilian, et al.
Published: (2025)
by: Rokuss, Maximilian, et al.
Published: (2025)
Interactive Segmentation and Report Generation for CT Images
by: Gu, Yannian, et al.
Published: (2025)
by: Gu, Yannian, et al.
Published: (2025)
Global Position Aware Group Choreography using Large Language Model
by: Pang, Haozhou, et al.
Published: (2025)
by: Pang, Haozhou, et al.
Published: (2025)
Geometrical Cross-Attention and Nonvoid Voxelization for Efficient 3D Medical Image Segmentation
by: Yuan, Chenxin, et al.
Published: (2026)
by: Yuan, Chenxin, et al.
Published: (2026)
GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmentation
by: Lin, Lang, et al.
Published: (2025)
by: Lin, Lang, et al.
Published: (2025)
Devil is in Details: Locality-Aware 3D Abdominal CT Volume Generation for Self-Supervised Organ Segmentation
by: Wang, Yuran, et al.
Published: (2024)
by: Wang, Yuran, et al.
Published: (2024)
Voxel Densification for Serialized 3D Object Detection: Mitigating Sparsity via Pre-serialization Expansion
by: Liu, Qifeng, et al.
Published: (2025)
by: Liu, Qifeng, et al.
Published: (2025)
Open-set Anomaly Segmentation in Complex Scenarios
by: Xia, Song, et al.
Published: (2025)
by: Xia, Song, et al.
Published: (2025)
4D Neural Voxel Splatting: Dynamic Scene Rendering with Voxelized Guassian Splatting
by: Wu, Chun-Tin, et al.
Published: (2025)
by: Wu, Chun-Tin, et al.
Published: (2025)
VoxelTrack: Exploring Voxel Representation for 3D Point Cloud Object Tracking
by: Lu, Yuxuan, et al.
Published: (2024)
by: Lu, Yuxuan, et al.
Published: (2024)
SVRecon: Sparse Voxel Rasterization for Surface Reconstruction
by: Oh, Seunghun, et al.
Published: (2025)
by: Oh, Seunghun, et al.
Published: (2025)
PVNet: Point-Voxel Interaction LiDAR Scene Upsampling Via Diffusion Models
by: Cheng, Xianjing, et al.
Published: (2025)
by: Cheng, Xianjing, et al.
Published: (2025)
ASSR-NeRF: Arbitrary-Scale Super-Resolution on Voxel Grid for High-Quality Radiance Fields Reconstruction
by: Huang, Ding-Jiun, et al.
Published: (2024)
by: Huang, Ding-Jiun, et al.
Published: (2024)
Context-Aware Interaction Network for RGB-T Semantic Segmentation
by: Lv, Ying, et al.
Published: (2024)
by: Lv, Ying, et al.
Published: (2024)
Advancing Structured Priors for Sparse-Voxel Surface Reconstruction
by: Chi, Ting-Hsun, et al.
Published: (2026)
by: Chi, Ting-Hsun, et al.
Published: (2026)
PointVoxelFormer -- Reviving point cloud networks for 3D medical imaging
by: Heinrich, Mattias Paul
Published: (2024)
by: Heinrich, Mattias Paul
Published: (2024)
Language and Geometry Grounded Sparse Voxel Representations for Holistic Scene Understanding
by: Wu, Guile, et al.
Published: (2026)
by: Wu, Guile, et al.
Published: (2026)
Dynamic Scene Understanding through Object-Centric Voxelization and Neural Rendering
by: Zhao, Yanpeng, et al.
Published: (2024)
by: Zhao, Yanpeng, et al.
Published: (2024)
Interactive Segmentation Model for Placenta Segmentation from 3D Ultrasound images
by: Li, Hao, et al.
Published: (2024)
by: Li, Hao, et al.
Published: (2024)
VoxelOpt: Voxel-Adaptive Message Passing for Discrete Optimization in Deformable Abdominal CT Registration
by: Zhang, Hang, et al.
Published: (2025)
by: Zhang, Hang, et al.
Published: (2025)
Irregularity Inspection using Neural Radiance Field
by: Ding, Tianqi, et al.
Published: (2024)
by: Ding, Tianqi, et al.
Published: (2024)
PIVOT-Net: Heterogeneous Point-Voxel-Tree-based Framework for Point Cloud Compression
by: Pang, Jiahao, et al.
Published: (2024)
by: Pang, Jiahao, et al.
Published: (2024)
CAR-SAM: Cross-Attention Reconstruction for Post-Training Quantization of the Segment Anything Model
by: Wen, Houji, et al.
Published: (2026)
by: Wen, Houji, et al.
Published: (2026)
Similar Items
-
TARDis: Time Attenuated Representation Disentanglement for Incomplete Multi-Modal Tumor Segmentation and Classification
by: Wan, Zishuo, et al.
Published: (2025) -
VOILA: Evaluation of MLLMs For Perceptual Understanding and Analogical Reasoning
by: Yilmaz, Nilay, et al.
Published: (2025) -
VOILA: Value-of-Information Guided Fidelity Selection for Cost-Aware Multimodal Question Answering
by: Bhope, Rahul Atul, et al.
Published: (2026) -
Sparsity-Aware Voxel Attention and Foreground Modulation for 3D Semantic Scene Completion
by: Xue, Yu, et al.
Published: (2026) -
Interactive Test-Time Adaptation with Reliable Spatial-Temporal Voxels for Multi-Modal Segmentation
by: Cao, Haozhi, et al.
Published: (2024)