| Main Authors: | Su, Yuhao, Choudhuri, Anwesa, Gao, Zhongpai, Planche, Benjamin, Nguyen, Van Nguyen, Zheng, Meng, Shen, Yuhan, Innanje, Arun, Chen, Terrence, Elhamifar, Ehsan, Wu, Ziyan |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2512.06581 |
Similar Items
PolypSegTrack: Unified Foundation Model for Colonoscopy Video Analysis
by: Choudhuri, Anwesa, et al.
Published: (2025)
Render-FM: A Foundation Model for Real-time Photorealistic Volumetric Rendering
by: Gao, Zhongpai, et al.
Published: (2025)
7DGS: Unified Spatial-Temporal-Angular Gaussian Splatting
by: Gao, Zhongpai, et al.
Published: (2025)
6DGS: Enhanced Direction-Aware Gaussian Splatting for Volumetric Rendering
by: Gao, Zhongpai, et al.
Published: (2024)
Failing Forward: Adaptive Failure-Informed Learning for Vision-Language-Action Models
by: Zheng, Meng, et al.
Published: (2026)
3D Vision-Language Gaussian Splatting
by: Peng, Qucheng, et al.
Published: (2024)
Seq2Time: Sequential Knowledge Transfer for Video LLM Temporal Grounding
by: Deng, Andong, et al.
Published: (2024)
Universal Beta Splatting
by: Liu, Rong, et al.
Published: (2025)
Consistent Instance Field for Dynamic Scene Understanding
by: Wu, Junyi, et al.
Published: (2025)
Order-aware Interactive Segmentation
by: Wang, Bin, et al.
Published: (2024)
From Particles to Fields: Reframing Photon Mapping with Continuous Gaussian Photon Fields
by: Tao, Jiachen, et al.
Published: (2025)
Automated Patient Positioning with Learned 3D Hand Gestures
by: Gao, Zhongpai, et al.
Published: (2024)
Anatomy-Aware Conditional Image-Text Retrieval
by: Zheng, Meng, et al.
Published: (2025)
DDGS-CT: Direction-Disentangled Gaussian Splatting for Realistic Volume Rendering
by: Gao, Zhongpai, et al.
Published: (2024)
CHROME: Clothed Human Reconstruction with Occlusion-Resilience and Multiview-Consistency from a Single Image
by: Dutta, Arindam, et al.
Published: (2025)
Few-Shot 3D Volumetric Segmentation with Multi-Surrogate Fusion
by: Zheng, Meng, et al.
Published: (2024)
PBADet: A One-Stage Anchor-Free Approach for Part-Body Association
by: Gao, Zhongpai, et al.
Published: (2024)
Exploring Cycle Consistency Learning in Interactive Volume Segmentation
by: Liu, Qin, et al.
Published: (2023)
OW-VISCapTor: Abstractors for Open-World Video Instance Segmentation and Captioning
by: Choudhuri, Anwesa, et al.
Published: (2024)
Neural Finite-State Machines for Surgical Phase Recognition
by: Ding, Hao, et al.
Published: (2024)
DaRePlane: Direction-aware Representations for Dynamic Scene Reconstruction
by: Lou, Ange, et al.
Published: (2024)
DaReNeRF: Direction-aware Representation for Dynamic Scenes
by: Lou, Ange, et al.
Published: (2024)
Divide and Fuse: Body Part Mesh Recovery from Partially Visible Human Images
by: Luan, Tianyu, et al.
Published: (2024)
Self-supervised 3D Patient Modeling with Multi-modal Attentive Fusion
by: Zheng, Meng, et al.
Published: (2024)
Automating Catheterization Labs with Real-Time Perception
by: Yang, Fan, et al.
Published: (2024)
EgoMAGIC- An Egocentric Video Field Medicine Dataset for Training Perception Algorithms
by: VanVoorst, Brian, et al.
Published: (2026)
PRS-Med: Position Reasoning Segmentation in Medical Imaging
by: Trinh, Quoc-Huy, et al.
Published: (2025)
DeCafNet: Delegate and Conquer for Efficient Temporal Grounding in Long Videos
by: Lu, Zijia, et al.
Published: (2025)
ViCLSR: A Supervised Contrastive Learning Framework with Natural Language Inference for Natural Language Understanding Tasks
by: Van Huynh, Tin, et al.
Published: (2026)
Oranits: Mission Assignment and Task Offloading in Open RAN-based ITS using Metaheuristic and Deep Reinforcement Learning
by: Nguyen, Ngoc Hung, et al.
Published: (2025)
ExGra-Med: Extended Context Graph Alignment for Medical Vision-Language Models
by: Nguyen, Duy M. H., et al.
Published: (2024)
MedHorizon: Towards Long-context Medical Video Understanding in the Wild
by: Du, Bodong, et al.
Published: (2026)
MultiMed-ST: Large-scale Many-to-many Multilingual Medical Speech Translation
by: Le-Duc, Khai, et al.
Published: (2025)
Med-StepBench: A Hierarchical Reasoning Framework for Evaluating Hallucinations in Medical Vision-Language Models
by: Nguyen, Minh Khoi, et al.
Published: (2026)
Multi-Dialect Vietnamese: Task, Dataset, Baseline Models and Challenges
by: Van Dinh, Nguyen, et al.
Published: (2024)
Balanced Aggregation: Understanding and Fixing Aggregation Bias in GRPO
by: Zeng, Zhiyuan, et al.
Published: (2026)
Deadline-Aware Joint Task Scheduling and Offloading in Mobile Edge Computing Systems
by: Nguyen, Ngoc Hung, et al.
Published: (2025)
Decentralized Covert Routing in Heterogeneous Networks Using Reinforcement Learning
by: Kong, Justin, et al.
Published: (2024)
SynerMedGen: Synergizing Medical Multimodal Understanding with Generation via Task Alignment
by: Zhao, Weiren, et al.
Published: (2026)
Machine Intelligence that Understands Visual and Linguistic Information and Interacts with Humans and Environments
by: Nguyen, Van Quang
Published: (2026)