Saved in:
| Main Authors: | Wang, Guoxin, Zhao, Jun, Liu, Xinyi, Liu, Yanbo, Cao, Xuyang, Li, Chao, Liu, Zhuoyun, Sun, Qintian, Zhou, Fangru, Xing, Haoqiang, Yang, Zhenhong |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.19090 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
JoyTTS: LLM-based Spoken Chatbot With Voice Cloning
by: Zhou, Fangru, et al.
Published: (2025)
by: Zhou, Fangru, et al.
Published: (2025)
Citrus: Leveraging Expert Cognitive Pathways in a Medical Language Model for Advanced Medical Decision Support
by: Wang, Guoxin, et al.
Published: (2025)
by: Wang, Guoxin, et al.
Published: (2025)
Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning
by: LASA Team, et al.
Published: (2025)
by: LASA Team, et al.
Published: (2025)
JoyHallo: Digital human model for Mandarin
by: Shi, Sheng, et al.
Published: (2024)
by: Shi, Sheng, et al.
Published: (2024)
Clinically-Grounded Counterfactual Reasoning for Medical Video Diagnosis
by: Gao, Jianzhe, et al.
Published: (2026)
by: Gao, Jianzhe, et al.
Published: (2026)
Revisiting 2D Foundation Models for Scalable 3D Medical Image Classification
by: Liu, Han, et al.
Published: (2025)
by: Liu, Han, et al.
Published: (2025)
Med-GLIP: Advancing Medical Language-Image Pre-training with Large-scale Grounded Dataset
by: Deng, Ziye, et al.
Published: (2025)
by: Deng, Ziye, et al.
Published: (2025)
Ask Patients with Patience: Enabling LLMs for Human-Centric Medical Dialogue with Grounded Reasoning
by: Zhu, Jiayuan, et al.
Published: (2025)
by: Zhu, Jiayuan, et al.
Published: (2025)
ReMedi: Reasoner for Medical Clinical Prediction
by: Cao, Yushi, et al.
Published: (2026)
by: Cao, Yushi, et al.
Published: (2026)
Unified and Semantically Grounded Domain Adaptation for Medical Image Segmentation
by: Wang, Xin, et al.
Published: (2025)
by: Wang, Xin, et al.
Published: (2025)
Med-R2: An Adversarial Benchmark for Evidence-Grounded Reasoning in Medical VLMs
by: Ma, Wen, et al.
Published: (2026)
by: Ma, Wen, et al.
Published: (2026)
MediSee: Reasoning-based Pixel-level Perception in Medical Images
by: Tong, Qinyue, et al.
Published: (2025)
by: Tong, Qinyue, et al.
Published: (2025)
MedGround-R1: Advancing Medical Image Grounding via Spatial-Semantic Rewarded Group Relative Policy Optimization
by: Xu, Huihui, et al.
Published: (2025)
by: Xu, Huihui, et al.
Published: (2025)
How Do Medical MLLMs Fail? A Study on Visual Grounding in Medical Images
by: Liu, Guimeng, et al.
Published: (2026)
by: Liu, Guimeng, et al.
Published: (2026)
FedFMS: Exploring Federated Foundation Models for Medical Image Segmentation
by: Liu, Yuxi, et al.
Published: (2024)
by: Liu, Yuxi, et al.
Published: (2024)
Demographic Bias of Expert-Level Vision-Language Foundation Models in Medical Imaging
by: Yang, Yuzhe, et al.
Published: (2024)
by: Yang, Yuzhe, et al.
Published: (2024)
Vision Foundation Models in Medical Image Analysis: Advances and Challenges
by: Liang, Pengchen, et al.
Published: (2025)
by: Liang, Pengchen, et al.
Published: (2025)
MeDUET: Disentangled Unified Pretraining for 3D Medical Image Synthesis and Analysis
by: Liu, Junkai, et al.
Published: (2026)
by: Liu, Junkai, et al.
Published: (2026)
3DReasonKnee: Advancing Grounded Reasoning in Medical Vision Language Models
by: Sambara, Sraavya, et al.
Published: (2025)
by: Sambara, Sraavya, et al.
Published: (2025)
MedSeg-R: Medical Image Segmentation with Clinical Reasoning
by: Shao, Hao, et al.
Published: (2025)
by: Shao, Hao, et al.
Published: (2025)
InfiMed: Low-Resource Medical MLLMs with Advancing Understanding and Reasoning
by: Liu, Zeyu, et al.
Published: (2025)
by: Liu, Zeyu, et al.
Published: (2025)
MediRound: Multi-Round Entity-Level Reasoning Segmentation in Medical Images
by: Tong, Qinyue, et al.
Published: (2025)
by: Tong, Qinyue, et al.
Published: (2025)
uMedSum: A Unified Framework for Advancing Medical Abstractive Summarization
by: Nagar, Aishik, et al.
Published: (2024)
by: Nagar, Aishik, et al.
Published: (2024)
UI-E2I-Synth: Advancing GUI Grounding with Large-Scale Instruction Synthesis
by: Liu, Xinyi, et al.
Published: (2025)
by: Liu, Xinyi, et al.
Published: (2025)
Medical Vision Generalist: Unifying Medical Imaging Tasks in Context
by: Ren, Sucheng, et al.
Published: (2024)
by: Ren, Sucheng, et al.
Published: (2024)
A Foundation Model for General Moving Object Segmentation in Medical Images
by: Yan, Zhongnuo, et al.
Published: (2023)
by: Yan, Zhongnuo, et al.
Published: (2023)
MedSG-Bench: A Benchmark for Medical Image Sequences Grounding
by: Yue, Jingkun, et al.
Published: (2025)
by: Yue, Jingkun, et al.
Published: (2025)
MediEval: A Unified Medical Benchmark for Patient-Contextual and Knowledge-Grounded Reasoning in LLMs
by: Qu, Zhan, et al.
Published: (2025)
by: Qu, Zhan, et al.
Published: (2025)
Retrieval-augmented Few-shot Medical Image Segmentation with Foundation Models
by: Zhao, Lin, et al.
Published: (2024)
by: Zhao, Lin, et al.
Published: (2024)
MedFrameQA: A Multi-Image Medical VQA Benchmark for Clinical Reasoning
by: Yu, Suhao, et al.
Published: (2025)
by: Yu, Suhao, et al.
Published: (2025)
V2T-CoT: From Vision to Text Chain-of-Thought for Medical Reasoning and Diagnosis
by: Wang, Yuan, et al.
Published: (2025)
by: Wang, Yuan, et al.
Published: (2025)
Similarity Memory Prior is All You Need for Medical Image Segmentation
by: Tang, Hao, et al.
Published: (2025)
by: Tang, Hao, et al.
Published: (2025)
Advancing Problem-Based Learning with Clinical Reasoning for Improved Differential Diagnosis in Medical Education
by: Xu, Yuansong, et al.
Published: (2025)
by: Xu, Yuansong, et al.
Published: (2025)
JoyVASA: Portrait and Animal Image Animation with Diffusion-Based Audio-Driven Facial Dynamics and Head Motion Generation
by: Cao, Xuyang, et al.
Published: (2024)
by: Cao, Xuyang, et al.
Published: (2024)
Unified Medical Image Tokenizer for Autoregressive Synthesis and Understanding
by: Ma, Chenglong, et al.
Published: (2025)
by: Ma, Chenglong, et al.
Published: (2025)
PRS-Med: Position Reasoning Segmentation in Medical Imaging
by: Trinh, Quoc-Huy, et al.
Published: (2025)
by: Trinh, Quoc-Huy, et al.
Published: (2025)
Parameter-Efficient Fine-Tuning Medical Multimodal Large Language Models for Medical Visual Grounding
by: He, Jinlong, et al.
Published: (2024)
by: He, Jinlong, et al.
Published: (2024)
Uncertainty-aware Medical Diagnostic Phrase Identification and Grounding
by: Zou, Ke, et al.
Published: (2024)
by: Zou, Ke, et al.
Published: (2024)
Generative Artificial Intelligence in Medical Imaging: Foundations, Progress, and Clinical Translation
by: Zhou, Xuanru, et al.
Published: (2025)
by: Zhou, Xuanru, et al.
Published: (2025)
AutoMedic: An Automated Evaluation Framework for Clinical Conversational Agents with Medical Dataset Grounding
by: Oh, Gyutaek, et al.
Published: (2025)
by: Oh, Gyutaek, et al.
Published: (2025)
Similar Items
-
JoyTTS: LLM-based Spoken Chatbot With Voice Cloning
by: Zhou, Fangru, et al.
Published: (2025) -
Citrus: Leveraging Expert Cognitive Pathways in a Medical Language Model for Advanced Medical Decision Support
by: Wang, Guoxin, et al.
Published: (2025) -
Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning
by: LASA Team, et al.
Published: (2025) -
JoyHallo: Digital human model for Mandarin
by: Shi, Sheng, et al.
Published: (2024) -
Clinically-Grounded Counterfactual Reasoning for Medical Video Diagnosis
by: Gao, Jianzhe, et al.
Published: (2026)