:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Wang, Guoxin, Zhao, Jun, Liu, Xinyi, Liu, Yanbo, Cao, Xuyang, Li, Chao, Liu, Zhuoyun, Sun, Qintian, Zhou, Fangru, Xing, Haoqiang, Yang, Zhenhong
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence Computation and Language
Online Access:	https://arxiv.org/abs/2509.19090
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

JoyTTS: LLM-based Spoken Chatbot With Voice Cloning
by: Zhou, Fangru, et al.
Published: (2025)

Citrus: Leveraging Expert Cognitive Pathways in a Medical Language Model for Advanced Medical Decision Support
by: Wang, Guoxin, et al.
Published: (2025)

Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning
by: LASA Team, et al.
Published: (2025)

JoyHallo: Digital human model for Mandarin
by: Shi, Sheng, et al.
Published: (2024)

Clinically-Grounded Counterfactual Reasoning for Medical Video Diagnosis
by: Gao, Jianzhe, et al.
Published: (2026)

Revisiting 2D Foundation Models for Scalable 3D Medical Image Classification
by: Liu, Han, et al.
Published: (2025)

Med-GLIP: Advancing Medical Language-Image Pre-training with Large-scale Grounded Dataset
by: Deng, Ziye, et al.
Published: (2025)

Ask Patients with Patience: Enabling LLMs for Human-Centric Medical Dialogue with Grounded Reasoning
by: Zhu, Jiayuan, et al.
Published: (2025)

ReMedi: Reasoner for Medical Clinical Prediction
by: Cao, Yushi, et al.
Published: (2026)

Unified and Semantically Grounded Domain Adaptation for Medical Image Segmentation
by: Wang, Xin, et al.
Published: (2025)

Med-R2: An Adversarial Benchmark for Evidence-Grounded Reasoning in Medical VLMs
by: Ma, Wen, et al.
Published: (2026)

MediSee: Reasoning-based Pixel-level Perception in Medical Images
by: Tong, Qinyue, et al.
Published: (2025)

MedGround-R1: Advancing Medical Image Grounding via Spatial-Semantic Rewarded Group Relative Policy Optimization
by: Xu, Huihui, et al.
Published: (2025)

How Do Medical MLLMs Fail? A Study on Visual Grounding in Medical Images
by: Liu, Guimeng, et al.
Published: (2026)

FedFMS: Exploring Federated Foundation Models for Medical Image Segmentation
by: Liu, Yuxi, et al.
Published: (2024)

Demographic Bias of Expert-Level Vision-Language Foundation Models in Medical Imaging
by: Yang, Yuzhe, et al.
Published: (2024)

Vision Foundation Models in Medical Image Analysis: Advances and Challenges
by: Liang, Pengchen, et al.
Published: (2025)

MeDUET: Disentangled Unified Pretraining for 3D Medical Image Synthesis and Analysis
by: Liu, Junkai, et al.
Published: (2026)

3DReasonKnee: Advancing Grounded Reasoning in Medical Vision Language Models
by: Sambara, Sraavya, et al.
Published: (2025)

MedSeg-R: Medical Image Segmentation with Clinical Reasoning
by: Shao, Hao, et al.
Published: (2025)

InfiMed: Low-Resource Medical MLLMs with Advancing Understanding and Reasoning
by: Liu, Zeyu, et al.
Published: (2025)

MediRound: Multi-Round Entity-Level Reasoning Segmentation in Medical Images
by: Tong, Qinyue, et al.
Published: (2025)

uMedSum: A Unified Framework for Advancing Medical Abstractive Summarization
by: Nagar, Aishik, et al.
Published: (2024)

UI-E2I-Synth: Advancing GUI Grounding with Large-Scale Instruction Synthesis
by: Liu, Xinyi, et al.
Published: (2025)

Medical Vision Generalist: Unifying Medical Imaging Tasks in Context
by: Ren, Sucheng, et al.
Published: (2024)

A Foundation Model for General Moving Object Segmentation in Medical Images
by: Yan, Zhongnuo, et al.
Published: (2023)

MedSG-Bench: A Benchmark for Medical Image Sequences Grounding
by: Yue, Jingkun, et al.
Published: (2025)

MediEval: A Unified Medical Benchmark for Patient-Contextual and Knowledge-Grounded Reasoning in LLMs
by: Qu, Zhan, et al.
Published: (2025)

Retrieval-augmented Few-shot Medical Image Segmentation with Foundation Models
by: Zhao, Lin, et al.
Published: (2024)

MedFrameQA: A Multi-Image Medical VQA Benchmark for Clinical Reasoning
by: Yu, Suhao, et al.
Published: (2025)

V2T-CoT: From Vision to Text Chain-of-Thought for Medical Reasoning and Diagnosis
by: Wang, Yuan, et al.
Published: (2025)

Similarity Memory Prior is All You Need for Medical Image Segmentation
by: Tang, Hao, et al.
Published: (2025)

Advancing Problem-Based Learning with Clinical Reasoning for Improved Differential Diagnosis in Medical Education
by: Xu, Yuansong, et al.
Published: (2025)

JoyVASA: Portrait and Animal Image Animation with Diffusion-Based Audio-Driven Facial Dynamics and Head Motion Generation
by: Cao, Xuyang, et al.
Published: (2024)

Unified Medical Image Tokenizer for Autoregressive Synthesis and Understanding
by: Ma, Chenglong, et al.
Published: (2025)

PRS-Med: Position Reasoning Segmentation in Medical Imaging
by: Trinh, Quoc-Huy, et al.
Published: (2025)

Parameter-Efficient Fine-Tuning Medical Multimodal Large Language Models for Medical Visual Grounding
by: He, Jinlong, et al.
Published: (2024)

Uncertainty-aware Medical Diagnostic Phrase Identification and Grounding
by: Zou, Ke, et al.
Published: (2024)

Generative Artificial Intelligence in Medical Imaging: Foundations, Progress, and Clinical Translation
by: Zhou, Xuanru, et al.
Published: (2025)

AutoMedic: An Automated Evaluation Framework for Clinical Conversational Agents with Medical Dataset Grounding
by: Oh, Gyutaek, et al.
Published: (2025)