Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Liu, Philip R., Bansal, Sparsh, Dinh, Jimmy, Pawar, Aditya, Satishkumar, Ramani, Desai, Shail, Gupta, Neeraj, Wang, Xin, Hu, Shu
Format:	Preprint
Published:	2025
Subjects:	Multiagent Systems Artificial Intelligence Computer Vision and Pattern Recognition Machine Learning
Online Access:	https://arxiv.org/abs/2506.07400
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866908716252528640
author	Liu, Philip R. Bansal, Sparsh Dinh, Jimmy Pawar, Aditya Satishkumar, Ramani Desai, Shail Gupta, Neeraj Wang, Xin Hu, Shu
author_facet	Liu, Philip R. Bansal, Sparsh Dinh, Jimmy Pawar, Aditya Satishkumar, Ramani Desai, Shail Gupta, Neeraj Wang, Xin Hu, Shu
contents	The integration of deep learning-based glaucoma detection with large language models (LLMs) presents an automated strategy to mitigate ophthalmologist shortages and improve clinical reporting efficiency. However, applying general LLMs to medical imaging remains challenging due to hallucinations, limited interpretability, and insufficient domain-specific medical knowledge, which can potentially reduce clinical accuracy. Although recent approaches combining imaging models with LLM reasoning have improved reporting, they typically rely on a single generalist agent, restricting their capacity to emulate the diverse and complex reasoning found in multidisciplinary medical teams. To address these limitations, we propose MedChat, a multi-agent diagnostic framework and platform that combines specialized vision models with multiple role-specific LLM agents, all coordinated by a director agent. This design enhances reliability, reduces hallucination risk, and enables interactive diagnostic reporting through an interface tailored for clinical review and educational use. Code available at https://github.com/Purdue-M2/MedChat.
format	Preprint
id	arxiv_https___arxiv_org_abs_2506_07400
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	MedChat: A Multi-Agent Framework for Multimodal Diagnosis with Large Language Models Liu, Philip R. Bansal, Sparsh Dinh, Jimmy Pawar, Aditya Satishkumar, Ramani Desai, Shail Gupta, Neeraj Wang, Xin Hu, Shu Multiagent Systems Artificial Intelligence Computer Vision and Pattern Recognition Machine Learning The integration of deep learning-based glaucoma detection with large language models (LLMs) presents an automated strategy to mitigate ophthalmologist shortages and improve clinical reporting efficiency. However, applying general LLMs to medical imaging remains challenging due to hallucinations, limited interpretability, and insufficient domain-specific medical knowledge, which can potentially reduce clinical accuracy. Although recent approaches combining imaging models with LLM reasoning have improved reporting, they typically rely on a single generalist agent, restricting their capacity to emulate the diverse and complex reasoning found in multidisciplinary medical teams. To address these limitations, we propose MedChat, a multi-agent diagnostic framework and platform that combines specialized vision models with multiple role-specific LLM agents, all coordinated by a director agent. This design enhances reliability, reduces hallucination risk, and enables interactive diagnostic reporting through an interface tailored for clinical review and educational use. Code available at https://github.com/Purdue-M2/MedChat.
title	MedChat: A Multi-Agent Framework for Multimodal Diagnosis with Large Language Models
topic	Multiagent Systems Artificial Intelligence Computer Vision and Pattern Recognition Machine Learning
url	https://arxiv.org/abs/2506.07400

Similar Items