Saved in:
Bibliographic Details
Main Authors: He, Yufei, Liu, Juncheng, Hu, Zhiyuan, Chen, Yulin, Liu, Yue, Sui, Yuan, Li, Yibo, Chen, Nuo, Hu, Jun, Hooi, Bryan, Xu, Xinxing, Bian, Jiang
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2601.22964
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914295337451520
author He, Yufei
Liu, Juncheng
Hu, Zhiyuan
Chen, Yulin
Liu, Yue
Sui, Yuan
Li, Yibo
Chen, Nuo
Hu, Jun
Hooi, Bryan
Xu, Xinxing
Bian, Jiang
author_facet He, Yufei
Liu, Juncheng
Hu, Zhiyuan
Chen, Yulin
Liu, Yue
Sui, Yuan
Li, Yibo
Chen, Nuo
Hu, Jun
Hooi, Bryan
Xu, Xinxing
Bian, Jiang
contents Prevailing medical AI operates on an unrealistic ''one-shot'' model, diagnosing from a complete patient file. However, real-world diagnosis is an iterative inquiry where Clinicians sequentially ask questions and order tests to strategically gather information while managing cost and time. To address this, we first propose Med-Inquire, a new benchmark designed to evaluate an agent's ability to perform multi-turn diagnosis. Built upon a dataset of real-world clinical cases, Med-Inquire simulates the diagnostic process by hiding a complete patient file behind specialized Patient and Examination agents. They force the agent to proactively ask questions and order tests to gather information piece by piece. To tackle the challenges posed by Med-Inquire, we then introduce EvoClinician, a self-evolving agent that learns efficient diagnostic strategies at test time. Its core is a ''Diagnose-Grade-Evolve'' loop: an Actor agent attempts a diagnosis; a Process Grader agent performs credit assignment by evaluating each action for both clinical yield and resource efficiency; finally, an Evolver agent uses this feedback to update the Actor's strategy by evolving its prompt and memory. Our experiments show EvoClinician outperforms continual learning baselines and other self-evolving agents like memory agents. The code is available at https://github.com/yf-he/EvoClinician
format Preprint
id arxiv_https___arxiv_org_abs_2601_22964
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle EvoClinician: A Self-Evolving Agent for Multi-Turn Medical Diagnosis via Test-Time Evolutionary Learning
He, Yufei
Liu, Juncheng
Hu, Zhiyuan
Chen, Yulin
Liu, Yue
Sui, Yuan
Li, Yibo
Chen, Nuo
Hu, Jun
Hooi, Bryan
Xu, Xinxing
Bian, Jiang
Artificial Intelligence
Prevailing medical AI operates on an unrealistic ''one-shot'' model, diagnosing from a complete patient file. However, real-world diagnosis is an iterative inquiry where Clinicians sequentially ask questions and order tests to strategically gather information while managing cost and time. To address this, we first propose Med-Inquire, a new benchmark designed to evaluate an agent's ability to perform multi-turn diagnosis. Built upon a dataset of real-world clinical cases, Med-Inquire simulates the diagnostic process by hiding a complete patient file behind specialized Patient and Examination agents. They force the agent to proactively ask questions and order tests to gather information piece by piece. To tackle the challenges posed by Med-Inquire, we then introduce EvoClinician, a self-evolving agent that learns efficient diagnostic strategies at test time. Its core is a ''Diagnose-Grade-Evolve'' loop: an Actor agent attempts a diagnosis; a Process Grader agent performs credit assignment by evaluating each action for both clinical yield and resource efficiency; finally, an Evolver agent uses this feedback to update the Actor's strategy by evolving its prompt and memory. Our experiments show EvoClinician outperforms continual learning baselines and other self-evolving agents like memory agents. The code is available at https://github.com/yf-he/EvoClinician
title EvoClinician: A Self-Evolving Agent for Multi-Turn Medical Diagnosis via Test-Time Evolutionary Learning
topic Artificial Intelligence
url https://arxiv.org/abs/2601.22964