Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Lin, Zhichao, Liang, Zhichao, Liu, Gaoqiang, Xu, Meng, Xiang, Baoyu, Zhao, Shuxin, Wu, Yao, Xu, Jian, Jiang, Guanjun
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2604.12867
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866918462673125376
author	Lin, Zhichao Liang, Zhichao Liu, Gaoqiang Xu, Meng Xiang, Baoyu Zhao, Shuxin Wu, Yao Xu, Jian Jiang, Guanjun
author_facet	Lin, Zhichao Liang, Zhichao Liu, Gaoqiang Xu, Meng Xiang, Baoyu Zhao, Shuxin Wu, Yao Xu, Jian Jiang, Guanjun
contents	As agentic foundation models continue to evolve, how to further improve their performance in vertical domains has become an important challenge. To this end, building upon Tongyi DeepResearch, a powerful agentic foundation model, we focus on the Chinese medical deep search scenario and propose QuarkMedSearch, systematically exploring a full-pipeline approach spanning medical multi-hop data construction, training strategies, and evaluation benchmarks to further push and assess its performance upper bound in vertical domains. Specifically, for data synthesis, to address the scarcity of deep search training data in the medical domain, we combine a large-scale medical knowledge graph with real-time online exploration to construct long-horizon medical deep search training data; for post-training, we adopt a two-stage SFT and RL training strategy that progressively enhances the model's planning, tool invocation, and reflection capabilities required for deep search, while maintaining search efficiency; for evaluation, we collaborate with medical experts to construct the QuarkMedSearch Benchmark through rigorous manual verification. Experimental results demonstrate that QuarkMedSearch achieves state-of-the-art performance among open-source models of comparable scale on the QuarkMedSearch Benchmark, while also maintaining strong competitiveness on general benchmarks.
format	Preprint
id	arxiv_https___arxiv_org_abs_2604_12867
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	QuarkMedSearch: A Long-Horizon Deep Search Agent for Exploring Medical Intelligence Lin, Zhichao Liang, Zhichao Liu, Gaoqiang Xu, Meng Xiang, Baoyu Zhao, Shuxin Wu, Yao Xu, Jian Jiang, Guanjun Artificial Intelligence As agentic foundation models continue to evolve, how to further improve their performance in vertical domains has become an important challenge. To this end, building upon Tongyi DeepResearch, a powerful agentic foundation model, we focus on the Chinese medical deep search scenario and propose QuarkMedSearch, systematically exploring a full-pipeline approach spanning medical multi-hop data construction, training strategies, and evaluation benchmarks to further push and assess its performance upper bound in vertical domains. Specifically, for data synthesis, to address the scarcity of deep search training data in the medical domain, we combine a large-scale medical knowledge graph with real-time online exploration to construct long-horizon medical deep search training data; for post-training, we adopt a two-stage SFT and RL training strategy that progressively enhances the model's planning, tool invocation, and reflection capabilities required for deep search, while maintaining search efficiency; for evaluation, we collaborate with medical experts to construct the QuarkMedSearch Benchmark through rigorous manual verification. Experimental results demonstrate that QuarkMedSearch achieves state-of-the-art performance among open-source models of comparable scale on the QuarkMedSearch Benchmark, while also maintaining strong competitiveness on general benchmarks.
title	QuarkMedSearch: A Long-Horizon Deep Search Agent for Exploring Medical Intelligence
topic	Artificial Intelligence
url	https://arxiv.org/abs/2604.12867

Similar Items