Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Liu, Wenxuan, Li, Zixuan, Bai, Long, Zhang, Chunmao, Zhang, Fenghui, Chen, Zhuo, Li, Wei, Zuo, Yuxin, Wang, Fei, Xu, Bingbing, Jiang, Xuhui, Zhang, Jin, Jin, Xiaolong, Guo, Jiafeng, Chua, Tat-Seng, Cheng, Xueqi
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2604.07720
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866913021596532736
author	Liu, Wenxuan Li, Zixuan Bai, Long Zhang, Chunmao Zhang, Fenghui Chen, Zhuo Li, Wei Zuo, Yuxin Wang, Fei Xu, Bingbing Jiang, Xuhui Zhang, Jin Jin, Xiaolong Guo, Jiafeng Chua, Tat-Seng Cheng, Xueqi
author_facet	Liu, Wenxuan Li, Zixuan Bai, Long Zhang, Chunmao Zhang, Fenghui Chen, Zhuo Li, Wei Zuo, Yuxin Wang, Fei Xu, Bingbing Jiang, Xuhui Zhang, Jin Jin, Xiaolong Guo, Jiafeng Chua, Tat-Seng Cheng, Xueqi
contents	Deep Research (DR) requires LLM agents to autonomously perform multi-step information seeking, processing, and reasoning to generate comprehensive reports. In contrast to existing studies that mainly focus on unstructured web content, a more challenging DR task should additionally utilize structured knowledge to provide a solid data foundation, facilitate quantitative computation, and lead to in-depth analyses. In this paper, we refer to this novel task as Knowledgeable Deep Research (KDR), which requires DR agents to generate reports with both structured and unstructured knowledge. Furthermore, we propose the Hybrid Knowledge Analysis framework (HKA), a multi-agent architecture that reasons over both kinds of knowledge and integrates the texts, figures, and tables into coherent multimodal reports. The key design is the Structured Knowledge Analyzer, which utilizes both coding and vision-language models to produce figures, tables, and corresponding insights. To support systematic evaluation, we construct KDR-Bench, which covers 9 domains, includes 41 expert-level questions, and incorporates a large number of structured knowledge resources (e.g., 1,252 tables). We further annotate the main conclusions and key points for each question and propose three categories of evaluation metrics including general-purpose, knowledge-centric, and vision-enhanced ones. Experimental results demonstrate that HKA consistently outperforms most existing DR agents on general-purpose and knowledge-centric metrics, and even surpasses the Gemini DR agent on vision-enhanced metrics, highlighting its effectiveness in deep, structure-aware knowledge analysis. Finally, we hope this work can serve as a new foundation for structured knowledge analysis in DR agents and facilitate future multimodal DR studies.
format	Preprint
id	arxiv_https___arxiv_org_abs_2604_07720
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	Towards Knowledgeable Deep Research: Framework and Benchmark Liu, Wenxuan Li, Zixuan Bai, Long Zhang, Chunmao Zhang, Fenghui Chen, Zhuo Li, Wei Zuo, Yuxin Wang, Fei Xu, Bingbing Jiang, Xuhui Zhang, Jin Jin, Xiaolong Guo, Jiafeng Chua, Tat-Seng Cheng, Xueqi Artificial Intelligence Deep Research (DR) requires LLM agents to autonomously perform multi-step information seeking, processing, and reasoning to generate comprehensive reports. In contrast to existing studies that mainly focus on unstructured web content, a more challenging DR task should additionally utilize structured knowledge to provide a solid data foundation, facilitate quantitative computation, and lead to in-depth analyses. In this paper, we refer to this novel task as Knowledgeable Deep Research (KDR), which requires DR agents to generate reports with both structured and unstructured knowledge. Furthermore, we propose the Hybrid Knowledge Analysis framework (HKA), a multi-agent architecture that reasons over both kinds of knowledge and integrates the texts, figures, and tables into coherent multimodal reports. The key design is the Structured Knowledge Analyzer, which utilizes both coding and vision-language models to produce figures, tables, and corresponding insights. To support systematic evaluation, we construct KDR-Bench, which covers 9 domains, includes 41 expert-level questions, and incorporates a large number of structured knowledge resources (e.g., 1,252 tables). We further annotate the main conclusions and key points for each question and propose three categories of evaluation metrics including general-purpose, knowledge-centric, and vision-enhanced ones. Experimental results demonstrate that HKA consistently outperforms most existing DR agents on general-purpose and knowledge-centric metrics, and even surpasses the Gemini DR agent on vision-enhanced metrics, highlighting its effectiveness in deep, structure-aware knowledge analysis. Finally, we hope this work can serve as a new foundation for structured knowledge analysis in DR agents and facilitate future multimodal DR studies.
title	Towards Knowledgeable Deep Research: Framework and Benchmark
topic	Artificial Intelligence
url	https://arxiv.org/abs/2604.07720

Similar Items