Saved in:
Bibliographic Details
Main Authors: Liu, Wenxuan, Li, Zixuan, Bai, Long, Zhang, Chunmao, Zhang, Fenghui, Chen, Zhuo, Li, Wei, Zuo, Yuxin, Wang, Fei, Xu, Bingbing, Jiang, Xuhui, Zhang, Jin, Jin, Xiaolong, Guo, Jiafeng, Chua, Tat-Seng, Cheng, Xueqi
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2604.07720
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866913021596532736
author Liu, Wenxuan
Li, Zixuan
Bai, Long
Zhang, Chunmao
Zhang, Fenghui
Chen, Zhuo
Li, Wei
Zuo, Yuxin
Wang, Fei
Xu, Bingbing
Jiang, Xuhui
Zhang, Jin
Jin, Xiaolong
Guo, Jiafeng
Chua, Tat-Seng
Cheng, Xueqi
author_facet Liu, Wenxuan
Li, Zixuan
Bai, Long
Zhang, Chunmao
Zhang, Fenghui
Chen, Zhuo
Li, Wei
Zuo, Yuxin
Wang, Fei
Xu, Bingbing
Jiang, Xuhui
Zhang, Jin
Jin, Xiaolong
Guo, Jiafeng
Chua, Tat-Seng
Cheng, Xueqi
contents Deep Research (DR) requires LLM agents to autonomously perform multi-step information seeking, processing, and reasoning to generate comprehensive reports. In contrast to existing studies that mainly focus on unstructured web content, a more challenging DR task should additionally utilize structured knowledge to provide a solid data foundation, facilitate quantitative computation, and lead to in-depth analyses. In this paper, we refer to this novel task as Knowledgeable Deep Research (KDR), which requires DR agents to generate reports with both structured and unstructured knowledge. Furthermore, we propose the Hybrid Knowledge Analysis framework (HKA), a multi-agent architecture that reasons over both kinds of knowledge and integrates the texts, figures, and tables into coherent multimodal reports. The key design is the Structured Knowledge Analyzer, which utilizes both coding and vision-language models to produce figures, tables, and corresponding insights. To support systematic evaluation, we construct KDR-Bench, which covers 9 domains, includes 41 expert-level questions, and incorporates a large number of structured knowledge resources (e.g., 1,252 tables). We further annotate the main conclusions and key points for each question and propose three categories of evaluation metrics including general-purpose, knowledge-centric, and vision-enhanced ones. Experimental results demonstrate that HKA consistently outperforms most existing DR agents on general-purpose and knowledge-centric metrics, and even surpasses the Gemini DR agent on vision-enhanced metrics, highlighting its effectiveness in deep, structure-aware knowledge analysis. Finally, we hope this work can serve as a new foundation for structured knowledge analysis in DR agents and facilitate future multimodal DR studies.
format Preprint
id arxiv_https___arxiv_org_abs_2604_07720
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Towards Knowledgeable Deep Research: Framework and Benchmark
Liu, Wenxuan
Li, Zixuan
Bai, Long
Zhang, Chunmao
Zhang, Fenghui
Chen, Zhuo
Li, Wei
Zuo, Yuxin
Wang, Fei
Xu, Bingbing
Jiang, Xuhui
Zhang, Jin
Jin, Xiaolong
Guo, Jiafeng
Chua, Tat-Seng
Cheng, Xueqi
Artificial Intelligence
Deep Research (DR) requires LLM agents to autonomously perform multi-step information seeking, processing, and reasoning to generate comprehensive reports. In contrast to existing studies that mainly focus on unstructured web content, a more challenging DR task should additionally utilize structured knowledge to provide a solid data foundation, facilitate quantitative computation, and lead to in-depth analyses. In this paper, we refer to this novel task as Knowledgeable Deep Research (KDR), which requires DR agents to generate reports with both structured and unstructured knowledge. Furthermore, we propose the Hybrid Knowledge Analysis framework (HKA), a multi-agent architecture that reasons over both kinds of knowledge and integrates the texts, figures, and tables into coherent multimodal reports. The key design is the Structured Knowledge Analyzer, which utilizes both coding and vision-language models to produce figures, tables, and corresponding insights. To support systematic evaluation, we construct KDR-Bench, which covers 9 domains, includes 41 expert-level questions, and incorporates a large number of structured knowledge resources (e.g., 1,252 tables). We further annotate the main conclusions and key points for each question and propose three categories of evaluation metrics including general-purpose, knowledge-centric, and vision-enhanced ones. Experimental results demonstrate that HKA consistently outperforms most existing DR agents on general-purpose and knowledge-centric metrics, and even surpasses the Gemini DR agent on vision-enhanced metrics, highlighting its effectiveness in deep, structure-aware knowledge analysis. Finally, we hope this work can serve as a new foundation for structured knowledge analysis in DR agents and facilitate future multimodal DR studies.
title Towards Knowledgeable Deep Research: Framework and Benchmark
topic Artificial Intelligence
url https://arxiv.org/abs/2604.07720