Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Chen, Fengxian, Tao, Zhilong, Li, Jiaxuan, Li, Yunlong, Zhou, Qingguo
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2602.05195
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866914317790609408
author	Chen, Fengxian Tao, Zhilong Li, Jiaxuan Li, Yunlong Zhou, Qingguo
author_facet	Chen, Fengxian Tao, Zhilong Li, Jiaxuan Li, Yunlong Zhou, Qingguo
contents	Retrieval-augmented generation (RAG) promises grounded question answering, yet domain settings with multiple heterogeneous knowledge bases (KBs) remain challenging. In Chinese Tibetan medicine, encyclopedia entries are often dense and easy to match, which can dominate retrieval even when classics or clinical papers provide more authoritative evidence. We study a practical setting with three KBs (encyclopedia, classics, and clinical papers) and a 500-query benchmark (cutoff $K{=}5$) covering both single-KB and cross-KB questions. We propose two complementary methods to improve traceability, reduce hallucinations, and enable cross-KB verification. First, DAKS performs KB routing and budgeted retrieval to mitigate density-driven bias and to prioritize authoritative sources when appropriate. Second, we use an alignment graph to guide evidence fusion and coverage-aware packing, improving cross-KB evidence coverage without relying on naive concatenation. All answers are generated by a lightweight generator, \textsc{openPangu-Embedded-7B}. Experiments show consistent gains in routing quality and cross-KB evidence coverage, with the full system achieving the best CrossEv@5 while maintaining strong faithfulness and citation correctness.
format	Preprint
id	arxiv_https___arxiv_org_abs_2602_05195
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	Traceable Cross-Source RAG for Chinese Tibetan Medicine Question Answering Chen, Fengxian Tao, Zhilong Li, Jiaxuan Li, Yunlong Zhou, Qingguo Artificial Intelligence Retrieval-augmented generation (RAG) promises grounded question answering, yet domain settings with multiple heterogeneous knowledge bases (KBs) remain challenging. In Chinese Tibetan medicine, encyclopedia entries are often dense and easy to match, which can dominate retrieval even when classics or clinical papers provide more authoritative evidence. We study a practical setting with three KBs (encyclopedia, classics, and clinical papers) and a 500-query benchmark (cutoff $K{=}5$) covering both single-KB and cross-KB questions. We propose two complementary methods to improve traceability, reduce hallucinations, and enable cross-KB verification. First, DAKS performs KB routing and budgeted retrieval to mitigate density-driven bias and to prioritize authoritative sources when appropriate. Second, we use an alignment graph to guide evidence fusion and coverage-aware packing, improving cross-KB evidence coverage without relying on naive concatenation. All answers are generated by a lightweight generator, \textsc{openPangu-Embedded-7B}. Experiments show consistent gains in routing quality and cross-KB evidence coverage, with the full system achieving the best CrossEv@5 while maintaining strong faithfulness and citation correctness.
title	Traceable Cross-Source RAG for Chinese Tibetan Medicine Question Answering
topic	Artificial Intelligence
url	https://arxiv.org/abs/2602.05195

Similar Items