Staff View: Retrieval Augmented Generation Evaluation for Health Documents :: Library Catalog

Bibliographic Details
Main Authors:	Ceresa, Mario; Bertolini, Lorenzo; Comte, Valentin; Spadaro, Nicholas; Raffael, Barbara; Toussaint, Brigitte; Consoli, Sergio; Piñeiro, Amalia Muñoz; Patak, Alex; Querci, Maddalena; Wiesenthal, Tobias
Format:	Preprint
Published:	2025
Subjects:	Information Retrieval
Online Access:	https://arxiv.org/abs/2505.04680

_version_	1866910932316192768
author	Ceresa, Mario; Bertolini, Lorenzo; Comte, Valentin; Spadaro, Nicholas; Raffael, Barbara; Toussaint, Brigitte; Consoli, Sergio; Piñeiro, Amalia Muñoz; Patak, Alex; Querci, Maddalena; Wiesenthal, Tobias
author_facet	Ceresa, Mario; Bertolini, Lorenzo; Comte, Valentin; Spadaro, Nicholas; Raffael, Barbara; Toussaint, Brigitte; Consoli, Sergio; Piñeiro, Amalia Muñoz; Patak, Alex; Querci, Maddalena; Wiesenthal, Tobias
contents	Safe and trustworthy use of Large Language Models (LLMs) in the processing of healthcare documents and scientific papers could substantially help clinicians, scientists and policymakers overcome information overload and focus on the most relevant information at a given moment. Retrieval Augmented Generation (RAG) is a promising method to leverage the potential of LLMs while enhancing the accuracy of their outputs. This report assesses the potential and shortcomings of such approaches in the automatic knowledge synthesis of different types of documents in the health domain. To this end, it describes: (1) RAGEv (Retrieval Augmented Generation Evaluation), an internally developed proof-of-concept pipeline that employs state-of-the-art practices to deliver safe and trustworthy analysis of healthcare documents and scientific papers; (2) a set of evaluation tools for LLM-based document retrieval and generation; (3) RAGEv-Bench, a benchmark dataset to verify the accuracy and veracity of the results. It concludes that careful implementations of RAG techniques could minimize most of the common problems in the use of LLMs for document processing in the health domain, obtaining very high scores on both short yes/no answers and long answers. There is high potential for incorporating such techniques into day-to-day policy support work, but additional efforts are required to obtain a consistent and trustworthy tool.
format	Preprint
id	arxiv_https___arxiv_org_abs_2505_04680
institution	arXiv
publishDate	2025
record_format	arxiv
title	Retrieval Augmented Generation Evaluation for Health Documents
topic	Information Retrieval
url	https://arxiv.org/abs/2505.04680