Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Mansouri, Sasan, Pilla, Edoardo, Wahrenburg, Mark, Woebbeking, Fabian
Format:	Preprint
Published:	2026
Subjects:	Information Retrieval Artificial Intelligence I.2.7
Online Access:	https://arxiv.org/abs/2603.20316
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866910061756940288
author	Mansouri, Sasan Pilla, Edoardo Wahrenburg, Mark Woebbeking, Fabian
author_facet	Mansouri, Sasan Pilla, Edoardo Wahrenburg, Mark Woebbeking, Fabian
contents	Answering financial questions is often treated as an information retrieval problem. In practice, however, much of the relevant information is already available in curated vendor systems, especially for quantitative analysis. We study whether, and under which conditions, Model Context Protocol (MCP) offers a more reliable alternative to standard retrieval-augmented generation (RAG) by allowing large language models (LLMs) to interact directly with data rather than relying on document ingestion and chunk retrieval. We test this by building a custom MCP server that exposes LSEG APIs as tools and evaluating it on the FinDER benchmark. The approach performs particularly well on the Financials subset, achieving up to 80.4% accuracy on multi-step numerical questions when relevant context is retrieved. The paper thus provides both a baseline for MCP-based financial question answering (QA) and evidence on where this approach breaks down, such as for questions requiring qualitative or document-specific context. Overall, direct access to curated data is a lightweight and effective alternative to document-centric RAG for quantitative financial QA, but not a substitute for all financial QA tasks.
format	Preprint
id	arxiv_https___arxiv_org_abs_2603_20316
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	Bypassing Document Ingestion: An MCP Approach to Financial Q&A Mansouri, Sasan Pilla, Edoardo Wahrenburg, Mark Woebbeking, Fabian Information Retrieval Artificial Intelligence I.2.7 Answering financial questions is often treated as an information retrieval problem. In practice, however, much of the relevant information is already available in curated vendor systems, especially for quantitative analysis. We study whether, and under which conditions, Model Context Protocol (MCP) offers a more reliable alternative to standard retrieval-augmented generation (RAG) by allowing large language models (LLMs) to interact directly with data rather than relying on document ingestion and chunk retrieval. We test this by building a custom MCP server that exposes LSEG APIs as tools and evaluating it on the FinDER benchmark. The approach performs particularly well on the Financials subset, achieving up to 80.4% accuracy on multi-step numerical questions when relevant context is retrieved. The paper thus provides both a baseline for MCP-based financial question answering (QA) and evidence on where this approach breaks down, such as for questions requiring qualitative or document-specific context. Overall, direct access to curated data is a lightweight and effective alternative to document-centric RAG for quantitative financial QA, but not a substitute for all financial QA tasks.
title	Bypassing Document Ingestion: An MCP Approach to Financial Q&A
topic	Information Retrieval Artificial Intelligence I.2.7
url	https://arxiv.org/abs/2603.20316

Similar Items