Saved in:
Bibliographic Details
Main Authors: Mansouri, Sasan, Pilla, Edoardo, Wahrenburg, Mark, Woebbeking, Fabian
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2603.20316
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866910061756940288
author Mansouri, Sasan
Pilla, Edoardo
Wahrenburg, Mark
Woebbeking, Fabian
author_facet Mansouri, Sasan
Pilla, Edoardo
Wahrenburg, Mark
Woebbeking, Fabian
contents Answering financial questions is often treated as an information retrieval problem. In practice, however, much of the relevant information is already available in curated vendor systems, especially for quantitative analysis. We study whether, and under which conditions, Model Context Protocol (MCP) offers a more reliable alternative to standard retrieval-augmented generation (RAG) by allowing large language models (LLMs) to interact directly with data rather than relying on document ingestion and chunk retrieval. We test this by building a custom MCP server that exposes LSEG APIs as tools and evaluating it on the FinDER benchmark. The approach performs particularly well on the Financials subset, achieving up to 80.4% accuracy on multi-step numerical questions when relevant context is retrieved. The paper thus provides both a baseline for MCP-based financial question answering (QA) and evidence on where this approach breaks down, such as for questions requiring qualitative or document-specific context. Overall, direct access to curated data is a lightweight and effective alternative to document-centric RAG for quantitative financial QA, but not a substitute for all financial QA tasks.
format Preprint
id arxiv_https___arxiv_org_abs_2603_20316
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Bypassing Document Ingestion: An MCP Approach to Financial Q&A
Mansouri, Sasan
Pilla, Edoardo
Wahrenburg, Mark
Woebbeking, Fabian
Information Retrieval
Artificial Intelligence
I.2.7
Answering financial questions is often treated as an information retrieval problem. In practice, however, much of the relevant information is already available in curated vendor systems, especially for quantitative analysis. We study whether, and under which conditions, Model Context Protocol (MCP) offers a more reliable alternative to standard retrieval-augmented generation (RAG) by allowing large language models (LLMs) to interact directly with data rather than relying on document ingestion and chunk retrieval. We test this by building a custom MCP server that exposes LSEG APIs as tools and evaluating it on the FinDER benchmark. The approach performs particularly well on the Financials subset, achieving up to 80.4% accuracy on multi-step numerical questions when relevant context is retrieved. The paper thus provides both a baseline for MCP-based financial question answering (QA) and evidence on where this approach breaks down, such as for questions requiring qualitative or document-specific context. Overall, direct access to curated data is a lightweight and effective alternative to document-centric RAG for quantitative financial QA, but not a substitute for all financial QA tasks.
title Bypassing Document Ingestion: An MCP Approach to Financial Q&A
topic Information Retrieval
Artificial Intelligence
I.2.7
url https://arxiv.org/abs/2603.20316