Bibliographic Details
Main Author: Harris, Lee
Format: Preprint
Published: 2025
Online Access: https://arxiv.org/abs/2507.22921
_version_ 1866912511395102720
author Harris, Lee
author_facet Harris, Lee
contents Language models can capture complex relationships in a given text, but they are notoriously costly and prone to producing information that does not exist (i.e., hallucinations). Moreover, the resources invested in producing this information are wasted if it is incorrect. We address these issues by proposing, implementing, and applying the Language Model Chain (LMC) algorithm. In the LMC algorithm, a language model's response to a given prompt about a given text is treated as correct only if it exists in the collection of possible (i.e., candidate) answers, and the text corresponding to incorrect responses is fed into a more predictive (but slower) language model. This process is repeated for each language model in a collection, or until all predictions about the text are correct. We used the LMC algorithm to extract patient dates of birth from medical documents. Combining a collection of language models in a multi-stage cascade significantly increased prediction speed and accuracy over individual language models, while greatly reducing the number of corresponding hallucinations. We believe that the novel LMC algorithm contributes significantly to the knowledge extraction field and that it should be explored further in future work.
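The cascade described in the abstract can be illustrated with a minimal Python sketch. This is not the author's implementation: the names `run_chain`, `date_candidates`, `fast_model`, and `slow_model` are hypothetical, the two "models" are toy regex stand-ins for real language models, and the assumption that candidate answers are the ISO-style dates found in the text is made only for illustration.

```python
import re
from typing import Callable, Dict, List, Sequence, Set, Tuple

# Hypothetical model interface: (prompt, text) -> answer string.
Model = Callable[[str, str], str]

# Assumption: candidate answers are the ISO-style dates that appear in the text.
DATE = re.compile(r"\b\d{4}-\d{2}-\d{2}\b")


def date_candidates(text: str) -> Set[str]:
    """Collect every date-like string in the text as a candidate answer."""
    return set(DATE.findall(text))


def run_chain(
    models: Sequence[Model],
    prompt: str,
    texts: Sequence[str],
    candidates_for: Callable[[str], Set[str]],
) -> Tuple[Dict[int, str], List[int]]:
    """Run models from fastest to slowest, escalating only rejected texts."""
    accepted: Dict[int, str] = {}
    pending: List[int] = list(range(len(texts)))
    for model in models:
        if not pending:  # every prediction was accepted; stop early
            break
        still_pending: List[int] = []
        for i in pending:
            answer = model(prompt, texts[i])
            if answer in candidates_for(texts[i]):
                accepted[i] = answer  # answer exists among the candidates
            else:
                still_pending.append(i)  # pass this text to the next model
        pending = still_pending
    return accepted, pending


def fast_model(prompt: str, text: str) -> str:
    """Toy stand-in for a fast model: only understands the 'DOB:' label."""
    m = re.search(r"DOB:\s*(\d{4}-\d{2}-\d{2})", text)
    return m.group(1) if m else "1900-01-01"  # made-up answer (hallucination)


def slow_model(prompt: str, text: str) -> str:
    """Toy stand-in for a slower, more capable model."""
    m = re.search(r"(?:DOB:|born)\s*(\d{4}-\d{2}-\d{2})", text)
    return m.group(1) if m else "1900-01-01"  # made-up answer (hallucination)


texts = [
    "Patient A. DOB: 1980-02-03. Admitted 2024-05-06.",
    "Patient B, born 1975-11-30, seen 2024-01-15.",
    "Patient C. No birth date recorded.",
]
accepted, pending = run_chain(
    [fast_model, slow_model], "What is the patient's date of birth?", texts, date_candidates
)
print(accepted)  # texts whose answer passed the candidate check
print(pending)   # texts that no model in the chain could answer acceptably
```

In this sketch the slower model is called only for texts whose first answer was rejected, which is where a cascade could save time. The candidate check filters out answers that do not occur in the text; it cannot by itself tell a date of birth apart from another date that does occur there.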
format Preprint
id arxiv_https___arxiv_org_abs_2507_22921
institution arXiv
publishDate 2025
record_format arxiv
title Fast and Accurate Contextual Knowledge Extraction Using Cascading Language Model Chains and Candidate Answers
topic Computation and Language
Artificial Intelligence
Information Retrieval
url https://arxiv.org/abs/2507.22921