Saved in:
Bibliographic Details
Main Authors: Xin, Amy, Qi, Yunjia, Yao, Zijun, Zhu, Fangwei, Zeng, Kaisheng, Bin, Xu, Hou, Lei, Li, Juanzi
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2407.04020
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914057636806656
author Xin, Amy
Qi, Yunjia
Yao, Zijun
Zhu, Fangwei
Zeng, Kaisheng
Bin, Xu
Hou, Lei
Li, Juanzi
author_facet Xin, Amy
Qi, Yunjia
Yao, Zijun
Zhu, Fangwei
Zeng, Kaisheng
Bin, Xu
Hou, Lei
Li, Juanzi
contents Specialized entity linking (EL) models are well-trained at mapping mentions to unique knowledge base (KB) entities according to a given context. However, specialized EL models struggle to disambiguate long-tail entities due to their limited training data. Meanwhile, extensively pre-trained large language models (LLMs) possess broader knowledge of uncommon entities. Yet, with a lack of specialized EL training, LLMs frequently fail to generate accurate KB entity names, limiting their standalone effectiveness in EL. With the observation that LLMs are more adept at context generation instead of EL execution, we introduce LLM-Augmented Entity Linking (LLMAEL), the first framework to enhance specialized EL models with LLM data augmentation. LLMAEL leverages off-the-shelf, tuning-free LLMs as context augmenters, generating entity descriptions to serve as additional input for specialized EL models. Experiments show that LLMAEL sets new state-of-the-art results across 6 widely adopted EL benchmarks: compared to prior methods that integrate tuning-free LLMs into EL, LLMAEL achieves an absolute 8.9% gain in EL accuracy. We release our code and datasets.
format Preprint
id arxiv_https___arxiv_org_abs_2407_04020
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle LLMAEL: Large Language Models are Good Context Augmenters for Entity Linking
Xin, Amy
Qi, Yunjia
Yao, Zijun
Zhu, Fangwei
Zeng, Kaisheng
Bin, Xu
Hou, Lei
Li, Juanzi
Computation and Language
Specialized entity linking (EL) models are well-trained at mapping mentions to unique knowledge base (KB) entities according to a given context. However, specialized EL models struggle to disambiguate long-tail entities due to their limited training data. Meanwhile, extensively pre-trained large language models (LLMs) possess broader knowledge of uncommon entities. Yet, with a lack of specialized EL training, LLMs frequently fail to generate accurate KB entity names, limiting their standalone effectiveness in EL. With the observation that LLMs are more adept at context generation instead of EL execution, we introduce LLM-Augmented Entity Linking (LLMAEL), the first framework to enhance specialized EL models with LLM data augmentation. LLMAEL leverages off-the-shelf, tuning-free LLMs as context augmenters, generating entity descriptions to serve as additional input for specialized EL models. Experiments show that LLMAEL sets new state-of-the-art results across 6 widely adopted EL benchmarks: compared to prior methods that integrate tuning-free LLMs into EL, LLMAEL achieves an absolute 8.9% gain in EL accuracy. We release our code and datasets.
title LLMAEL: Large Language Models are Good Context Augmenters for Entity Linking
topic Computation and Language
url https://arxiv.org/abs/2407.04020