Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Z., Erwin D. López, Tang, Cheng, Shimada, Atsushi
Format: Preprint
Veröffentlicht: 2024
Schlagworte:
Online-Zugang:https://arxiv.org/abs/2409.10907
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
_version_ 1866912157696786432
author Z., Erwin D. López
Tang, Cheng
Shimada, Atsushi
author_facet Z., Erwin D. López
Tang, Cheng
Shimada, Atsushi
contents This paper proposes Attention-Seeker, an unsupervised keyphrase extraction method that leverages self-attention maps from a Large Language Model to estimate the importance of candidate phrases. Our approach identifies specific components - such as layers, heads, and attention vectors - where the model pays significant attention to the key topics of the text. The attention weights provided by these components are then used to score the candidate phrases. Unlike previous models that require manual tuning of parameters (e.g., selection of heads, prompts, hyperparameters), Attention-Seeker dynamically adapts to the input text without any manual adjustments, enhancing its practical applicability. We evaluate Attention-Seeker on four publicly available datasets: Inspec, SemEval2010, SemEval2017, and Krapivin. Our results demonstrate that, even without parameter tuning, Attention-Seeker outperforms most baseline models, achieving state-of-the-art performance on three out of four datasets, particularly excelling in extracting keyphrases from long documents.
format Preprint
id arxiv_https___arxiv_org_abs_2409_10907
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Attention-Seeker: Dynamic Self-Attention Scoring for Unsupervised Keyphrase Extraction
Z., Erwin D. López
Tang, Cheng
Shimada, Atsushi
Computation and Language
Information Retrieval
This paper proposes Attention-Seeker, an unsupervised keyphrase extraction method that leverages self-attention maps from a Large Language Model to estimate the importance of candidate phrases. Our approach identifies specific components - such as layers, heads, and attention vectors - where the model pays significant attention to the key topics of the text. The attention weights provided by these components are then used to score the candidate phrases. Unlike previous models that require manual tuning of parameters (e.g., selection of heads, prompts, hyperparameters), Attention-Seeker dynamically adapts to the input text without any manual adjustments, enhancing its practical applicability. We evaluate Attention-Seeker on four publicly available datasets: Inspec, SemEval2010, SemEval2017, and Krapivin. Our results demonstrate that, even without parameter tuning, Attention-Seeker outperforms most baseline models, achieving state-of-the-art performance on three out of four datasets, particularly excelling in extracting keyphrases from long documents.
title Attention-Seeker: Dynamic Self-Attention Scoring for Unsupervised Keyphrase Extraction
topic Computation and Language
Information Retrieval
url https://arxiv.org/abs/2409.10907