UNIMARC/MARC: :: Library Catalog

Uloženo v:

Podrobná bibliografie
Hlavní autoři:	Arabzadeh, Negar, Clarke, Charles L. A.
Médium:	Preprint
Vydáno:	2024
Témata:	Information Retrieval
On-line přístup:	https://arxiv.org/abs/2401.17543
Tagy:	Přidat tag Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!

_version_	1866909111318216704
author	Arabzadeh, Negar Clarke, Charles L. A.
author_facet	Arabzadeh, Negar Clarke, Charles L. A.
contents	The rapid advancement of natural language processing, information retrieval (IR), computer vision, and other technologies has presented significant challenges in evaluating the performance of these systems. One of the main challenges is the scarcity of human-labeled data, which hinders the fair and accurate assessment of these systems. In this work, we specifically focus on evaluating IR systems with sparse labels, borrowing from recent research on evaluating computer vision tasks. taking inspiration from the success of using Fréchet Inception Distance (FID) in assessing text-to-image generation systems. We propose leveraging the Fréchet Distance to measure the distance between the distributions of relevant judged items and retrieved results. Our experimental results on MS MARCO V1 dataset and TREC Deep Learning Tracks query sets demonstrate the effectiveness of the Fréchet Distance as a metric for evaluating IR systems, particularly in settings where a few labels are available. This approach contributes to the advancement of evaluation methodologies in real-world scenarios such as the assessment of generative IR systems.
format	Preprint
id	arxiv_https___arxiv_org_abs_2401_17543
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Fréchet Distance for Offline Evaluation of Information Retrieval Systems with Sparse Labels Arabzadeh, Negar Clarke, Charles L. A. Information Retrieval The rapid advancement of natural language processing, information retrieval (IR), computer vision, and other technologies has presented significant challenges in evaluating the performance of these systems. One of the main challenges is the scarcity of human-labeled data, which hinders the fair and accurate assessment of these systems. In this work, we specifically focus on evaluating IR systems with sparse labels, borrowing from recent research on evaluating computer vision tasks. taking inspiration from the success of using Fréchet Inception Distance (FID) in assessing text-to-image generation systems. We propose leveraging the Fréchet Distance to measure the distance between the distributions of relevant judged items and retrieved results. Our experimental results on MS MARCO V1 dataset and TREC Deep Learning Tracks query sets demonstrate the effectiveness of the Fréchet Distance as a metric for evaluating IR systems, particularly in settings where a few labels are available. This approach contributes to the advancement of evaluation methodologies in real-world scenarios such as the assessment of generative IR systems.
title	Fréchet Distance for Offline Evaluation of Information Retrieval Systems with Sparse Labels
topic	Information Retrieval
url	https://arxiv.org/abs/2401.17543

Podobné jednotky