Vista Equipo: :: Library Catalog

Guardado en:

Detalles Bibliográficos
Autores principales:	Song, EuiYul, Kim, Sangryul, Lee, Haeju, Kim, Joonkee, Thorne, James
Formato:	Preprint
Publicado:	2024
Materias:	Information Retrieval 94C06 H.3.3
Acceso en línea:	https://arxiv.org/abs/2401.16979
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

_version_	1866910341352390656
author	Song, EuiYul Kim, Sangryul Lee, Haeju Kim, Joonkee Thorne, James
author_facet	Song, EuiYul Kim, Sangryul Lee, Haeju Kim, Joonkee Thorne, James
contents	Generative retrieval models encode pointers to information in a corpus as an index within the model's parameters. These models serve as part of a larger pipeline, where retrieved information conditions generation for knowledge-intensive NLP tasks. However, we identify two limitations: the generative retrieval does not account for contextual information. Secondly, the retrieval can't be tuned for the downstream readers as decoding the page title is a non-differentiable operation. This paper introduces Re3val, trained with generative reranking and reinforcement learning using limited data. Re3val leverages context acquired via Dense Passage Retrieval to rerank the retrieved page titles and utilizes REINFORCE to maximize rewards generated by constrained decoding. Additionally, we generate questions from our pre-training dataset to mitigate epistemic uncertainty and bridge the domain gap between the pre-training and fine-tuning datasets. Subsequently, we extract and rerank contexts from the KILT database using the rerank page titles. Upon grounding the top five reranked contexts, Re3val demonstrates the Top 1 KILT scores compared to all other generative retrieval models across five KILT datasets.
format	Preprint
id	arxiv_https___arxiv_org_abs_2401_16979
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Re3val: Reinforced and Reranked Generative Retrieval Song, EuiYul Kim, Sangryul Lee, Haeju Kim, Joonkee Thorne, James Information Retrieval 94C06 H.3.3 Generative retrieval models encode pointers to information in a corpus as an index within the model's parameters. These models serve as part of a larger pipeline, where retrieved information conditions generation for knowledge-intensive NLP tasks. However, we identify two limitations: the generative retrieval does not account for contextual information. Secondly, the retrieval can't be tuned for the downstream readers as decoding the page title is a non-differentiable operation. This paper introduces Re3val, trained with generative reranking and reinforcement learning using limited data. Re3val leverages context acquired via Dense Passage Retrieval to rerank the retrieved page titles and utilizes REINFORCE to maximize rewards generated by constrained decoding. Additionally, we generate questions from our pre-training dataset to mitigate epistemic uncertainty and bridge the domain gap between the pre-training and fine-tuning datasets. Subsequently, we extract and rerank contexts from the KILT database using the rerank page titles. Upon grounding the top five reranked contexts, Re3val demonstrates the Top 1 KILT scores compared to all other generative retrieval models across five KILT datasets.
title	Re3val: Reinforced and Reranked Generative Retrieval
topic	Information Retrieval 94C06 H.3.3
url	https://arxiv.org/abs/2401.16979

Ejemplares similares