Guardado en:
Detalles Bibliográficos
Autores principales: Song, EuiYul, Kim, Sangryul, Lee, Haeju, Kim, Joonkee, Thorne, James
Formato: Preprint
Publicado: 2024
Materias:
Acceso en línea:https://arxiv.org/abs/2401.16979
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
_version_ 1866910341352390656
author Song, EuiYul
Kim, Sangryul
Lee, Haeju
Kim, Joonkee
Thorne, James
author_facet Song, EuiYul
Kim, Sangryul
Lee, Haeju
Kim, Joonkee
Thorne, James
contents Generative retrieval models encode pointers to information in a corpus as an index within the model's parameters. These models serve as part of a larger pipeline, where retrieved information conditions generation for knowledge-intensive NLP tasks. However, we identify two limitations: the generative retrieval does not account for contextual information. Secondly, the retrieval can't be tuned for the downstream readers as decoding the page title is a non-differentiable operation. This paper introduces Re3val, trained with generative reranking and reinforcement learning using limited data. Re3val leverages context acquired via Dense Passage Retrieval to rerank the retrieved page titles and utilizes REINFORCE to maximize rewards generated by constrained decoding. Additionally, we generate questions from our pre-training dataset to mitigate epistemic uncertainty and bridge the domain gap between the pre-training and fine-tuning datasets. Subsequently, we extract and rerank contexts from the KILT database using the rerank page titles. Upon grounding the top five reranked contexts, Re3val demonstrates the Top 1 KILT scores compared to all other generative retrieval models across five KILT datasets.
format Preprint
id arxiv_https___arxiv_org_abs_2401_16979
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Re3val: Reinforced and Reranked Generative Retrieval
Song, EuiYul
Kim, Sangryul
Lee, Haeju
Kim, Joonkee
Thorne, James
Information Retrieval
94C06
H.3.3
Generative retrieval models encode pointers to information in a corpus as an index within the model's parameters. These models serve as part of a larger pipeline, where retrieved information conditions generation for knowledge-intensive NLP tasks. However, we identify two limitations: the generative retrieval does not account for contextual information. Secondly, the retrieval can't be tuned for the downstream readers as decoding the page title is a non-differentiable operation. This paper introduces Re3val, trained with generative reranking and reinforcement learning using limited data. Re3val leverages context acquired via Dense Passage Retrieval to rerank the retrieved page titles and utilizes REINFORCE to maximize rewards generated by constrained decoding. Additionally, we generate questions from our pre-training dataset to mitigate epistemic uncertainty and bridge the domain gap between the pre-training and fine-tuning datasets. Subsequently, we extract and rerank contexts from the KILT database using the rerank page titles. Upon grounding the top five reranked contexts, Re3val demonstrates the Top 1 KILT scores compared to all other generative retrieval models across five KILT datasets.
title Re3val: Reinforced and Reranked Generative Retrieval
topic Information Retrieval
94C06
H.3.3
url https://arxiv.org/abs/2401.16979