Saved in:
Bibliographic Details
Main Authors: Chen, Yiqun, Mao, Jiaxin, Zhang, Yi, Ma, Dehong, Xia, Long, Fan, Jun, Shi, Daiting, Cheng, Zhicong, Gu, Simiu, Yin, Dawei
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2403.17421
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866912221574987776
author Chen, Yiqun
Mao, Jiaxin
Zhang, Yi
Ma, Dehong
Xia, Long
Fan, Jun
Shi, Daiting
Cheng, Zhicong
Gu, Simiu
Yin, Dawei
author_facet Chen, Yiqun
Mao, Jiaxin
Zhang, Yi
Ma, Dehong
Xia, Long
Fan, Jun
Shi, Daiting
Cheng, Zhicong
Gu, Simiu
Yin, Dawei
contents Search result diversification (SRD), which aims to ensure that documents in a ranking list cover a broad range of subtopics, is a significant and widely studied problem in Information Retrieval and Web Search. Existing methods primarily utilize a paradigm of "greedy selection", i.e., selecting one document with the highest diversity score at a time or optimize an approximation of the objective function. These approaches tend to be inefficient and are easily trapped in a suboptimal state. To address these challenges, we introduce Multi-Agent reinforcement learning (MARL) for search result DIVersity, which called MA4DIV. In this approach, each document is an agent and the search result diversification is modeled as a cooperative task among multiple agents. By modeling the SRD ranking problem as a cooperative MARL problem, this approach allows for directly optimizing the diversity metrics, such as $α$-NDCG, while achieving high training efficiency. We conducted experiments on public TREC datasets and a larger scale dataset in the industrial setting. The experiemnts show that MA4DIV achieves substantial improvements in both effectiveness and efficiency than existing baselines, especially on the industrial dataset. The code of MA4DIV can be seen on https://github.com/chenyiqun/MA4DIV.
format Preprint
id arxiv_https___arxiv_org_abs_2403_17421
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle MA4DIV: Multi-Agent Reinforcement Learning for Search Result Diversification
Chen, Yiqun
Mao, Jiaxin
Zhang, Yi
Ma, Dehong
Xia, Long
Fan, Jun
Shi, Daiting
Cheng, Zhicong
Gu, Simiu
Yin, Dawei
Information Retrieval
Artificial Intelligence
Search result diversification (SRD), which aims to ensure that documents in a ranking list cover a broad range of subtopics, is a significant and widely studied problem in Information Retrieval and Web Search. Existing methods primarily utilize a paradigm of "greedy selection", i.e., selecting one document with the highest diversity score at a time or optimize an approximation of the objective function. These approaches tend to be inefficient and are easily trapped in a suboptimal state. To address these challenges, we introduce Multi-Agent reinforcement learning (MARL) for search result DIVersity, which called MA4DIV. In this approach, each document is an agent and the search result diversification is modeled as a cooperative task among multiple agents. By modeling the SRD ranking problem as a cooperative MARL problem, this approach allows for directly optimizing the diversity metrics, such as $α$-NDCG, while achieving high training efficiency. We conducted experiments on public TREC datasets and a larger scale dataset in the industrial setting. The experiemnts show that MA4DIV achieves substantial improvements in both effectiveness and efficiency than existing baselines, especially on the industrial dataset. The code of MA4DIV can be seen on https://github.com/chenyiqun/MA4DIV.
title MA4DIV: Multi-Agent Reinforcement Learning for Search Result Diversification
topic Information Retrieval
Artificial Intelligence
url https://arxiv.org/abs/2403.17421