MA4DIV: Multi-Agent Reinforcement Learning for Search Result Diversification
Abstract
The objective of search result diversification (SRD) is to ensure that the selected documents cover as many different subtopics as possible. Existing methods primarily follow a "greedy selection" paradigm, i.e., selecting one document with the highest diversity score at a time. These approaches tend to be inefficient and are easily trapped in suboptimal states. Other methods aim to approximately optimize a diversity metric, such as $\alpha$-NDCG, but the results still remain suboptimal. To address these challenges, we introduce Multi-Agent reinforcement learning (MARL) for search result DIVersity, called MA4DIV. In this approach, each document is an agent, and search result diversification is modeled as a cooperative task among multiple agents. This formulation allows diversity metrics such as $\alpha$-NDCG to be optimized directly while achieving high training efficiency. We conduct preliminary experiments on public TREC datasets to demonstrate the effectiveness and potential of MA4DIV. Given the limited number of queries in the public TREC datasets, we also construct a large-scale dataset from industrial sources and show that MA4DIV achieves substantial improvements in both effectiveness and efficiency over existing baselines on this industrial-scale dataset.
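For context, the sketch below shows how the $\alpha$-NDCG metric mentioned in the abstract is typically computed (Clarke et al.'s definition), which is the kind of set-level diversity score MA4DIV can use directly as a shared reward. This is only an illustrative sketch; the function names, the subtopic-judgment format, and the greedy ideal-ranking approximation are assumptions and not taken from the paper itself.

```python
import math

def alpha_dcg(ranking, judgments, alpha=0.5, k=10):
    """alpha-DCG@k: judgments[d] is the set of subtopic ids covered by document d."""
    covered = {}  # subtopic id -> number of times already covered above this rank
    score = 0.0
    for i, d in enumerate(ranking[:k], start=1):
        # gain of a subtopic decays by (1 - alpha) each time it reappears
        gain = sum((1 - alpha) ** covered.get(t, 0) for t in judgments.get(d, ()))
        score += gain / math.log2(i + 1)
        for t in judgments.get(d, ()):
            covered[t] = covered.get(t, 0) + 1
    return score

def ideal_alpha_dcg(docs, judgments, alpha=0.5, k=10):
    """Greedy approximation of the ideal alpha-DCG@k (the exact ideal is NP-hard)."""
    remaining, ranking = list(docs), []
    for _ in range(min(k, len(remaining))):
        best = max(remaining, key=lambda d: alpha_dcg(ranking + [d], judgments, alpha, k))
        ranking.append(best)
        remaining.remove(best)
    return alpha_dcg(ranking, judgments, alpha, k)

def alpha_ndcg(ranking, judgments, alpha=0.5, k=10):
    ideal = ideal_alpha_dcg(list(judgments), judgments, alpha, k)
    return alpha_dcg(ranking, judgments, alpha, k) / ideal if ideal > 0 else 0.0

# Toy example with two subtopics: the diverse ordering scores higher.
judgments = {"d1": {0}, "d2": {0}, "d3": {1}}
print(alpha_ndcg(["d1", "d3", "d2"], judgments, k=3))  # ~1.00 (diverse)
print(alpha_ndcg(["d1", "d2", "d3"], judgments, k=3))  # ~0.97 (redundant)
```

In a MARL formulation like the one described, such a set-level score would be returned once per query as a cooperative reward shared by all document agents, rather than being built up through greedy per-document selection.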
- Publication:
- arXiv e-prints
- Pub Date:
- March 2024
- DOI:
- 10.48550/arXiv.2403.17421
- arXiv:
- arXiv:2403.17421
- Bibcode:
- 2024arXiv240317421C
- Keywords:
- Computer Science - Information Retrieval;
- Computer Science - Artificial Intelligence