Personalization for Web-based Services using Offline Reinforcement Learning

doi:10.48550/arXiv.2102.05612

Personalization for Web-based Services using Offline Reinforcement Learning

Large-scale Web-based services present opportunities for improving UI policies based on observed user interactions. We address challenges of learning such policies through model-free offline Reinforcement Learning (RL) with off-policy training. Deployed in a production system for user authentication in a major social network, it significantly improves long-term objectives. We articulate practical challenges, compare several ML techniques, provide insights on training and evaluation of RL models, and discuss generalizations.

Publication:

arXiv e-prints

Pub Date:

February 2021

DOI:

10.48550/arXiv.2102.05612

arXiv:

arXiv:2102.05612

Bibcode:

2021arXiv210205612A

Keywords:

Computer Science - Machine Learning;
Computer Science - Human-Computer Interaction;
Computer Science - Software Engineering

E-Print:

9 pages, 8 figures, 3 tables

NASA/ADS

Personalization for Web-based Services using Offline Reinforcement Learning

Abstract