Personalization for Web-based Services using Offline Reinforcement Learning
Abstract
Large-scale Web-based services present opportunities for improving UI policies based on observed user interactions. We address challenges of learning such policies through model-free offline Reinforcement Learning (RL) with off-policy training. Deployed in a production system for user authentication in a major social network, it significantly improves long-term objectives. We articulate practical challenges, compare several ML techniques, provide insights on training and evaluation of RL models, and discuss generalizations.
- Publication:
-
arXiv e-prints
- Pub Date:
- February 2021
- DOI:
- 10.48550/arXiv.2102.05612
- arXiv:
- arXiv:2102.05612
- Bibcode:
- 2021arXiv210205612A
- Keywords:
-
- Computer Science - Machine Learning;
- Computer Science - Human-Computer Interaction;
- Computer Science - Software Engineering
- E-Print:
- 9 pages, 8 figures, 3 tables