Budget Optimization for Sponsored Search: Censored Learning in MDPs
Abstract
We consider the budget optimization problem faced by an advertiser participating in repeated sponsored search auctions, seeking to maximize the number of clicks attained under that budget. We cast the budget optimization problem as a Markov Decision Process (MDP) with censored observations, and propose a learning algorithm based on the well-known Kaplan-Meier (product-limit) estimator. We validate the performance of this algorithm by comparing it to several others on a large set of search auction data from Microsoft adCenter, demonstrating fast convergence to optimal performance.
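For readers unfamiliar with the product-limit estimator named in the abstract, the following is a minimal Python sketch of the textbook Kaplan-Meier estimator on right-censored data. It is not the paper's learning algorithm; the function name `kaplan_meier` and the `(value, observed)` input format are assumptions chosen for illustration only.

```python
from collections import Counter


def kaplan_meier(observations):
    """Textbook Kaplan-Meier (product-limit) estimate of a survival function.

    observations: list of (value, observed) pairs (hypothetical format).
      observed=True  -> the event was seen exactly at `value`
      observed=False -> right-censored: the event lies somewhere beyond `value`
    Returns a list of (t, S(t)) pairs, one per distinct event time.
    """
    events = Counter(v for v, observed in observations if observed)
    totals = Counter(v for v, _ in observations)

    at_risk = len(observations)  # samples still "alive" just before time t
    survival = 1.0
    curve = []
    for t in sorted(totals):
        d = events.get(t, 0)
        if d > 0:
            # Product-limit step: multiply by the conditional survival at t.
            survival *= 1.0 - d / at_risk
            curve.append((t, survival))
        # Everything recorded at t (event or censored) leaves the risk set.
        at_risk -= totals[t]
    return curve


if __name__ == "__main__":
    data = [(1, True), (2, False), (3, True), (4, True)]
    for t, s in kaplan_meier(data):
        print(t, s)
```

The censored sample at value 2 never triggers a drop in the curve, but it still shrinks the risk set, so the event at 3 carries more weight than it would in a naive empirical distribution.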
- Publication: arXiv e-prints
- Pub Date: October 2012
- DOI: 10.48550/arXiv.1210.4847
- arXiv: arXiv:1210.4847
- Bibcode: 2012arXiv1210.4847A
- Keywords: Computer Science - Computer Science and Game Theory
- E-Print: Appears in Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence (UAI2012)