A Simple Convergence Time Analysis of Drift-Plus-Penalty for Stochastic Optimization and Convex Programs
Abstract
This paper considers the problem of minimizing the time average of a stochastic process subject to time average constraints on other processes. A canonical example is minimizing average power in a data network subject to multi-user throughput constraints. Another example is a (static) convex program. Under a Slater condition, the drift-plus-penalty algorithm is known to provide an $O(\epsilon)$ approximation to optimality with a convergence time of $O(1/\epsilon^2)$. This paper proves the same result with a simpler technique and in a more general context that does not require the Slater condition. This paper also emphasizes application to basic convex programs, linear programs, and distributed optimization problems.
- Publication:
-
arXiv e-prints
- Pub Date:
- December 2014
- DOI:
- 10.48550/arXiv.1412.0791
- arXiv:
- arXiv:1412.0791
- Bibcode:
- 2014arXiv1412.0791N
- Keywords:
-
- Mathematics - Optimization and Control
- E-Print:
- 10 pages