Convergence Acceleration of Markov Chain Monte Carlo-Based Gradient Descent by Deep Unfolding
Abstract
This study proposes a trainable sampling-based solver for combinatorial optimization problems (COPs) using a deep-learning technique called deep unfolding. The proposed solver is based on the Ohzeki method that combines Markov-chain Monte-Carlo (MCMC) and gradient descent, and its step sizes are trained by minimizing a loss function. In the training process, we propose a sampling-based gradient estimation that substitutes auto-differentiation with a variance estimation, thereby circumventing the failure of back propagation due to the non-differentiability of MCMC. The numerical results for a few COPs demonstrated that the proposed solver significantly accelerated the convergence speed compared with the original Ohzeki method.
- Publication:
-
Journal of the Physical Society of Japan
- Pub Date:
- June 2024
- DOI:
- 10.7566/JPSJ.93.063801
- arXiv:
- arXiv:2402.13608
- Bibcode:
- 2024JPSJ...93f3801H
- Keywords:
-
- Condensed Matter - Disordered Systems and Neural Networks;
- Computer Science - Machine Learning;
- Statistics - Machine Learning
- E-Print:
- 10 pages, 5 figures