Policy Gradients for Optimal Parallel Tempering MCMC

doi:10.48550/arXiv.2409.01574

Policy Gradients for Optimal Parallel Tempering MCMC

Parallel tempering is a meta-algorithm for Markov Chain Monte Carlo that uses multiple chains to sample from tempered versions of the target distribution, enhancing mixing in multi-modal distributions that are challenging for traditional methods. The effectiveness of parallel tempering is heavily influenced by the selection of chain temperatures. Here, we present an adaptive temperature selection algorithm that dynamically adjusts temperatures during sampling using a policy gradient approach. Experiments demonstrate that our method can achieve lower integrated autocorrelation times compared to traditional geometrically spaced temperatures and uniform acceptance rate schemes on benchmark distributions.

Publication:

arXiv e-prints

Pub Date:

September 2024

DOI:

10.48550/arXiv.2409.01574

arXiv:

arXiv:2409.01574

Bibcode:

2024arXiv240901574Z

Keywords:

Statistics - Computation;
Computer Science - Machine Learning;
Statistics - Machine Learning

E-Print:

12 pages, 5 figures, accepted to ICML 2024 Workshop on Structured Probabilistic Inference &amp

ADS

Policy Gradients for Optimal Parallel Tempering MCMC

Abstract