Reinforcement learning for optimal error correction of toric codes

doi:10.1016/j.physleta.2020.126353

Reinforcement learning for optimal error correction of toric codes

We apply deep reinforcement learning techniques to design high threshold decoders for the toric code under uncorrelated noise. By rewarding the agent only if the decoding procedure preserves the logical states of the toric code, and using deep convolutional networks for the training phase of the agent, we observe near-optimal performance for uncorrelated noise around the theoretically optimal threshold of 11%. We observe that, by and large, the agent implements a policy similar to that of minimum weight perfect matchings even though no bias towards any policy is given a priori.

Publication:

Physics Letters A

Pub Date:

June 2020

DOI:

10.1016/j.physleta.2020.126353

arXiv:

arXiv:1911.02308

Bibcode:

2020PhLA..38426353D

Keywords:

Reinforcement learning;
Error correction;
Toric code;
Neural networks;
Quantum Physics

E-Print:

v2: includes more details on teh Reinforcement learning algorithm used as well as the parameters of the neural network, and training phase of the agent

NASA/ADS

Reinforcement learning for optimal error correction of toric codes

Abstract