Implementing Online Reinforcement Learning with Temporal Neural Networks

doi:10.48550/arXiv.2204.05437

Implementing Online Reinforcement Learning with Temporal Neural Networks

Smith, James E.

A Temporal Neural Network (TNN) architecture for implementing efficient online reinforcement learning is proposed and studied via simulation. The proposed T-learning system is composed of a frontend TNN that implements online unsupervised clustering and a backend TNN that implements online reinforcement learning. The reinforcement learning paradigm employs biologically plausible neo-Hebbian three-factor learning rules. As a working example, a prototype implementation of the cart-pole problem (balancing an inverted pendulum) is studied via simulation.

Publication:

arXiv e-prints

Pub Date:

April 2022

DOI:

10.48550/arXiv.2204.05437

arXiv:

arXiv:2204.05437

Bibcode:

2022arXiv220405437S

Keywords:

Computer Science - Neural and Evolutionary Computing;
68T07;
I.2.6

NASA/ADS

Implementing Online Reinforcement Learning with Temporal Neural Networks

Abstract