Implementing Online Reinforcement Learning with Temporal Neural Networks
Abstract
A Temporal Neural Network (TNN) architecture for implementing efficient online reinforcement learning is proposed and studied via simulation. The proposed T-learning system is composed of a frontend TNN that implements online unsupervised clustering and a backend TNN that implements online reinforcement learning. The reinforcement learning paradigm employs biologically plausible neo-Hebbian three-factor learning rules. As a working example, a prototype implementation of the cart-pole problem (balancing an inverted pendulum) is studied via simulation.
- Publication:
-
arXiv e-prints
- Pub Date:
- April 2022
- DOI:
- 10.48550/arXiv.2204.05437
- arXiv:
- arXiv:2204.05437
- Bibcode:
- 2022arXiv220405437S
- Keywords:
-
- Computer Science - Neural and Evolutionary Computing;
- 68T07;
- I.2.6