Implementing Online Reinforcement Learning with Clustering Neural Networks

doi:10.48550/arXiv.2402.18472

Smith, James E.

An agent employing reinforcement learning takes inputs (state variables) from an environment and performs actions that affect the environment in order to achieve some objective. Rewards (positive or negative) guide the agent toward improved future actions. This paper builds on prior clustering neural network research by constructing an agent with biologically plausible neo-Hebbian three-factor synaptic learning rules, with a reward signal as the third factor (in addition to pre- and post-synaptic spikes). The classic cart-pole problem (balancing an inverted pendulum) is used as a running example throughout the exposition. Simulation results demonstrate the efficacy of the approach, and the proposed method may eventually serve as a low-level component of a more general method.
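To make the idea of a three-factor rule concrete, the following is a minimal sketch of a generic reward-modulated Hebbian update, not the rule defined in the paper. The function name `three_factor_update`, the binary spike vectors, the learning rate, and the weight bounds are all assumptions chosen for illustration. A weight changes only when the pre- and post-synaptic neurons both spiked, and the reward sets the sign and size of the change.

```python
# Generic reward-modulated (three-factor) Hebbian update.
# Illustrative only: the names, learning rate, and weight bounds are
# assumptions, not taken from the paper.

def three_factor_update(weights, pre_spikes, post_spikes, reward,
                        learning_rate=0.1, w_min=0.0, w_max=1.0):
    """Return updated weights, where weights[j][i] connects input i to neuron j.

    Factors one and two: the presynaptic input i and the postsynaptic
    neuron j must both have spiked (values of 1) for the weight to change.
    Factor three: the reward scales the change and sets its sign.
    """
    new_weights = []
    for j, row in enumerate(weights):
        new_row = []
        for i, w in enumerate(row):
            coincidence = pre_spikes[i] * post_spikes[j]  # 1 only if both fired
            w = w + learning_rate * reward * coincidence
            new_row.append(min(w_max, max(w_min, w)))  # keep the weight bounded
        new_weights.append(new_row)
    return new_weights


# Two inputs, two neurons: only input 0 and neuron 1 spike.
weights = [[0.5, 0.5], [0.5, 0.5]]
pre = [1, 0]
post = [0, 1]

rewarded = three_factor_update(weights, pre, post, reward=+1.0)
punished = three_factor_update(weights, pre, post, reward=-1.0)
```

In this sketch only the synapse from input 0 to neuron 1 changes: a positive reward strengthens it and a negative reward weakens it, while every other weight is left as it was.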

Publication: arXiv e-prints

Pub Date: February 2024

DOI: 10.48550/arXiv.2402.18472

arXiv: arXiv:2402.18472

Bibcode: 2024arXiv240218472S

Keywords: Computer Science - Neural and Evolutionary Computing; 68T07; I.2
