Obstacle avoidance and navigation utilizing reinforcement learning with reward shaping

doi:10.1117/12.2558212

In this paper, we investigate obstacle avoidance and navigation in robotic control. To solve this problem, we propose revised Deep Deterministic Policy Gradient (DDPG) and Proximal Policy Optimization (PPO) algorithms with an improved reward shaping technique. We compare the original DDPG and PPO against their revised versions in simulations with a real mobile robot and demonstrate that the proposed algorithms achieve better results.

Publication: Artificial Intelligence and Machine Learning for Multi-Domain Operations Applications II

Pub Date: April 2020

DOI: 10.1117/12.2558212

arXiv: 2003.12863

Bibcode: 2020SPIE11413E..1HZ

Keywords: Computer Science - Robotics; Computer Science - Machine Learning; Electrical Engineering and Systems Science - Systems and Control

Abstract