Obstacle avoidance and navigation utilizing reinforcement learning with reward shaping
Abstract
In this paper, we investigate the problem of obstacle avoidance and navigation in robotic control. To solve it, we propose revised versions of the Deep Deterministic Policy Gradient (DDPG) and Proximal Policy Optimization (PPO) algorithms that use an improved reward shaping technique. We compare the original DDPG and PPO against their revised versions in simulations with a real mobile robot and demonstrate that the proposed algorithms achieve better results.
- Publication:
- Artificial Intelligence and Machine Learning for Multi-Domain Operations Applications II
- Pub Date:
- April 2020
- DOI:
- 10.1117/12.2558212
- arXiv:
- arXiv:2003.12863
- Bibcode:
- 2020SPIE11413E..1HZ
- Keywords:
- Computer Science - Robotics;
- Computer Science - Machine Learning;
- Electrical Engineering and Systems Science - Systems and Control