Learning Vision-based Pursuit-Evasion Robot Policies

Learning strategic robot behavior -- like that required in pursuit-evasion interactions -- under real-world constraints is extremely challenging. It requires exploiting the dynamics of the interaction, and planning through both physical state and latent intent uncertainty. In this paper, we transform this intractable problem into a supervised learning problem, where a fully-observable robot policy generates supervision for a partially-observable one. We find that the quality of the supervision signal for the partially-observable pursuer policy depends on two key factors: the balance of diversity and optimality of the evader's behavior and the strength of the modeling assumptions in the fully-observable policy. We deploy our policy on a physical quadruped robot with an RGB-D camera on pursuit-evasion interactions in the wild. Despite all the challenges, the sensing constraints bring about creativity: the robot is pushed to gather information when uncertain, predict intent from noisy measurements, and anticipate in order to intercept. Project webpage: https://abajcsy.github.io/vision-based-pursuit/

Publication: arXiv e-prints

Pub Date: August 2023

DOI: 10.48550/arXiv.2308.16185

arXiv: arXiv:2308.16185

Bibcode: 2023arXiv230816185B

Keywords: Computer Science - Robotics; Computer Science - Artificial Intelligence

E-Print: Includes Supplementary. Project webpage at https://abajcsy.github.io/vision-based-pursuit/
