Pre- and post-contact policy decomposition for non-prehensile manipulation with zero-shot sim-to-real transfer

doi:10.48550/arXiv.2309.02754

We present a system for non-prehensile manipulation tasks that require a significant number of contact mode transitions and the use of environmental contacts to successfully move an object to a target location. Our method is based on deep reinforcement learning which, unlike state-of-the-art planning algorithms, does not require a priori knowledge of the physical parameters of the object or environment, such as friction coefficients or centers of mass. Planning time is reduced to the time of a single feed-forward prediction by a neural network. We propose a computational structure, an action space design, and a curriculum learning scheme that facilitate efficient exploration and sim-to-real transfer. In challenging real-world non-prehensile manipulation tasks, we show that our method generalizes over different objects and succeeds even for novel objects not seen during training. Project website: https://sites.google.com/view/nonprenehsile-decomposition

Publication: arXiv e-prints

Pub Date: September 2023

DOI: 10.48550/arXiv.2309.02754

arXiv: arXiv:2309.02754

Bibcode: 2023arXiv230902754K

Keywords: Computer Science - Robotics

E-Print: Accepted to the 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
