Combining Self-Supervised Learning and Imitation for Vision-Based Rope Manipulation

doi:10.48550/arXiv.1703.02018

Combining Self-Supervised Learning and Imitation for Vision-Based Rope Manipulation

Manipulation of deformable objects, such as ropes and cloth, is an important but challenging problem in robotics. We present a learning-based system where a robot takes as input a sequence of images of a human manipulating a rope from an initial to goal configuration, and outputs a sequence of actions that can reproduce the human demonstration, using only monocular images as input. To perform this task, the robot learns a pixel-level inverse dynamics model of rope manipulation directly from images in a self-supervised manner, using about 60K interactions with the rope collected autonomously by the robot. The human demonstration provides a high-level plan of what to do and the low-level inverse model is used to execute the plan. We show that by combining the high and low-level plans, the robot can successfully manipulate a rope into a variety of target shapes using only a sequence of human-provided images for direction.

Publication:

arXiv e-prints

Pub Date:

March 2017

DOI:

10.48550/arXiv.1703.02018

arXiv:

arXiv:1703.02018

Bibcode:

2017arXiv170302018N

Keywords:

Computer Science - Computer Vision and Pattern Recognition;
Computer Science - Machine Learning;
Computer Science - Robotics

E-Print:

8 pages, accepted to International Conference on Robotics and Automation (ICRA) 2017

NASA/ADS

Combining Self-Supervised Learning and Imitation for Vision-Based Rope Manipulation

Abstract