Hierarchical Policy Blending As Optimal Transport
Abstract
We present hierarchical policy blending as optimal transport (HiPBOT). HiPBOT hierarchically adjusts the weights of low-level reactive expert policies of different agents by adding a look-ahead planning layer on the parameter space. The high-level planner renders policy blending as unbalanced optimal transport, consolidating the scaling of the underlying Riemannian motion policies. As a result, HiPBOT effectively decides the priorities between expert policies and agents, ensuring the task's success and guaranteeing safety. Experimental results in several application scenarios, from low-dimensional navigation to high-dimensional whole-body control, show the efficacy and efficiency of HiPBOT. Our method outperforms state-of-the-art baselines, which either adopt probabilistic inference or define a tree structure of experts, paving the way for new applications of optimal transport to robot control. More material is available at https://sites.google.com/view/hipobot
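The abstract's central idea, choosing expert weights by solving an unbalanced optimal transport problem, can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes an entropic formulation with KL-relaxed marginals solved by Sinkhorn-style scaling, a hypothetical cost matrix `C[i, j]` scoring expert `i` at candidate look-ahead point `j`, and a blending rule that takes the normalized row masses of the transport plan as weights. All function names, parameters (`eps`, `tau`), and numbers are illustrative assumptions.

```python
import numpy as np

def unbalanced_sinkhorn(a, b, C, eps=0.1, tau=1.0, n_iters=200):
    """Entropic unbalanced OT with KL-relaxed marginals (assumed formulation).

    a, b : source/target masses; C : cost matrix of shape (len(a), len(b)).
    eps  : entropic regularization; tau : marginal relaxation strength.
    """
    K = np.exp(-C / eps)          # Gibbs kernel
    u = np.ones_like(a)
    v = np.ones_like(b)
    rho = tau / (tau + eps)       # damping exponent from the KL relaxation
    for _ in range(n_iters):
        u = (a / (K @ v)) ** rho
        v = (b / (K.T @ u)) ** rho
    return u[:, None] * K * v[None, :]   # transport plan

def blend_experts(actions, C, eps=0.1, tau=1.0):
    """Hypothetical blending rule: weights = normalized row mass of the plan."""
    n, m = C.shape
    a = np.full(n, 1.0 / n)       # uniform prior over experts (assumption)
    b = np.full(m, 1.0 / m)       # uniform mass over look-ahead points (assumption)
    P = unbalanced_sinkhorn(a, b, C, eps, tau)
    w = P.sum(axis=1)
    w = w / w.sum()
    return w, w @ actions         # weights and the blended action

# Three illustrative experts proposing 2-D actions, four look-ahead points.
actions = np.array([[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]])
C = np.array([[0.1, 0.2, 0.1, 0.2],
              [1.0, 1.1, 0.9, 1.0],
              [2.0, 1.9, 2.1, 2.0]])
weights, blended = blend_experts(actions, C)
```

Because the marginals are only softly enforced, experts whose costs are high everywhere receive less total mass than their prior, which is what lets the plan act as a priority assignment rather than a fixed-budget split.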
- Publication: arXiv e-prints
- Pub Date: December 2022
- DOI: 10.48550/arXiv.2212.01938
- arXiv: arXiv:2212.01938
- Bibcode: 2022arXiv221201938L
- Keywords: Computer Science - Robotics; Computer Science - Machine Learning; Electrical Engineering and Systems Science - Systems and Control
- E-Print: 16 pages, 5 figures, accepted to the 5th Annual Learning for Dynamics &