Learning Generalizable Tool Use with Non-rigid Grasp-pose Registration
Tool use, a hallmark feature of human intelligence, remains a challenging problem in robotics due the complex contacts and high-dimensional action space. In this work, we present a novel method to enable reinforcement learning of tool use behaviors. Our approach provides a scalable way to learn the operation of tools in a new category using only a single demonstration. To this end, we propose a new method for generalizing grasping configurations of multi-fingered robotic hands to novel objects. This is used to guide the policy search via favorable initializations and a shaped reward signal. The learned policies solve complex tool use tasks and generalize to unseen tools at test time. Visualizations and videos of the trained policies are available at https://maltemosbach.github.io/generalizable_tool_use.
- Published in:
2023 IEEE 19th International Conference on Automation Science and Engineering (CASE) - Type:
Inproceedings - Authors:
Mosbach, Malte; Behnke, Sven - Year:
2023 - Source:
https://ieeexplore.ieee.org/document/10260551
Citation information
Mosbach, Malte; Behnke, Sven: Learning Generalizable Tool Use with Non-rigid Grasp-pose Registration, 2023 IEEE 19th International Conference on Automation Science and Engineering (CASE), 2023, August, https://ieeexplore.ieee.org/document/10260551, Mosbach.Behnke.2023a,
@Inproceedings{Mosbach.Behnke.2023a,
author={Mosbach, Malte; Behnke, Sven},
title={Learning Generalizable Tool Use with Non-rigid Grasp-pose Registration},
booktitle={2023 IEEE 19th International Conference on Automation Science and Engineering (CASE)},
month={August},
url={https://ieeexplore.ieee.org/document/10260551},
year={2023},
abstract={Tool use, a hallmark feature of human intelligence, remains a challenging problem in robotics due the complex contacts and high-dimensional action space. In this work, we present a novel method to enable reinforcement learning of tool use behaviors. Our approach provides a scalable way to learn the operation of tools in a new category using only a single demonstration. To this end, we propose a...}}