Active learning of manipulation sequences

Martinez, David; Alenya, Guillem; Jimenez, Pablo; Torras, Carme; Rossmann, Jurgen; Wantia, Nils; Aksoy, Eren Erdal; Haller, Simon; Piater, Justus

Published in

2014 IEEE International Conference on Robotics and Automation (ICRA)

DOI: 10.1109/icra.2014.6907693

Proceedings article published in 2014 by David Martinez, Guillem Alenya, Pablo Jimenez, Carme Torras, Jurgen Rossmann, Nils Wantia, Eren Erdal Aksoy, Simon Haller, Justus Piater

This paper is available in a repository.

Preprint: archiving allowed

Postprint: archiving allowed

Published version: archiving forbidden

Abstract

Paper presented at ICRA 2014, held in Hong Kong from May 31 to June 7.

We describe a system that allows a robot to learn goal-directed manipulation sequences, such as the steps of an assembly task. Learning is based on a free mix of exploration and instruction by an external teacher, and may be active in the sense that the system tests actions to maximize learning progress and asks the teacher if needed. The main component is a symbolic planning engine that operates on learned rules, defined by actions and their pre- and postconditions. Rules are learned by model-based reinforcement learning and are immediately available for planning; thus, there are no distinct learning and application phases. We show how dynamic plans, replanned after every action if necessary, can be used for automatic execution of manipulation sequences, for monitoring of observed manipulation sequences, or for a mix of the two, all while extending and refining the rule base on the fly. Quantitative results indicate fast convergence using few training examples, and highly effective teacher intervention at early stages of learning.

The research leading to these results has received funding from the European Community’s Seventh Framework Programme FP7/2007-2013 (Specific Programme Cooperation, Theme 3, Information and Communication Technologies) under grant agreement no. 269959, IntellAct. D. Martínez is also supported by the Spanish Ministry of Education, Culture and Sport via an FPU doctoral grant (FPU12-04173).

Peer Reviewed
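The loop described in the abstract (plan over symbolic rules, execute one action, replan, and ask the teacher when no plan exists) can be illustrated with a minimal sketch. This is not the authors' implementation: the `Rule` class, the breadth-first `plan` function, the `run` loop, the `teacher` callback, and the peg-insertion facts are all hypothetical names chosen for illustration. In particular, the teacher here simply hands over a complete rule, which stands in for the model-based reinforcement learning that the paper uses to learn rules from experience.

```python
from collections import deque
from dataclasses import dataclass


@dataclass(frozen=True)
class Rule:
    """Hypothetical symbolic rule: an action with pre- and postconditions."""
    action: str
    pre: frozenset      # facts that must hold before the action
    add: frozenset      # facts the action makes true
    delete: frozenset   # facts the action makes false

    def applicable(self, state):
        return self.pre <= state

    def apply(self, state):
        return (state - self.delete) | self.add


def plan(state, goal, rules):
    """Breadth-first search over symbolic states; returns a rule list or None."""
    start = frozenset(state)
    queue = deque([(start, [])])
    seen = {start}
    while queue:
        current, steps = queue.popleft()
        if goal <= current:
            return steps
        for rule in rules:
            if rule.applicable(current):
                nxt = rule.apply(current)
                if nxt not in seen:
                    seen.add(nxt)
                    queue.append((nxt, steps + [rule]))
    return None


def run(state, goal, rules, execute, ask_teacher, max_steps=20):
    """Replan after every action; ask the teacher when no plan is found."""
    state = frozenset(state)
    rules = list(rules)
    trace = []
    for _ in range(max_steps):
        if goal <= state:
            break
        steps = plan(state, goal, rules)
        if steps is None:
            # Assumption: the teacher returns a ready-made rule. New rules
            # are usable by the planner on the very next iteration.
            rules.append(ask_teacher(state, goal))
            trace.append("ask_teacher")
            continue
        # Execute only the first planned step, then replan from the result.
        state = frozenset(execute(state, steps[0]))
        trace.append(steps[0].action)
    return state, rules, trace


# Toy peg-insertion task (all fact and action names are made up).
PICK = Rule("pick_peg",
            pre=frozenset({"hand_empty", "peg_on_table"}),
            add=frozenset({"holding_peg"}),
            delete=frozenset({"hand_empty", "peg_on_table"}))
INSERT = Rule("insert_peg",
              pre=frozenset({"holding_peg"}),
              add=frozenset({"peg_in_hole", "hand_empty"}),
              delete=frozenset({"holding_peg"}))
GOAL = frozenset({"peg_in_hole"})


def teacher(state, goal):
    """Stand-in teacher that demonstrates the missing insertion rule."""
    return INSERT


def simulate(state, rule):
    """Stand-in for the robot: here every action succeeds as modelled."""
    return rule.apply(state)


final_state, learned_rules, trace = run(
    {"hand_empty", "peg_on_table"}, GOAL, [PICK], simulate, teacher)
print(trace)
```

In this toy run the planner starts with only the pick rule, finds no plan, requests help once, and then completes the task with the extended rule base. Executing a single step per iteration is what lets the loop recover when a real action has an unexpected outcome, since the next plan is computed from the observed state rather than the predicted one.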
