TY - GEN
T1 - Real-time inverse dynamics learning for musculoskeletal robots based on echo state Gaussian process regression
AU - Hartmann, Christoph
AU - Boedecker, Joschka
AU - Obst, Oliver
AU - Ikemoto, Shuhei
AU - Asada, Minoru
PY - 2013
Y1 - 2013
N2 - A challenging topic in articulated robots is the control of redundantly many degrees of freedom with artificial muscles. Actuation with these devices is difficult to solve because of nonlinearities, delays and unknown parameters such as friction. Machine learning methods can be used to learn control of these systems, but are faced with the additional problem that the size of the search space prohibits full exploration in reasonable time. We propose a novel method that is able to learn control of redundant robot arms with artificial muscles online from scratch using only the position of the end effector, without using any joint positions, accelerations or an analytical model of the system or the environment. To learn in real time, we use the so called online "goal babbling" method to effectively reduce the search space, a recurrent neural network to represent the state of the robot arm, and novel online Gaussian processes for regression. With our approach, we achieve good performance on trajectory tracking tasks for the end effector of two very challenging systems: a simulated 6 DOF redundant arm with artificial muscles, and a 7 DOF robot arm with McKibben pneumatic artificial muscles. We also show that the combination of techniques we propose results in significantly improved performance over using the individual techniques alone.
AB - A challenging topic in articulated robots is the control of redundantly many degrees of freedom with artificial muscles. Actuation with these devices is difficult to solve because of nonlinearities, delays and unknown parameters such as friction. Machine learning methods can be used to learn control of these systems, but are faced with the additional problem that the size of the search space prohibits full exploration in reasonable time. We propose a novel method that is able to learn control of redundant robot arms with artificial muscles online from scratch using only the position of the end effector, without using any joint positions, accelerations or an analytical model of the system or the environment. To learn in real time, we use the so called online "goal babbling" method to effectively reduce the search space, a recurrent neural network to represent the state of the robot arm, and novel online Gaussian processes for regression. With our approach, we achieve good performance on trajectory tracking tasks for the end effector of two very challenging systems: a simulated 6 DOF redundant arm with artificial muscles, and a 7 DOF robot arm with McKibben pneumatic artificial muscles. We also show that the combination of techniques we propose results in significantly improved performance over using the individual techniques alone.
KW - Gaussian processes
KW - real-time control
KW - robots
UR - http://handle.uws.edu.au:8081/1959.7/uws:35202
UR - http://rss2012.acfr.usyd.edu.au/pmwiki/
M3 - Conference Paper
SN - 9780262519687
SP - 113
EP - 120
BT - Robotics: Science and Systems VIII, 9-13 July 2012, University of Sydney, Sydney, NSW, Australia
PB - MIT Press
T2 - Robotics: Science and Systems Conference
Y2 - 9 July 2012
ER -