{"id":607,"date":"2016-02-21T07:06:48","date_gmt":"2016-02-21T07:06:48","guid":{"rendered":"http:\/\/www.labren.org\/mm\/?p=607"},"modified":"2018-05-31T07:08:49","modified_gmt":"2018-05-31T07:08:49","slug":"towards-transferring-skills-to-flexible-surgical-robots-with-programming-by-demonstration-and-reinforcement-learning","status":"publish","type":"post","link":"http:\/\/www.labren.org\/mm\/projects\/towards-transferring-skills-to-flexible-surgical-robots-with-programming-by-demonstration-and-reinforcement-learning\/","title":{"rendered":"Towards transferring skills to flexible surgical robots with programming by demonstration and reinforcement learning"},"content":{"rendered":"<div class=\"t m0 x2 h6 y10 ff2 fs4 fc0 sc0 lsb ws3\"><strong>Abstract:<\/strong><\/div>\n<div class=\"t m0 x2 h6 y10 ff2 fs4 fc0 sc0 lsb ws3\"><span class=\"ff3 lsc wsd\">Flexible manipulators such as tendon-driven ser-<\/span><\/div>\n<div class=\"t m0 xf h6 y11 ff3 fs4 fc0 sc0 lsd wse\">pentine manipulators perform be<span class=\"lse wsf\">tter than traditional rigid <\/span><\/div>\n<div class=\"t m0 xf h6 y12 ff3 fs4 fc0 sc0 lsf ws10\">ones in minimally in<span class=\"lsb ws11\">vasive surgical tasks, including navigation <\/span><\/div>\n<div class=\"t m0 xf h6 y13 ff3 fs4 fc0 sc0 ls1c ws12\">in confined space through key-hole like incisions. However, due<\/div>\n<div class=\"t m0 xf h6 y14 ff3 fs4 fc0 sc0 ls21 ws13\">to the inherent nonlinearities <span class=\"ls22 ws14\">and model uncertainties, motion <\/span><\/div>\n<div class=\"t m0 xf h6 y15 ff3 fs4 fc0 sc0 lsc ws15\">control of such manipulators beco<span class=\"ls10 ws16\">mes extremely challenging. <\/span><\/div>\n<div class=\"t m0 xf h6 y16 ff3 fs4 fc0 sc0 ls10 ws17\">In this work, a hybrid framework combining Programming by<\/div>\n<div class=\"t m0 xf h6 y17 ff3 fs4 fc0 sc0 ls11 ws18\">Demonstration (PbD) and reinforcement learning is proposed<\/div>\n<div class=\"t m0 xf h6 y18 ff3 fs4 fc0 sc0 ls23 ws19\">to solve this problem. Gaussian Mixture Models (GMM),<\/div>\n<div class=\"t m0 xf h6 y19 ff3 fs4 fc0 sc0 ls22 ws1a\">Gaussian Mixture Regression (GMR) and linear regression are<\/div>\n<div class=\"t m0 xf h6 y1a ff3 fs4 fc0 sc0 ls24 ws1b\">used to learn the inverse kinem<span class=\"ls25 ws1c\">atic model of the manipulator <\/span><\/div>\n<div class=\"t m0 xf h6 y1b ff3 fs4 fc0 sc0 ls26 ws1d\">from human demonstrations. The learned model is used as<\/div>\n<div class=\"t m0 xf h6 y1c ff3 fs4 fc0 sc0 lsc ws1e\">nominal model to calculate the ou<span class=\"ls12 ws1f\">tput end-effector trajectories <\/span><\/div>\n<div class=\"t m0 xf h6 y1d ff3 fs4 fc0 sc0 ls12 ws20\">of the manipulator. Two surgical tasks are performed to<\/div>\n<div class=\"t m0 xf h6 y1e ff3 fs4 fc0 sc0 ls2 ws21\">demonstrate the effectiveness <span class=\"ls10 ws22\">of reinforcement learning: tube <\/span><\/div>\n<div class=\"t m0 xf h6 y1f ff3 fs4 fc0 sc0 ls25 ws23\">insertion and circle following. Gaussian noise is introduced to<\/div>\n<div class=\"t m0 xf h6 y20 ff3 fs4 fc0 sc0 ls27 ws24\">the standard model and the disturbed models are fed to the<\/div>\n<div class=\"t m0 xf h6 y21 ff3 fs4 fc0 sc0 lsc ws25\">manipulator to calculate the actuator input with respect to the<\/div>\n<div class=\"t m0 xf h6 y22 ff3 fs4 fc0 sc0 lsb ws26\">task specific end-effector tr<span class=\"ls13 ws27\">ajectories. An expectation <\/span><\/div>\n<div class=\"t m0 xf h6 y23 ff3 fs4 fc0 sc0 ls14 ws28\">maximization (E-M) based reinforcement learning algorithm is<\/div>\n<div class=\"t m0 xf h6 y24 ff3 fs4 fc0 sc0 ls27 ws29\">used to update the disturbed model with returns from rollouts.<\/div>\n<div class=\"t m0 xf h6 y25 ff3 fs4 fc0 sc0 ls15 ws2a\">Simulation results have verifi<span class=\"ls21 ws2b\">ed that the disturbed model can <\/span><\/div>\n<div class=\"t m0 xf h6 y26 ff3 fs4 fc0 sc0 ls28 ws2c\">be converged to the standard one and the tracking accuracy is<\/div>\n<div class=\"t m0 xf h6 y27 ff3 fs4 fc0 sc0 ls28 ws4\">enhanced.<\/div>\n<div><strong>More Information:<\/strong><\/div>\n<div><a href=\"https:\/\/www.researchgate.net\/publication\/301258084_Towards_transferring_skills_to_flexible_surgical_robots_with_programming_by_demonstration_and_reinforcement_learning\">Full Text<\/a><\/div>\n","protected":false},"excerpt":{"rendered":"<p>Abstract: Flexible manipulators such as tendon-driven ser- pentine manipulators perform better than traditional rigid ones in minimally invasive surgical tasks, including navigation in confined space through key-hole like incisions. However, due to the inherent nonlinearities and model uncertainties, motion control of such manipulators becomes extremely challenging. In this work, a\u2026 <a class=\"continue-reading-link\" href=\"http:\/\/www.labren.org\/mm\/projects\/towards-transferring-skills-to-flexible-surgical-robots-with-programming-by-demonstration-and-reinforcement-learning\/\">Continue reading<\/a><\/p>\n","protected":false},"author":12,"featured_media":608,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"ngg_post_thumbnail":0,"footnotes":""},"categories":[9],"tags":[],"class_list":["post-607","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-projects"],"_links":{"self":[{"href":"http:\/\/www.labren.org\/mm\/wp-json\/wp\/v2\/posts\/607","targetHints":{"allow":["GET"]}}],"collection":[{"href":"http:\/\/www.labren.org\/mm\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/www.labren.org\/mm\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/www.labren.org\/mm\/wp-json\/wp\/v2\/users\/12"}],"replies":[{"embeddable":true,"href":"http:\/\/www.labren.org\/mm\/wp-json\/wp\/v2\/comments?post=607"}],"version-history":[{"count":1,"href":"http:\/\/www.labren.org\/mm\/wp-json\/wp\/v2\/posts\/607\/revisions"}],"predecessor-version":[{"id":609,"href":"http:\/\/www.labren.org\/mm\/wp-json\/wp\/v2\/posts\/607\/revisions\/609"}],"wp:featuredmedia":[{"embeddable":true,"href":"http:\/\/www.labren.org\/mm\/wp-json\/wp\/v2\/media\/608"}],"wp:attachment":[{"href":"http:\/\/www.labren.org\/mm\/wp-json\/wp\/v2\/media?parent=607"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/www.labren.org\/mm\/wp-json\/wp\/v2\/categories?post=607"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/www.labren.org\/mm\/wp-json\/wp\/v2\/tags?post=607"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}