Iterative Residual Policy for Goal-Conditioned Dynamic Manipulation of Deformable Objects
FOS: Computer and information sciences
Computer Science - Robotics
0209 industrial biotechnology
02 engineering and technology
Robotics (cs.RO)
DOI:
10.15607/rss.2022.xviii.016
Publication Date:
2022-07-02T22:49:15Z
AUTHORS (5)
ABSTRACT
This paper tackles the task of goal-conditioned dynamic manipulation deformable objects.This is highly challenging due to its complex dynamics (introduced by object deformation and high-speed action) strict requirements (defined a precise goal specification).To address these challenges, we present Iterative Residual Policy (IRP), general learning framework applicable repeatable tasks with dynamics.IRP learns an implicit policy via delta dynamicsinstead modeling entire dynamical system inferring actions from that model, IRP predict effects action on previously-observed trajectory.When combined adaptive sampling, can quickly optimize online reach specified goal.We demonstrate effectiveness two tasks: whipping rope hit target point swinging cloth pose.Despite being trained only in simulation fixed robot setup, able efficiently generalize noisy real-world dynamics, new objects unseen physical properties, even different hardware embodiments, demonstrating excellent generalization capability relative alternative approaches.
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES (0)
CITATIONS (24)
EXTERNAL LINKS
PlumX Metrics
RECOMMENDATIONS
FAIR ASSESSMENT
Coming soon ....
JUPYTER LAB
Coming soon ....