Self-supervised Learning for Joint Pushing and Grasping Policies in Highly Cluttered Environments

Yongliang Wang, Kamal Mokhtar, Cock Heemskerk, Hamidreza Kasaei

Research output: Academic, peer-reviewed

Abstract

Robotic systems often face challenges when attempting to grasp a target object due to interference from surrounding items. We propose a Deep Reinforcement Learning (DRL) method that develops joint policies for grasping and pushing, enabling effective manipulation of target objects within untrained, densely cluttered environments. In particular, a dual RL model is introduced, which shows high resilience in handling complicated scenes, reaching an average of 98% task completion in simulation and real-world scenes. To evaluate the proposed method, we conduct comprehensive simulation experiments in three distinct environments: densely packed building blocks, randomly positioned building blocks, and common household objects. Further, real-world tests are conducted using actual robots to confirm the robustness of our approach in various untrained and highly cluttered environments. The experimental results underscore the superior efficacy of our method in both simulated and real-world scenarios, outperforming recent state-of-the-art methods. To ensure reproducibility and further the academic discourse, we make a demonstration video, the trained models, and the source code publicly available at https://sites.google.com/view/pushandgrasp/home.
Original language: English
Title of host publication: 2024 IEEE International Conference on Robotics and Automation (ICRA)
Publisher: IEEE
Pages: 13840-13847
Number of pages: 8
ISBN (print): 979-8-3503-8458-1
Status: Published, 17 May 2024
Event: 2024 IEEE International Conference on Robotics and Automation (ICRA), Yokohama, Japan
Duration: 13 May 2024 to 17 May 2024

Conference

Conference: 2024 IEEE International Conference on Robotics and Automation (ICRA)
Period: 13/05/2024 to 17/05/2024
