Boosting Reinforcement Learning Algorithms in Continuous Robotic Reaching Tasks Using Adaptive Potential Functions

Yifei Chen*, Lambert Schomaker, Francisco Cruz

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

Abstract

In reinforcement learning, reward shaping is an efficient way to augment the reward signal, so to guide the learning process of an agent. A well-known reward shaping framework is the potential-based reward shaping (PBRS) framework, which uses a so-called potential function to guarantee the policy invariance after reward shaping, to prevent undesirable behavior. Different from using a predefined potential function in many works, [3] proposed a novel adaptive potential function (APF) method to learn the potential function concurrently with the RL training from the agent’s training history. However, the APF method was only deployed and evaluated in small discrete environments. This paper bridges the gap by adapting the APF method in robotics, a typical continuous scenario. We apply the APF method with the Deep Deterministic Policy Gradient (DDPG) algorithm to form a new APF-DDPG algorithm. To evaluate our method, we deploy the APF-DDPG to control a Baxtor robot for a series of reaching tasks in both simulations and the real world. The experimental results show that the APF-DDPG algorithm significantly outperforms the baseline DDPG algorithm. The code is available at https://github.com/yfchenShirley/APF_DDPG.

Original languageEnglish
Title of host publicationAI 2024
Subtitle of host publicationAdvances in Artificial Intelligence - 37th Australasian Joint Conference on Artificial Intelligence, AI 2024, Proceedings
EditorsMingming Gong, Yiliao Song, Yun Sing Koh, Wei Xiang, Derui Wang
PublisherSpringer
Pages52-64
Number of pages13
ISBN (Electronic)978-981-96-0351-0
ISBN (Print)978-981-96-0350-3
DOIs
Publication statusPublished - 20-Nov-2024
Event37th Australasian Joint Conference on Artificial Intelligence, AJCAI 2024 - Melbourne, Australia
Duration: 25-Nov-202429-Nov-2024

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
PublisherSpringer
Volume15443 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference37th Australasian Joint Conference on Artificial Intelligence, AJCAI 2024
Country/TerritoryAustralia
CityMelbourne
Period25/11/202429/11/2024

Keywords

  • Reinforcement learning
  • Reward shaping
  • Robot tasks

Fingerprint

Dive into the research topics of 'Boosting Reinforcement Learning Algorithms in Continuous Robotic Reaching Tasks Using Adaptive Potential Functions'. Together they form a unique fingerprint.

Cite this