Research output per year
Research output per year
Yifei Chen*, Lambert Schomaker, Francisco Cruz
Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Academic › peer-review
In reinforcement learning, reward shaping is an efficient way to augment the reward signal, so to guide the learning process of an agent. A well-known reward shaping framework is the potential-based reward shaping (PBRS) framework, which uses a so-called potential function to guarantee the policy invariance after reward shaping, to prevent undesirable behavior. Different from using a predefined potential function in many works, [3] proposed a novel adaptive potential function (APF) method to learn the potential function concurrently with the RL training from the agent’s training history. However, the APF method was only deployed and evaluated in small discrete environments. This paper bridges the gap by adapting the APF method in robotics, a typical continuous scenario. We apply the APF method with the Deep Deterministic Policy Gradient (DDPG) algorithm to form a new APF-DDPG algorithm. To evaluate our method, we deploy the APF-DDPG to control a Baxtor robot for a series of reaching tasks in both simulations and the real world. The experimental results show that the APF-DDPG algorithm significantly outperforms the baseline DDPG algorithm. The code is available at https://github.com/yfchenShirley/APF_DDPG.
Original language | English |
---|---|
Title of host publication | AI 2024 |
Subtitle of host publication | Advances in Artificial Intelligence - 37th Australasian Joint Conference on Artificial Intelligence, AI 2024, Proceedings |
Editors | Mingming Gong, Yiliao Song, Yun Sing Koh, Wei Xiang, Derui Wang |
Publisher | Springer |
Pages | 52-64 |
Number of pages | 13 |
ISBN (Electronic) | 978-981-96-0351-0 |
ISBN (Print) | 978-981-96-0350-3 |
DOIs | |
Publication status | Published - 20-Nov-2024 |
Event | 37th Australasian Joint Conference on Artificial Intelligence, AJCAI 2024 - Melbourne, Australia Duration: 25-Nov-2024 → 29-Nov-2024 |
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Publisher | Springer |
Volume | 15443 LNAI |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference | 37th Australasian Joint Conference on Artificial Intelligence, AJCAI 2024 |
---|---|
Country/Territory | Australia |
City | Melbourne |
Period | 25/11/2024 → 29/11/2024 |
Research output: Working paper › Preprint › Academic