Skip to main navigation Skip to search Skip to main content

Boosting Reinforcement Learning Algorithms in Continuous Robotic Reaching Tasks Using Adaptive Potential Functions

  • University of Groningen
  • Data61 of CSIRO
  • UNSW Sydney

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

In reinforcement learning, reward shaping is an efficient way to augment the reward signal, so to guide the learning process of an agent. A well-known reward shaping framework is the potential-based reward shaping (PBRS) framework, which uses a so-called potential function to guarantee the policy invariance after reward shaping, to prevent undesirable behavior. Different from using a predefined potential function in many works, [3] proposed a novel adaptive potential function (APF) method to learn the potential function concurrently with the RL training from the agent’s training history. However, the APF method was only deployed and evaluated in small discrete environments. This paper bridges the gap by adapting the APF method in robotics, a typical continuous scenario. We apply the APF method with the Deep Deterministic Policy Gradient (DDPG) algorithm to form a new APF-DDPG algorithm. To evaluate our method, we deploy the APF-DDPG to control a Baxtor robot for a series of reaching tasks in both simulations and the real world. The experimental results show that the APF-DDPG algorithm significantly outperforms the baseline DDPG algorithm. The code is available at https://github.com/yfchenShirley/APF_DDPG.

Original languageEnglish
Title of host publicationAI 2024
Subtitle of host publicationAdvances in Artificial Intelligence - 37th Australasian Joint Conference on Artificial Intelligence, AI 2024, Proceedings
EditorsMingming Gong, Yiliao Song, Yun Sing Koh, Wei Xiang, Derui Wang
PublisherSpringer Science and Business Media Deutschland GmbH
Pages52-64
Number of pages13
ISBN (Print)9789819603503
DOIs
StatePublished - 2025

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume15443 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Keywords

  • Reinforcement learning
  • Reward shaping
  • Robot tasks

Fingerprint

Dive into the research topics of 'Boosting Reinforcement Learning Algorithms in Continuous Robotic Reaching Tasks Using Adaptive Potential Functions'. Together they form a unique fingerprint.

Cite this