Boosting Reinforcement Learning Algorithms in Continuous Robotic Reaching Tasks Using Adaptive Potential Functions

Producción científica: Capítulo del libro/informe/acta de congresoContribución a la conferenciarevisión exhaustiva

1 Cita (Scopus)

Resumen

In reinforcement learning, reward shaping is an efficient way to augment the reward signal, so to guide the learning process of an agent. A well-known reward shaping framework is the potential-based reward shaping (PBRS) framework, which uses a so-called potential function to guarantee the policy invariance after reward shaping, to prevent undesirable behavior. Different from using a predefined potential function in many works, [3] proposed a novel adaptive potential function (APF) method to learn the potential function concurrently with the RL training from the agent’s training history. However, the APF method was only deployed and evaluated in small discrete environments. This paper bridges the gap by adapting the APF method in robotics, a typical continuous scenario. We apply the APF method with the Deep Deterministic Policy Gradient (DDPG) algorithm to form a new APF-DDPG algorithm. To evaluate our method, we deploy the APF-DDPG to control a Baxtor robot for a series of reaching tasks in both simulations and the real world. The experimental results show that the APF-DDPG algorithm significantly outperforms the baseline DDPG algorithm. The code is available at https://github.com/yfchenShirley/APF_DDPG.

Idioma originalInglés
Título de la publicación alojadaAI 2024
Subtítulo de la publicación alojadaAdvances in Artificial Intelligence - 37th Australasian Joint Conference on Artificial Intelligence, AI 2024, Proceedings
EditoresMingming Gong, Yiliao Song, Yun Sing Koh, Wei Xiang, Derui Wang
EditorialSpringer Science and Business Media Deutschland GmbH
Páginas52-64
Número de páginas13
ISBN (versión impresa)9789819603503
DOI
EstadoPublicada - 2025
Evento37th Australasian Joint Conference on Artificial Intelligence, AJCAI 2024 - Melbourne, Australia
Duración: 25 nov. 202429 nov. 2024

Serie de la publicación

NombreLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volumen15443 LNAI
ISSN (versión impresa)0302-9743
ISSN (versión digital)1611-3349

Conferencia

Conferencia37th Australasian Joint Conference on Artificial Intelligence, AJCAI 2024
País/TerritorioAustralia
CiudadMelbourne
Período25/11/2429/11/24

Huella

Profundice en los temas de investigación de 'Boosting Reinforcement Learning Algorithms in Continuous Robotic Reaching Tasks Using Adaptive Potential Functions'. En conjunto forman una huella única.

Citar esto