Skip to main navigation Skip to search Skip to main content

A Robust Approach for Continuous Interactive Actor-Critic Algorithms

  • Cristian C. Millan-Arias
  • , Bruno J.T. Fernandes
  • , Francisco Cruz
  • , Richard Dazeley
  • , Sergio Fernandes
  • Universidade de Pernambuco
  • Deakin University

Research output: Contribution to journalArticlepeer-review

23 Scopus citations

Abstract

Reinforcement learning refers to a machine learning paradigm in which an agent interacts with the environment to learn how to perform a task. The characteristics of the environment may change over time or be affected by disturbances not controlled, avoiding the agent finding a proper policy. Some approaches attempt to address these problems, as interactive reinforcement learning, where an external entity helps the agent learn through advice. Other approaches, such as robust reinforcement learning, allow the agent to learn the task, acting in a disturbed environment. In this paper, we propose an approach that addresses interactive reinforcement learning problems in a dynamic environment, where advice provides information on the task and the dynamics of the environment. Thus, an agent learns a policy in a disturbed environment while receiving advice. We implement our approach in the dynamic version of the cart-pole balancing task and a simulated robotic arm dynamic environment to organize objects. Our results show that the proposed approach allows an agent to complete the task satisfactorily in a dynamic, continuous state-action domain. Moreover, experimental results suggest agents trained with our approach are less sensitive to changes in the characteristics of the environment than interactive reinforcement learning agents.

Original languageEnglish
Article number9493212
Pages (from-to)104242-104260
Number of pages19
JournalIEEE Access
Volume9
DOIs
StatePublished - 2021

Keywords

  • Continuous interactive reinforcement learning
  • interactive robust reinforcement learning
  • reinforcement learning
  • robust reinforcement learning

Fingerprint

Dive into the research topics of 'A Robust Approach for Continuous Interactive Actor-Critic Algorithms'. Together they form a unique fingerprint.

Cite this