Skip to main navigation Skip to search Skip to main content

Human engagement providing evaluative and informative advice for interactive reinforcement learning

  • Adam Bignold
  • , Francisco Cruz
  • , Richard Dazeley
  • , Peter Vamplew
  • , Cameron Foale
  • Federation University Australia
  • Deakin University

Research output: Contribution to journalArticlepeer-review

13 Scopus citations

Abstract

Interactive reinforcement learning proposes the use of externally sourced information in order to speed up the learning process. When interacting with a learner agent, humans may provide either evaluative or informative advice. Prior research has focused on the effect of human-sourced advice by including real-time feedback on the interactive reinforcement learning process, specifically aiming to improve the learning speed of the agent, while minimising the time demands on the human. This work focuses on answering which of two approaches, evaluative or informative, is the preferred instructional approach for humans. Moreover, this work presents an experimental setup for a human trial designed to compare the methods people use to deliver advice in terms of human engagement. The results obtained show that users giving informative advice to the learner agents provide more accurate advice, are willing to assist the learner agent for a longer time, and provide more advice per episode. Additionally, self-evaluation from participants using the informative approach has indicated that the agent’s ability to follow the advice is higher, and therefore, they feel their own advice to be of higher accuracy when compared to people providing evaluative advice.

Original languageEnglish
Pages (from-to)18215-18230
Number of pages16
JournalNeural Computing and Applications
Volume35
Issue number25
DOIs
StatePublished - Sep 2023

Keywords

  • Assisted reinforcement learning
  • Evaluative and informative advice
  • Interactive reinforcement learning
  • Policy shaping
  • Reward shaping
  • User study

Fingerprint

Dive into the research topics of 'Human engagement providing evaluative and informative advice for interactive reinforcement learning'. Together they form a unique fingerprint.

Cite this