Skip to main navigation Skip to search Skip to main content

Agent-advising approaches in an interactive reinforcement learning scenario

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

16 Scopus citations

Abstract

Reinforcement learning has become one of the fundamental topics in the field of robotics and machine learning. In this paper, we expand the classical reinforcement learning framework by the idea of external interaction to support the learning process. To this end, we review a number of proposed advising approaches for interactive reinforcement learning and discuss their implications, namely, probabilistic advising, early advising, importance advising, and mistake correcting. Moreover, we implement the advice strategies for interactive reinforcement learning based on a simulated robotic scenario of a domestic cleaning task. The obtained results show that the mistake correcting approach outperforms a purely probabilistic advice approach as well as the early and importance advising approaches allowing to collect more reward and also to converge faster.

Original languageEnglish
Title of host publication7th Joint IEEE International Conference on Development and Learning and on Epigenetic Robotics, ICDL-EpiRob 2017
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages209-214
Number of pages6
ISBN (Electronic)9781538637159
DOIs
StatePublished - 2 Jul 2017
Event7th Joint IEEE International Conference on Development and Learning and on Epigenetic Robotics, ICDL-EpiRob 2017 - Lisbon, Portugal
Duration: 18 Sep 201721 Sep 2017

Publication series

Name7th Joint IEEE International Conference on Development and Learning and on Epigenetic Robotics, ICDL-EpiRob 2017
Volume2018-January

Conference

Conference7th Joint IEEE International Conference on Development and Learning and on Epigenetic Robotics, ICDL-EpiRob 2017
Country/TerritoryPortugal
CityLisbon
Period18/09/1721/09/17

Fingerprint

Dive into the research topics of 'Agent-advising approaches in an interactive reinforcement learning scenario'. Together they form a unique fingerprint.

Cite this