TY - GEN
T1 - Agent-advising approaches in an interactive reinforcement learning scenario
AU - Cruz, Francisco
AU - Wuppen, Peter
AU - Magg, Sven
AU - Fazrie, Alvin
AU - Wermter, Stefan
N1 - Publisher Copyright:
© 2017 IEEE.
PY - 2017/7/2
Y1 - 2017/7/2
N2 - Reinforcement learning has become one of the fundamental topics in the field of robotics and machine learning. In this paper, we expand the classical reinforcement learning framework by the idea of external interaction to support the learning process. To this end, we review a number of proposed advising approaches for interactive reinforcement learning and discuss their implications, namely, probabilistic advising, early advising, importance advising, and mistake correcting. Moreover, we implement the advice strategies for interactive reinforcement learning based on a simulated robotic scenario of a domestic cleaning task. The obtained results show that the mistake correcting approach outperforms a purely probabilistic advice approach as well as the early and importance advising approaches allowing to collect more reward and also to converge faster.
AB - Reinforcement learning has become one of the fundamental topics in the field of robotics and machine learning. In this paper, we expand the classical reinforcement learning framework by the idea of external interaction to support the learning process. To this end, we review a number of proposed advising approaches for interactive reinforcement learning and discuss their implications, namely, probabilistic advising, early advising, importance advising, and mistake correcting. Moreover, we implement the advice strategies for interactive reinforcement learning based on a simulated robotic scenario of a domestic cleaning task. The obtained results show that the mistake correcting approach outperforms a purely probabilistic advice approach as well as the early and importance advising approaches allowing to collect more reward and also to converge faster.
UR - https://www.scopus.com/pages/publications/85050360077
U2 - 10.1109/DEVLRN.2017.8329809
DO - 10.1109/DEVLRN.2017.8329809
M3 - Conference contribution
AN - SCOPUS:85050360077
T3 - 7th Joint IEEE International Conference on Development and Learning and on Epigenetic Robotics, ICDL-EpiRob 2017
SP - 209
EP - 214
BT - 7th Joint IEEE International Conference on Development and Learning and on Epigenetic Robotics, ICDL-EpiRob 2017
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 7th Joint IEEE International Conference on Development and Learning and on Epigenetic Robotics, ICDL-EpiRob 2017
Y2 - 18 September 2017 through 21 September 2017
ER -