TY - GEN
T1 - Human feedback in continuous actor-critic reinforcement learning
AU - Millán, Cristian
AU - Fernandes, Bruno
AU - Cruz, Francisco
N1 - Publisher Copyright:
© 2019 ESANN (i6doc.com). All rights reserved.
PY - 2019
Y1 - 2019
N2 - Reinforcement learning is utilized in contexts where an agent tries to learn from the environment. Using continuous actions, the performance may be improved in comparison to using discrete actions, however, this leads to excessive time to find a proper policy. In this work, we focus on including human feedback in reinforcement learning for a continuous action space. We unify the policy and the feedback to favor actions of low probability density. Furthermore, we compare the performance of the feedback for the continuous actor-critic algorithm and test our experiments in the cart-pole balancing task. The obtained results show that the proposed approach increases the accumulated reward in comparison to the autonomous learning method.
AB - Reinforcement learning is utilized in contexts where an agent tries to learn from the environment. Using continuous actions, the performance may be improved in comparison to using discrete actions, however, this leads to excessive time to find a proper policy. In this work, we focus on including human feedback in reinforcement learning for a continuous action space. We unify the policy and the feedback to favor actions of low probability density. Furthermore, we compare the performance of the feedback for the continuous actor-critic algorithm and test our experiments in the cart-pole balancing task. The obtained results show that the proposed approach increases the accumulated reward in comparison to the autonomous learning method.
UR - https://www.scopus.com/pages/publications/85071323792
M3 - Conference contribution
AN - SCOPUS:85071323792
T3 - ESANN 2019 - Proceedings, 27th European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning
SP - 661
EP - 666
BT - ESANN 2019 - Proceedings, 27th European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning
PB - ESANN (i6doc.com)
T2 - 27th European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning, ESANN 2019
Y2 - 24 April 2019 through 26 April 2019
ER -