Ir directamente a la navegación principal Ir directamente a la búsqueda Ir directamente al contenido principal

Human feedback in continuous actor-critic reinforcement learning

  • Universidade de Pernambuco

Producción científica: Capítulo del libro/informe/acta de congresoContribución a la conferenciarevisión exhaustiva

16 Citas (Scopus)

Resumen

Reinforcement learning is utilized in contexts where an agent tries to learn from the environment. Using continuous actions, the performance may be improved in comparison to using discrete actions, however, this leads to excessive time to find a proper policy. In this work, we focus on including human feedback in reinforcement learning for a continuous action space. We unify the policy and the feedback to favor actions of low probability density. Furthermore, we compare the performance of the feedback for the continuous actor-critic algorithm and test our experiments in the cart-pole balancing task. The obtained results show that the proposed approach increases the accumulated reward in comparison to the autonomous learning method.

Idioma originalInglés
Título de la publicación alojadaESANN 2019 - Proceedings, 27th European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning
EditorialESANN (i6doc.com)
Páginas661-666
Número de páginas6
ISBN (versión digital)9782875870650
EstadoPublicada - 2019

Serie de la publicación

NombreESANN 2019 - Proceedings, 27th European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning

Huella

Profundice en los temas de investigación de 'Human feedback in continuous actor-critic reinforcement learning'. En conjunto forman una huella única.

Citar esto