TY - GEN
T1 - Moody Learners-Explaining Competitive Behaviour of Reinforcement Learning Agents
AU - Barros, Pablo
AU - Tanevska, Ana
AU - Cruz, Francisco
AU - Sciutti, Alessandra
N1 - Publisher Copyright:
© ICDL-EpiRob 2020. All rights reserved.
PY - 2020/10/26
Y1 - 2020/10/26
N2 - Designing the decision-making processes of artificial agents that are involved in competitive interactions is a challenging task. In a competitive scenario, the agent does not only have a dynamic environment but also is directly affected by the opponents' actions. Observing the Q-values of the agent is usually a way of explaining its behavior, however, it does not show the temporal-relation between the selected actions. We address this problem by proposing the Moody framework that creates an intrinsic representation for each agent based on the Pleasure/Arousal model. We evaluate our model by performing a series of experiments using the competitive multiplayer Chef's Hat card game and discuss how by observing the intrinsic state generated by our model allows us to obtain a holistic representation of the competitive dynamics within the game.
AB - Designing the decision-making processes of artificial agents that are involved in competitive interactions is a challenging task. In a competitive scenario, the agent does not only have a dynamic environment but also is directly affected by the opponents' actions. Observing the Q-values of the agent is usually a way of explaining its behavior, however, it does not show the temporal-relation between the selected actions. We address this problem by proposing the Moody framework that creates an intrinsic representation for each agent based on the Pleasure/Arousal model. We evaluate our model by performing a series of experiments using the competitive multiplayer Chef's Hat card game and discuss how by observing the intrinsic state generated by our model allows us to obtain a holistic representation of the competitive dynamics within the game.
KW - Explainable artificial intelligence
KW - Intrinsic confidence
KW - Reinforcement learning
UR - https://www.scopus.com/pages/publications/85095521006
U2 - 10.1109/ICDL-EpiRob48136.2020.9278125
DO - 10.1109/ICDL-EpiRob48136.2020.9278125
M3 - Conference contribution
AN - SCOPUS:85095521006
T3 - ICDL-EpiRob 2020 - 10th IEEE International Conference on Development and Learning and Epigenetic Robotics
BT - ICDL-EpiRob 2020 - 10th IEEE International Conference on Development and Learning and Epigenetic Robotics
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 10th Joint IEEE International Conference on Development and Learning and Epigenetic Robotics, ICDL-EpiRob 2020
Y2 - 26 October 2020 through 30 October 2020
ER -