Skip to main navigation Skip to search Skip to main content

Moody Learners-Explaining Competitive Behaviour of Reinforcement Learning Agents

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

12 Scopus citations

Abstract

Designing the decision-making processes of artificial agents that are involved in competitive interactions is a challenging task. In a competitive scenario, the agent does not only have a dynamic environment but also is directly affected by the opponents' actions. Observing the Q-values of the agent is usually a way of explaining its behavior, however, it does not show the temporal-relation between the selected actions. We address this problem by proposing the Moody framework that creates an intrinsic representation for each agent based on the Pleasure/Arousal model. We evaluate our model by performing a series of experiments using the competitive multiplayer Chef's Hat card game and discuss how by observing the intrinsic state generated by our model allows us to obtain a holistic representation of the competitive dynamics within the game.

Original languageEnglish
Title of host publicationICDL-EpiRob 2020 - 10th IEEE International Conference on Development and Learning and Epigenetic Robotics
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781728173061
DOIs
StatePublished - 26 Oct 2020
Event10th Joint IEEE International Conference on Development and Learning and Epigenetic Robotics, ICDL-EpiRob 2020 - Virtual, Valparaiso, Chile
Duration: 26 Oct 202030 Oct 2020

Publication series

NameICDL-EpiRob 2020 - 10th IEEE International Conference on Development and Learning and Epigenetic Robotics

Conference

Conference10th Joint IEEE International Conference on Development and Learning and Epigenetic Robotics, ICDL-EpiRob 2020
Country/TerritoryChile
CityVirtual, Valparaiso
Period26/10/2030/10/20

Keywords

  • Explainable artificial intelligence
  • Intrinsic confidence
  • Reinforcement learning

Fingerprint

Dive into the research topics of 'Moody Learners-Explaining Competitive Behaviour of Reinforcement Learning Agents'. Together they form a unique fingerprint.

Cite this