Skip to main navigation Skip to search Skip to main content

Human Decision-Making Concepts with Goal-Oriented Reasoning for Explainable Deep Reinforcement Learning

  • UNSW Sydney

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Recently, the development and integration of Artificial Intelligence (AI) has accelerated and been popularized widely throughout modern society. AI is becoming a powerful tool ranging from leisurely use to critical applications. However, due to the black-box nature of some AI approaches such as Deep Reinforcement Learning (DRL), complex AI algorithms now face growing concerns of trust in ethical and responsible decision-making. EXplainable Artificial Intelligence (XAI) is a subfield of AI focused on deriving interpretable information from incomprehensible statistics to generate explanations for an AI’s decisions. This paper proposes an architecture that combines 2 XAI techniques, Testable Concept Activation Vectors (TCAV) and Reward Decomposition, to create goal-oriented explanations. The XAI approach is tested in a simulated movement prediction environment where a DRL agent is trained to represent different human concepts and goal prioritizations; we can confidently distinguish those concepts between agents in a human-centric framework. Results obtained demonstrate our method allows users to insert their own high-level thinking into XAI and use it to generate explanations.

Original languageEnglish
Title of host publicationAI 2024
Subtitle of host publicationAdvances in Artificial Intelligence - 37th Australasian Joint Conference on Artificial Intelligence, AI 2024, Proceedings
EditorsMingming Gong, Yiliao Song, Yun Sing Koh, Wei Xiang, Derui Wang
PublisherSpringer Science and Business Media Deutschland GmbH
Pages228-240
Number of pages13
ISBN (Print)9789819603473
DOIs
StatePublished - 2025

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume15442 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Keywords

  • Artificial Intelligence
  • Explainable Artificial Intelligence
  • Neural Networks
  • Reinforcement Learning
  • Reward Decomposition

Fingerprint

Dive into the research topics of 'Human Decision-Making Concepts with Goal-Oriented Reasoning for Explainable Deep Reinforcement Learning'. Together they form a unique fingerprint.

Cite this