
Human Decision-Making Concepts with Goal-Oriented Reasoning for Explainable Deep Reinforcement Learning

  • UNSW Sydney

Research output: Chapter in book/report/conference proceeding; Conference contribution; Peer-reviewed

1 Citation (Scopus)

Abstract

Recently, the development and integration of Artificial Intelligence (AI) has accelerated and been popularized widely throughout modern society. AI is becoming a powerful tool, with uses ranging from leisure to critical applications. However, due to the black-box nature of some AI approaches such as Deep Reinforcement Learning (DRL), complex AI algorithms now face growing concerns about trust in their ethical and responsible decision-making. EXplainable Artificial Intelligence (XAI) is a subfield of AI focused on deriving interpretable information from incomprehensible statistics to generate explanations for an AI’s decisions. This paper proposes an architecture that combines two XAI techniques, Testing with Concept Activation Vectors (TCAV) and Reward Decomposition, to create goal-oriented explanations. The XAI approach is tested in a simulated movement prediction environment where a DRL agent is trained to represent different human concepts and goal prioritizations; we can confidently distinguish those concepts between agents in a human-centric framework. The results demonstrate that our method allows users to insert their own high-level thinking into XAI and use it to generate explanations.

Original language: English
Title of host publication: AI 2024
Subtitle of host publication: Advances in Artificial Intelligence - 37th Australasian Joint Conference on Artificial Intelligence, AI 2024, Proceedings
Editors: Mingming Gong, Yiliao Song, Yun Sing Koh, Wei Xiang, Derui Wang
Publisher: Springer Science and Business Media Deutschland GmbH
Pages: 228-240
Number of pages: 13
ISBN (Print): 9789819603473
DOI
Publication status: Published - 2025

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 15442 LNAI
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349
