Proxemic behavior in navigation tasks using reinforcement learning

  • Universidade de Pernambuco
  • UNSW Sydney

Research output: Contribution to journal › Article › peer-review

4 Scopus citations

Abstract

Human interaction starts when one person approaches another while respecting their personal space, so as not to cause discomfort. This spatial behavior, called proxemics, defines an acceptable distance at which the interaction can begin appropriately. In recent decades, human-agent interaction has attracted growing research interest, with the aim of having artificial agents interact naturally with people. New approaches are therefore needed that allow effective communication without making humans feel uncomfortable. Several works address proxemic behavior in cognitive agents using human-robot interaction techniques and machine learning. However, these works assume that the personal space is fixed and known in advance, and the agent is only expected to follow an optimal trajectory toward the person. In this work, we study the behavior of a reinforcement learning agent in a proxemic-based environment. Experiments were carried out on a grid-world problem and on a continuous simulated robotic approaching environment. Both environments assume an issuer agent that provides non-conformity information. Our results suggest that the agent can identify the regions where the issuer feels uncomfortable and find the best path to approach the issuer. These results highlight the usefulness of reinforcement learning for identifying proxemic regions.
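
The grid-world setting described in the abstract can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the 5x5 grid, the positions `ISSUER`, `GOAL`, and `START`, the set `UNCOMFORTABLE`, the reward values, and the use of tabular Q-learning. The issuer's non-conformity information is modeled simply as an extra negative reward whenever the agent enters a cell in the issuer's uncomfortable region, which the agent does not know in advance.

```python
import random

# All constants below are hypothetical and chosen only for illustration.
SIZE = 5
ISSUER = (2, 2)            # cell occupied by the issuer (cannot be entered)
GOAL = (2, 1)              # assumed acceptable cell from which to interact
START = (2, 4)             # agent starts on the far side of the issuer
UNCOMFORTABLE = {(1, 2), (1, 3), (2, 3), (3, 2), (3, 3)}
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right


def step(state, action):
    """Apply an action; return (next_state, reward, done)."""
    r, c = state
    dr, dc = ACTIONS[action]
    nxt = (r + dr, c + dc)
    # Moves off the grid or into the issuer leave the agent in place.
    if not (0 <= nxt[0] < SIZE and 0 <= nxt[1] < SIZE) or nxt == ISSUER:
        nxt = state
    if nxt == GOAL:
        return nxt, 20.0, True
    reward = -1.0              # step cost
    if nxt in UNCOMFORTABLE:
        reward -= 10.0         # assumed non-conformity signal from the issuer
    return nxt, reward, False


def train(episodes=3000, alpha=0.5, gamma=0.95, epsilon=0.3, seed=0):
    """Tabular Q-learning with an epsilon-greedy behavior policy."""
    rng = random.Random(seed)
    q = {(r, c): [0.0] * 4 for r in range(SIZE) for c in range(SIZE)}
    for _ in range(episodes):
        state = START
        for _ in range(100):   # cap on episode length
            if rng.random() < epsilon:
                action = rng.randrange(4)
            else:
                action = max(range(4), key=lambda a: q[state][a])
            nxt, reward, done = step(state, action)
            target = reward + (0.0 if done else gamma * max(q[nxt]))
            q[state][action] += alpha * (target - q[state][action])
            state = nxt
            if done:
                break
    return q


def greedy_path(q, max_steps=50):
    """Follow the greedy policy from START and record the visited cells."""
    state, path = START, [START]
    for _ in range(max_steps):
        action = max(range(4), key=lambda a: q[state][a])
        state, _, done = step(state, action)
        path.append(state)
        if done:
            break
    return path


q_table = train()
path = greedy_path(q_table)
print(path)
```

In this toy setup the shortest route to the goal cuts through the penalized cells, so a learned policy that detours around them indicates that the agent has inferred the uncomfortable region from the reward signal alone. The continuous robotic environment mentioned in the abstract is not covered by this sketch.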

Original language: English
Pages (from-to): 16723-16738
Number of pages: 16
Journal: Neural Computing and Applications
Volume: 35
Issue number: 23
State: Published - Aug 2023

Keywords

  • Cognitive agents
  • Proxemics
  • Reinforcement learning