Modeling Interactions of Autonomous Vehicles and Pedestrians with Deep Multi-Agent Reinforcement Learning for Collision Avoidance

Raphael Trumpp; Harald Bayerlein; David Gesbert

Pré-Publication, Document De Travail Année : 2021

Modeling Interactions of Autonomous Vehicles and Pedestrians with Deep Multi-Agent Reinforcement Learning for Collision Avoidance

(1) , (2) , (2)

1
2

Raphael Trumpp

Fonction : Auteur

Technische Universität Munchen - Technical University Munich - Université Technique de Munich

Harald Bayerlein

Fonction : Auteur

Eurecom [Sophia Antipolis]

David Gesbert

Fonction : Auteur
PersonId : 846409
ORCID : 0000-0002-4806-704X
IdRef : 057482535

Eurecom [Sophia Antipolis]

Résumé

Reliable pedestrian crash avoidance mitigation (PCAM) systems are crucial components of safe autonomous vehicles (AVs). The sequential nature of the vehicle-pedestrian interaction, i.e., where immediate decisions of one agent directly influence the following decisions of the other agent, is an often neglected but important aspect. In this work, we model the corresponding interaction sequence as a Markov decision process (MDP) that is solved by deep reinforcement learning (DRL) algorithms to define the PCAM system's policy. The simulated driving scenario is based on an AV acting as a DRL agent driving along an urban street, facing a pedestrian at an unmarked crosswalk who tries to cross. Since modeling realistic crossing behavior of the pedestrian is challenging, we introduce two levels of intelligent pedestrian behavior: While the baseline model follows a predefined strategy, our advanced model captures continuous learning and the inherent uncertainty in human behavior by defining the pedestrian as a second DRL agent, i.e., we introduce a deep multi-agent reinforcement learning (DMARL) problem. The presented PCAM system with different levels of intelligent pedestrian behavior is benchmarked according to the agents' collision rate and the resulting traffic flow efficiency. In this analysis, our focus lies on evaluating the influence of observation noise on the decision making of the agents. The results show that the AV is able to completely mitigate collisions under the majority of the investigated conditions and that the DRL-based pedestrian model indeed learns a more human-like crossing behavior.

Domaines

Ingénierie assistée par ordinateur

Fichier principal

IV_Conference_2022.pdf (388.44 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Centre De Documentation Eurecom : Connectez-vous pour contacter le contributeur

https://hal.science/hal-03372895

Soumis le : mercredi 19 octobre 2022-11:28:46

Dernière modification le : lundi 13 novembre 2023-10:56:04

Archivage à long terme le : vendredi 20 janvier 2023-19:33:38

Dates et versions

hal-03372895 , version 1 (19-10-2022)

Identifiants

HAL Id : hal-03372895 , version 1
ARXIV : 2109.15266

Citer

Raphael Trumpp, Harald Bayerlein, David Gesbert. Modeling Interactions of Autonomous Vehicles and Pedestrians with Deep Multi-Agent Reinforcement Learning for Collision Avoidance. 2021. ⟨hal-03372895⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

EURECOM 3IA-COTEDAZUR ANR

57 Consultations

18 Téléchargements

Modeling Interactions of Autonomous Vehicles and Pedestrians with Deep Multi-Agent Reinforcement Learning for Collision Avoidance

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager