Modeling Driver Behavior From Demonstrations in Dynamic Environments Using Spatiotemporal Lattices

Abstract : One of the most challenging tasks in the development of path planners for intelligent vehicles is the design of the cost function that models the desired behavior of the vehicle. While this task has been traditionally accomplished by hand-tuning the model parameters, recent approaches propose to learn the model automatically from demonstrated driving data using Inverse Reinforcement Learning (IRL). To determine if the model has correctly captured the demonstrated behavior, most IRL methods require obtaining a policy by solving the forward control problem repetitively. Calculating the full policy is a costly task in continuous or large domains and thus often approximated by finding a single trajectory using traditional path-planning techniques. In this work, we propose to find such a trajectory using a conformal spatiotemporal state lattice, which offers two main advantages. First, by conforming the lattice to the environment, the search is focused only on feasible motions for the robot, saving computational power. And second, by considering time as part of the state, the trajectory is optimized with respect to the motion of the dynamic obstacles in the scene. As a consequence, the resulting trajectory can be used for the model assessment. We show how the proposed IRL framework can successfully handle highly dynamic environments by modeling the highway tactical driving task from demonstrated driving data gathered with an instrumented vehicle.
Type de document :
Communication dans un congrès
ICRA 2018 - Proceedings of the 2018 IEEE International Conference on Robotics and Automation, May 2018, Brisbane, Australia. pp.3384-3390, 〈10.1109/ICRA.2018.8460208〉
Liste complète des métadonnées

Littérature citée [40 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-01729960
Contributeur : David Sierra González <>
Soumis le : lundi 12 mars 2018 - 18:45:30
Dernière modification le : vendredi 16 novembre 2018 - 11:00:57
Document(s) archivé(s) le : mercredi 13 juin 2018 - 15:02:00

Identifiants

Collections

Citation

David Sierra González, Özgür Erkent, Víctor Romero-Cano, Jilles Dibangoye, Christian Laugier. Modeling Driver Behavior From Demonstrations in Dynamic Environments Using Spatiotemporal Lattices. ICRA 2018 - Proceedings of the 2018 IEEE International Conference on Robotics and Automation, May 2018, Brisbane, Australia. pp.3384-3390, 〈10.1109/ICRA.2018.8460208〉. 〈hal-01729960〉

Partager

Métriques

Consultations de la notice

604

Téléchargements de fichiers

383