Local Feature Swapping for Generalization in Reinforcement Learning - Institut de Mathématiques de Toulouse Accéder directement au contenu
Pré-Publication, Document De Travail Année : 2022

Local Feature Swapping for Generalization in Reinforcement Learning

Résumé

Over the past few years, the acceleration of computing resources and research in deep learning has led to significant practical successes in a range of tasks, including in particular in computer vision. Building on these advances, reinforcement learning has also seen a leap forward with the emergence of agents capable of making decisions directly from visual observations. Despite these successes, the over-parametrization of neural architectures leads to memorization of the data used during training and thus to a lack of generalization. Reinforcement learning agents based on visual inputs also suffer from this phenomenon by erroneously correlating rewards with unrelated visual features such as background elements. To alleviate this problem, we introduce a new regularization technique consisting of channel-consistent local permutations (CLOP) of the feature maps. The proposed permutations induce robustness to spatial correlations and help prevent overfitting behaviors in RL. We demonstrate, on the OpenAI Procgen Benchmark, that RL agents trained with the CLOP method exhibit robustness to visual changes and better generalization properties than agents trained using other state-of-the-art regularization techniques. We also demonstrate the effectiveness of CLOP as a general regularization technique in supervised learning.
Fichier principal
Vignette du fichier
DiCyR_Hal_Arxiv.pdf (4.67 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03632784 , version 1 (12-04-2022)
hal-03632784 , version 2 (16-09-2022)
hal-03632784 , version 3 (01-10-2022)

Identifiants

Citer

David Bertoin, Emmanuel Rachelson. Local Feature Swapping for Generalization in Reinforcement Learning. 2022. ⟨hal-03632784v2⟩
85 Consultations
71 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More