ModeNet: Mode Selection Network For Learned Video Coding - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2020

ModeNet: Mode Selection Network For Learned Video Coding

Résumé

In this paper, a mode selection network (ModeNet) is proposed to enhance deep learning-based video compression. Inspired by traditional video coding, ModeNet purpose is to enable competition among several coding modes. The proposed ModeNet learns and conveys a pixel-wise partitioning of the frame, used to assign each pixel to the most suited coding mode. ModeNet is trained alongside the different coding modes to minimize a rate-distortion cost. It is a flexible component which can be generalized to other systems to allow competition between different coding tools. Mod-eNet interest is studied on a P-frame coding task, where it is used to design a method for coding a frame given its prediction. ModeNet-based systems achieve compelling performance when evaluated under the Challenge on Learned Image Compression 2020 (CLIC20) P-frame coding track conditions.
Fichier principal
Vignette du fichier
article.pdf (2.12 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-02888453 , version 1 (03-07-2020)
hal-02888453 , version 2 (24-07-2020)

Identifiants

Citer

Théo Ladune, Pierrick Philippe, Wassim Hamidouche, Lu Zhang, Olivier Déforges. ModeNet: Mode Selection Network For Learned Video Coding. Machine Learning for Signal Processing (MLSP) 2020, Sep 2020, Espoo, Finland. ⟨hal-02888453v2⟩
106 Consultations
63 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More