ModeNet: Mode Selection Network For Learned Video Coding

Théo Ladune; Pierrick Philippe; Wassim Hamidouche; Lu Zhang; Olivier Déforges

Conference paper, Year: 2020


Théo Ladune

Role: Author
PersonId : 1064711

Institut d'Électronique et des Technologies du numéRique

Orange Labs R&D [Rennes]

Pierrick Philippe

Role: Author

Orange Labs R&D [Rennes]

Wassim Hamidouche

Role: Author
PersonId : 17949
IdHAL : wassim-hamidouche
ORCID : 0000-0002-0143-1756
IdRef : 155804839

Institut d'Électronique et des Technologies du numéRique

Lu Zhang

Role: Author
PersonId : 179223
IdHAL : lu-zhang
ORCID : 0000-0002-8859-5453
IdRef : 238287084

Institut d'Électronique et des Technologies du numéRique

Olivier Déforges

Role: Author
PersonId : 838257
ORCID : 0000-0003-0750-0959

Institut d'Électronique et des Technologies du numéRique

Abstract

This paper proposes a mode selection network (ModeNet) to enhance deep learning-based video compression. Inspired by traditional video coding, ModeNet's purpose is to enable competition among several coding modes. ModeNet learns and conveys a pixel-wise partitioning of the frame, which is used to assign each pixel to the most suitable coding mode. It is trained jointly with the coding modes to minimize a rate-distortion cost. ModeNet is a flexible component that can be generalized to other systems to allow competition between different coding tools. Its benefit is studied on a P-frame coding task, where it is used to design a method for coding a frame given its prediction. ModeNet-based systems achieve compelling performance when evaluated under the Challenge on Learned Image Compression 2020 (CLIC20) P-frame coding track conditions.
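The idea of a pixel-wise partitioning scored by a rate-distortion cost can be illustrated with a toy sketch. This is not the authors' implementation: the function names `blend_modes` and `rd_cost`, the use of exactly two candidate reconstructions, the mean squared error as distortion, and the values of the rate and the multiplier `lam` are all assumptions made for illustration only.

```python
import numpy as np


def blend_modes(alpha, mode_a, mode_b):
    """Mix two candidate reconstructions pixel by pixel.

    alpha is a per-pixel weight in [0, 1] (hypothetical mask):
    1 selects mode_a, 0 selects mode_b.
    """
    alpha = np.clip(alpha, 0.0, 1.0)
    return alpha * mode_a + (1.0 - alpha) * mode_b


def rd_cost(original, reconstruction, rate_bits, lam):
    """Rate-distortion cost J = D + lam * R (D assumed to be the MSE)."""
    distortion = float(np.mean((original - reconstruction) ** 2))
    return distortion + lam * rate_bits


# Toy 2x2 frame: each candidate is accurate on only one row.
original = np.array([[0.0, 1.0], [2.0, 3.0]])
prediction = np.array([[0.0, 1.0], [0.0, 0.0]])  # accurate on the top row
coded = np.array([[0.5, 0.5], [2.0, 3.0]])       # accurate on the bottom row
alpha = np.array([[1.0, 1.0], [0.0, 0.0]])       # top row: prediction, bottom row: coded

recon = blend_modes(alpha, prediction, coded)
cost = rd_cost(original, recon, rate_bits=10.0, lam=0.01)
```

In this toy case the mask picks the accurate candidate for every pixel, so the distortion term vanishes and the cost reduces to the rate term. In a learned system the mask would be produced by a network and the rate would come from an entropy model rather than a fixed number.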

Keywords

Video Coding Autoencoder Deep Learning Coding Mode Selection

Domains

Signal and image processing [eess.SP]; Neural networks [cs.NE]

Main file

article.pdf (2.12 MB)

Origin: Files produced by the author(s)

https://hal.science/hal-02888453

Submitted on: Friday, July 24, 2020, 14:05:41

Last modified on: Friday, March 24, 2023, 14:53:18

Dates and versions

hal-02888453, version 1 (3 July 2020)

hal-02888453, version 2 (24 July 2020)

Identifiers

HAL Id: hal-02888453, version 2
arXiv: 2007.02532

Cite

Théo Ladune, Pierrick Philippe, Wassim Hamidouche, Lu Zhang, Olivier Déforges. ModeNet: Mode Selection Network For Learned Video Coding. Machine Learning for Signal Processing (MLSP) 2020, Sep 2020, Espoo, Finland. ⟨hal-02888453v2⟩

Collections

UNIV-NANTES UNIV-RENNES1 CNRS INSA-RENNES IETR SUP_IETR CENTRALESUPELEC UR1-MATH-STIC UR1-UFR-ISTIC IETR-VAADER UNIV-RENNES INSA-GROUPE UR1-MATH-NUM NANTES-UNIVERSITE

106 views

63 downloads
