Multichannel audio source separation with deep neural networks - Archive ouverte HAL Accéder directement au contenu
Rapport (Rapport De Recherche) Année : 2015

Multichannel audio source separation with deep neural networks

Résumé

This technical report considers the problem of multichannel audio source separation. A few studies have addressed the problem of single-channel audio source separation with deep neural networks (DNNs). We introduce a new framework for multichannel source separation where (1) spectral and spatial parameters are updated iteratively similarly to the expectation-maximization (EM) algorithm and (2) DNNs are used in the spectral updates. We evaluated several systems based on the proposed framework by participating in the "professionally-produced music recording" task of SiSEC 2015. Experimental results show that the framework performed well in separating singing voice and other instruments from a mixture containing multiple musical instruments.
Fichier principal
Vignette du fichier
RR-8740.pdf (709.37 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-01163369 , version 1 (12-06-2015)
hal-01163369 , version 2 (16-07-2015)
hal-01163369 , version 3 (05-02-2016)
hal-01163369 , version 4 (12-05-2016)
hal-01163369 , version 5 (21-06-2016)

Identifiants

  • HAL Id : hal-01163369 , version 1

Citer

Aditya Arie Nugraha, Antoine Liutkus, Emmanuel Vincent. Multichannel audio source separation with deep neural networks. [Research Report] RR-8740, INRIA. 2015. ⟨hal-01163369v1⟩
2145 Consultations
10565 Téléchargements

Partager

Gmail Facebook X LinkedIn More