Two Multimodal Approaches for Single Microphone Source Separation

Farnaz Sedighin; Massoud Babaie-Zadeh; Bertrand Rivet; Christian Jutten

Communication Dans Un Congrès Année : 2016

Two Multimodal Approaches for Single Microphone Source Separation

(1) , (1) , (2) , (2)

1
2

Farnaz Sedighin

Fonction : Auteur
PersonId : 993946

Department of Electrical Engineering [Tehran]

Massoud Babaie-Zadeh

Fonction : Auteur

Department of Electrical Engineering [Tehran]

Bertrand Rivet

Fonction : Auteur
PersonId : 1783
IdHAL : rivetb
ORCID : 0000-0003-4901-5302
IdRef : 113674422

GIPSA - Vision and Brain Signal Processing

Christian Jutten

Fonction : Auteur
PersonId : 4384
IdHAL : christianjutten
ORCID : 0000-0002-4477-4847
IdRef : 032689896

GIPSA - Vision and Brain Signal Processing

Résumé

—In this paper, the problem of single microphone source separation via Nonnegative Matrix Factorization (NMF) by exploiting video information is addressed. Respective audio and video modalities coming from a single human speech usually have similar time changes. It means that changes in one of them usually corresponds to changes in the other one. So it is expected that activation coefficient matrices of their NMF decomposition are similar. Based on this similarity, in this paper the activation coefficient matrix of the video modality is used as an initialization for audio source separation via NMF. In addition, the mentioned similarity is used for post-processing and for clustering the rows of the activation coefficient matrix which were resulted from randomly initialized NMF. Simulation results confirm the effectiveness of the proposed multimodal approaches in single microphone source separation.

Mots clés

Multimodal source separation Nonnegative matrix factorization Single microphone source separation

Domaines

Traitement du signal et de l'image [eess.SP]

Fichier principal

1570251892.pdf (189.02 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Bertrand Rivet : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01400542

Soumis le : mardi 22 novembre 2016-10:25:45

Dernière modification le : jeudi 4 avril 2024-21:07:34

Archivage à long terme le : mardi 21 mars 2017-00:34:22

Dates et versions

hal-01400542 , version 1 (22-11-2016)

Identifiants

HAL Id : hal-01400542 , version 1

Citer

Farnaz Sedighin, Massoud Babaie-Zadeh, Bertrand Rivet, Christian Jutten. Two Multimodal Approaches for Single Microphone Source Separation. EUSIPCO 2016 - 24th European Signal Processing Conference, Aug 2016, Budapest, Hungary. pp.110-114. ⟨hal-01400542⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UGA CNRS GIPSA GIPSA-DIS GIPSA-VIBS POLYTECH-GRENOBLE

301 Consultations

328 Téléchargements

Two Multimodal Approaches for Single Microphone Source Separation

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager