Audio source separation into the wild

Laurent Girin 1, 2 Sharon Gannot 3 Xiaofei Li 2
1 GIPSA-CRISSP - CRISSP
GIPSA-DPC - Département Parole et Cognition
2 PERCEPTION - Interpretation and Modelling of Images and Videos
Inria Grenoble - Rhône-Alpes, LJK - Laboratoire Jean Kuntzmann, INPG - Institut National Polytechnique de Grenoble
Abstract : This review chapter is dedicated to multichannel audio source separation in real-life environment. We explore some of the major achievements in the field and discuss some of the remaining challenges. We will explore several important practical scenarios, e.g. moving sources and/or microphones, varying number of sources and sensors, high reverberation levels, spatially diffuse sources, and synchronization problems. Several applications such as smart assistants, cellular phones, hearing aids and robots, will be discussed. Our perspectives on the future of the field will be given as concluding remarks of this chapter.
Type de document :
Chapitre d'ouvrage
Multimodal Behavior Analysis in the Wild, Academic Press (Elsevier), pp.53-78, 2018, Computer Vision and Pattern Recognition, 〈10.1016/B978-0-12-814601-9.00022-5〉
Liste complète des métadonnées

https://hal.inria.fr/hal-01943375
Contributeur : Team Perception <>
Soumis le : lundi 3 décembre 2018 - 17:02:45
Dernière modification le : vendredi 28 décembre 2018 - 18:50:55

Fichier

Book_plain.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

Citation

Laurent Girin, Sharon Gannot, Xiaofei Li. Audio source separation into the wild. Multimodal Behavior Analysis in the Wild, Academic Press (Elsevier), pp.53-78, 2018, Computer Vision and Pattern Recognition, 〈10.1016/B978-0-12-814601-9.00022-5〉. 〈hal-01943375〉

Partager

Métriques

Consultations de la notice

116

Téléchargements de fichiers

200