An Image-Inspired Audio Sharpness Index

Abstract : We propose a new non-intrusive (reference-free) objective measure of speech intelligibility that is inspired from previous works on image sharpness. We define the audio Sharpness Index (aSI) as the sensitivity of the spectrogram sparsity to the convolution of the signal with a white noise, and we calculate a closed-form formula of the aSI. Experiments with various speakers, noise and reverberation conditions show a high correlation between the aSI and the well-established Speech Transmission Index (STI), which is intrusive (full-reference). Additionally, the aSI can be used as an intelligibility or clarity criterion to drive sound enhancement algorithms. Experimental results on stereo mixtures of two sounds show that blind source separation based on aSI maximization performs well for speech and for music.
Type de document :
Pré-publication, Document de travail
MAP5 2017-12. 2017
Liste complète des métadonnées

Littérature citée [19 références]  Voir  Masquer  Télécharger
Contributeur : Lionel Moisan <>
Soumis le : samedi 27 mai 2017 - 20:07:32
Dernière modification le : jeudi 11 janvier 2018 - 06:27:35
Document(s) archivé(s) le : lundi 28 août 2017 - 17:17:28


Fichiers produits par l'(les) auteur(s)


  • HAL Id : hal-01528172, version 1


Gaël Mahé, Lionel Moisan, Mihai Mitrea. An Image-Inspired Audio Sharpness Index. MAP5 2017-12. 2017. 〈hal-01528172〉



Consultations de la notice


Téléchargements de fichiers