On normalized mean square error analysis of speech fundamental frequency in the cochlear implant-like spectrally reduced speech - Archive ouverte HAL Accéder directement au contenu
Article Dans Une Revue IEEE Transactions on Biomedical Engineering Année : 2010

On normalized mean square error analysis of speech fundamental frequency in the cochlear implant-like spectrally reduced speech

Résumé

In this paper, we present a quantitative study on the speech fundamental frequency (F0) of the cochlear implantlike spectrally reduced speech (SRS). The SRS was synthesized from the subband amplitude and frequency modulations (AM and FM) of original clean speech utterances selected from the TIdigits database. The SRS synthesis algorithm was derived from the frequency amplitude modulation encoding (FAME) strategy, proposed by Nie et al., 2005. The normalized mean square errors (NMSEs), calculated between the F0 of the original clean speech and that of the SRSs, were analyzed. The NMSEs analysis of F0 revealed the greater F0 distortion in the AM-based SRS, which is the acoustic simulation of present-day cochlear implants, compared to the FAME-based SRS. This evidence supports the fact that current cochlear implant users have difficulty in the speaker recognition task as reported by Zeng et al., 2005. Further, the analysis results showed that it is better to keep the rapidly varying FM components to reduce the F0 distortion in the FAMEbased SRS at low spectral resolution.
Fichier non déposé

Dates et versions

hal-00472883 , version 1 (13-04-2010)

Identifiants

Citer

Cong Thanh Do, Dominique Pastor, André Goalic. On normalized mean square error analysis of speech fundamental frequency in the cochlear implant-like spectrally reduced speech. IEEE Transactions on Biomedical Engineering, 2010, 57 (3), pp.572 - 577. ⟨10.1109/TBME.2009.2031097⟩. ⟨hal-00472883⟩
72 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More