Localization and Selection of Speaker Specific Information with Statistical Modeling

Abstract : Statistical modeling of the speech signal has been widely used in speaker recognition. The performance obtained with this type of modeling is excellent in laboratories but decreases dramatically for telephone or noisy speech. Moreover, it is difficult to know which piece of information is taken into account by the system. In order to solve this problem and to improve the current systems, a better understanding of the nature of the information used by statistical methods is needed. This knowledge should allow to select only the relevant information or to add new sources of information. The first part of this paper presents experiments that aim at localizing the most useful acoustic events for speaker recognition. The relation between the discriminant ability and the speech's events nature is studied. Particularly, the phonetic content, the signal stability and the frequency domain are explored. Finally, the potential of dynamic information contained in the relation between a frame and its p neighbours is investigated. In the second part, the authors suggest a new selection procedure designed to select the pertinent features. Conventional feature selection techniques (ascendant selection, knockout) allow only global and a posteriori knowledge about the relevance of an information source. However, some speech clusters may be very efficient to recognize a particular speaker, whereas they can be non informative for another one. Moreover, some information classes may be corrupted or even missing for particular recording conditions. This necessity for
Complete list of metadatas

Cited literature [34 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-02157126
Contributor : Jean-François Bonastre <>
Submitted on : Saturday, June 15, 2019 - 12:09:13 PM
Last modification on : Wednesday, June 19, 2019 - 1:26:20 AM

File

10.1.1.10.8986.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-02157126, version 1

Collections

Citation

L Besacier, J.F. Bonastre, C. Fredouille. Localization and Selection of Speaker Specific Information with Statistical Modeling. Speech Communication, Elsevier : North-Holland, 2000. ⟨hal-02157126⟩

Share

Metrics

Record views

8

Files downloads

10