Localization and Selection of Speaker Specific Information with Statistical Modeling

L Besacier; J.F. Bonastre; C. Fredouille

Journal article, Speech Communication, Year: 2000


L Besacier

Role: Author

J.F. Bonastre

Role: Author
PersonId : 172421
IdHAL : jean-francois-bonastre
ORCID : 0000-0001-7741-3346
IdRef : 079112978

Laboratoire Informatique d'Avignon

C. Fredouille

Role: Author
PersonId : 173870
IdHAL : corinne-fredouille
ORCID : 0000-0002-0413-8950
IdRef : 079420516

Laboratoire Informatique d'Avignon

Abstract

Statistical modeling of the speech signal has been widely used in speaker recognition. The performance obtained with this type of modeling is excellent in laboratory conditions but decreases dramatically for telephone or noisy speech. Moreover, it is difficult to know which pieces of information the system takes into account. To solve this problem and improve current systems, a better understanding of the nature of the information used by statistical methods is needed. This knowledge should make it possible to select only the relevant information or to add new sources of information. The first part of this paper presents experiments that aim to localize the most useful acoustic events for speaker recognition. The relation between discriminant ability and the nature of speech events is studied. In particular, the phonetic content, the signal stability and the frequency domain are explored. Finally, the potential of the dynamic information contained in the relation between a frame and its p neighbours is investigated. In the second part, the authors propose a new selection procedure designed to select the pertinent features. Conventional feature selection techniques (ascendant selection, knockout) provide only global, a posteriori knowledge about the relevance of an information source. However, some speech clusters may be very effective for recognizing a particular speaker while being uninformative for another. Moreover, some information classes may be corrupted or even missing under particular recording conditions. This necessity for

Domains

Computer Science [cs] Artificial Intelligence [cs.AI] Human-Computer Interaction [cs.HC] Multimedia [cs.MM]

Main file

10.1.1.10.8986.pdf (204.18 KB)

Origin: Files produced by the author(s)

Contributor: Jean-François Bonastre

https://hal.science/hal-02157126

Submitted on: Saturday, 15 June 2019, 12:09:13

Last modified on: Sunday, 29 November 2020, 17:02:01

Dates and versions

hal-02157126 , version 1 (15-06-2019)

Identifiers

HAL Id : hal-02157126 , version 1

Cite

L Besacier, J.F. Bonastre, C. Fredouille. Localization and Selection of Speaker Specific Information with Statistical Modeling. Speech Communication, 2000. ⟨hal-02157126⟩

Collections

UNIV-AVIGNON LIA

33 views

73 downloads
