Speaker Utterances tying among speaker segmented audio documents using hierarchical classification: towards speaker indexing of audio databases - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2002

Speaker Utterances tying among speaker segmented audio documents using hierarchical classification: towards speaker indexing of audio databases

Résumé

Speaker indexing of an audio database consists in organizing the audio data according to the speakers present in the database. It is composed of three steps: (1) segmentation by speakers of each audio document; (2) speaker tying among the various segmented portions of the audio documents; and (3) generation of a speaker- based index. This paper focuses on the second step, the speaker tying task, which has not been addressed in the literature. The re- sult of this task is a classification of the segmented acoustic data by clusters; each cluster should represent one speaker. This paper investigates on hierarchical classification approaches for speaker tying. Two new discriminant dissimilarity measures and a new bottom-up algorithm are also proposed. The experiments are con- ducted on a subset of the Switchboard database, a conversational telephone database, and show that the proposed method allows a very satisfying speaker tying among various audio documents, with a good level of purity for the clusters, but with a number of clusters significantly higher than the number of speakers.
Fichier principal
Vignette du fichier
mei-icslp2002.pdf (142.86 Ko) Télécharger le fichier
Origine : Fichiers éditeurs autorisés sur une archive ouverte
Loading...

Dates et versions

hal-01434586 , version 1 (29-03-2017)

Identifiants

  • HAL Id : hal-01434586 , version 1

Citer

Sylvain Meignier, Jean-François Bonastre, Ivan Magrin-Chagnolleau. Speaker Utterances tying among speaker segmented audio documents using hierarchical classification: towards speaker indexing of audio databases. ISCA International Conference on Spoken Language Processing (ICSLP 2002), 2002, Denver, CO, United States. pp.577--580. ⟨hal-01434586⟩

Collections

UNIV-AVIGNON LIA
111 Consultations
49 Téléchargements

Partager

Gmail Facebook X LinkedIn More