SPEAKER DIARIZATION OF HETEROGENEOUS WEB VIDEO FILES: A PRELIMINARY STUDY

Abstract : Over the last ten years, the Internet and its applications have changed significantly, mainly thanks to the rise in available personal resources. Concerning multimedia, the most impressive evolution is the continuously growing success of video-sharing websites. But with this success comes the difficulty of efficiently searching, indexing and accessing relevant information about these documents. Speaker diarization is an important task in the overall information retrieval process. This paper describes an audio/video database, built specifically for the speaker diarization task, based on different video genres. Through some preliminary experiments, it highlights the difficulties encountered in this context, mainly linked to the heterogeneity of the database.
Index Terms: speaker diarization, heterogeneous web videos, diarization error rate
Document type :
Conference papers

https://hal.archives-ouvertes.fr/hal-01314783
Contributor : Bibliothèque Universitaire Déposants Hal-Avignon
Submitted on : Thursday, May 12, 2016 - 9:55:13 AM
Last modification on : Friday, March 29, 2019 - 2:36:04 PM

Identifiers

  • HAL Id : hal-01314783, version 1

Citation

Pierre Clement, Thierry Bazillon, Corinne Fredouille. SPEAKER DIARIZATION OF HETEROGENEOUS WEB VIDEO FILES: A PRELIMINARY STUDY. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), May 2011, Prague, Czech Republic. ⟨hal-01314783⟩
