Audio Indexing Including Frequency Tracking of Simultaneous Multiple Sources in Speech and Music

Abstract : In this paper, we present a complete system for audio indexing. This system is based state-of-the-art methods of Speech-Music-Noise segmentation and Monophonic/Polyphonic estimation. After those methods we propose an original system of superposed sources detection. This approach is based on the analysis of the evolution of the predominant frequencies. In order to validate the whole system we used different corpora : Radio broadcasts, studio music and degraded field records. The first results are encouraging and show the potential of our approach which is generic and can be used on both music and speech contents.
Liste complète des métadonnées

Cited literature [10 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01228711
Contributor : Open Archive Toulouse Archive Ouverte (oatao) <>
Submitted on : Friday, November 13, 2015 - 4:01:47 PM
Last modification on : Friday, April 12, 2019 - 4:23:02 PM
Document(s) archivé(s) le : Sunday, February 14, 2016 - 1:56:24 PM

File

lecoz_12639.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01228711, version 1
  • OATAO : 12639

Collections

Citation

Maxime Le Coz, Julien Pinquier, Régine André-Obrecht, Julie Mauclair. Audio Indexing Including Frequency Tracking of Simultaneous Multiple Sources in Speech and Music. 11th International Workshop on Content-Based Multimedia Indexing (CBMI 2013), Jun 2013, Veszprem, Hungary. pp. 23-25. ⟨hal-01228711⟩

Share

Metrics

Record views

108

Files downloads

75