Singer diarization: application to ethnomusicological recordings

Abstract : A music audio document can be structured automatically by many ways according to the final objective. In the context of a project on indexing ethno-musicological au­dio documents, we asked ourselves the questions: who is singing and when. By analogy with speaker diarization which consists in detecting who is speaking and when, we called the fact of detecting changes of singers, singer diarization. Figure 1 illustrates the task. The ground truth consists of a manual annotation in singing turns, and eventual entry/exit of instruments. In the context of the ANR DIADEMS1 project (Descrip­tion, Indexing, Access to ethno-musicological and Sound Documents) on indexing ethno-musicological audio doc­ uments, singer diarization automatically appeared to be essential. In this paper, we present our developed singer diarization system which is applied on ethno­ musicological recordings.
Complete list of metadatas

Cited literature [5 references]  Display  Hide  Download
Contributor : Open Archive Toulouse Archive Ouverte (oatao) <>
Submitted on : Friday, June 9, 2017 - 5:33:13 PM
Last modification on : Friday, June 14, 2019 - 6:31:15 PM
Long-term archiving on : Sunday, September 10, 2017 - 1:26:05 PM


Files produced by the author(s)


  • HAL Id : hal-01535953, version 1
  • OATAO : 16911


Marwa Thlithi, Claude Barras, Julien Pinquier, Thomas Pellegrini. Singer diarization: application to ethnomusicological recordings. 5th International workshop on Folk Music Anaysis (FMA 2015), Jun 2015, Paris, France. pp. 124-125. ⟨hal-01535953⟩



Record views


Files downloads