Skip to Main content Skip to Navigation
Conference papers

Data Quality Matters: Iterative Corrections on a Corpus of Mendelssohn String Quartets and Implications for MIR Analysis

Jacob Degroot-Maggetti 1 Timothy de Reuse 1 Laurent Feisthauer 2, 3 Samuel Howes 1 Yaolong Ju 1 Suzaka Kokubu 1 Sylvain Margot 1 Néstor Nápoles López 1 Finn Upham 1 
2 Algomus
MIS - Modélisation, Information et Systèmes - UR UPJV 4290, CRIStAL - Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189
Abstract : In this paper, we describe a workflow of successive corrections on Optical Music Recognition (OMR) generated MusicXML files and their respective outputs under Music Information Retrieval (MIR) tasks. The original OMR-generated files of six Mendelssohn String Quartets were initially corrected by individual members of this interdisciplinary group, then reviewed by others to further standardize the quality and music analysis priorities of the team. Four MIR tasks are applied to each round of corrections on this collection: cadence detection, chord labeling, key finding, and monophonic pattern discovery. We measure changes in the outputs of these four MIR tasks from one round of corrections to the next in order to evaluate the impact of corrections. Results show that expert revision is more beneficial to some MIR tasks than to others. The resulting corpus of curated MusicXML files is available as an open-source repository under a Creative Commons Attribution 4.0 International License for further MIR research.
Complete list of metadata

Cited literature [17 references]  Display  Hide  Download
Contributor : Mathieu Giraud Connect in order to contact the contributor
Submitted on : Friday, November 6, 2020 - 10:58:39 AM
Last modification on : Wednesday, September 7, 2022 - 8:14:05 AM
Long-term archiving on: : Sunday, February 7, 2021 - 6:35:38 PM


Publisher files allowed on an open archive


  • HAL Id : hal-02934884, version 1


Jacob Degroot-Maggetti, Timothy de Reuse, Laurent Feisthauer, Samuel Howes, Yaolong Ju, et al.. Data Quality Matters: Iterative Corrections on a Corpus of Mendelssohn String Quartets and Implications for MIR Analysis. International Society for Music Information Retrieval Conference (ISMIR 2020), 2020, Montréal, Canada. ⟨hal-02934884⟩



Record views


Files downloads