Toward Automatic Music Audio Summary Generation from Signal Analysis

Geoffroy Peeters 1 Amaury La Burthe 1 Xavier Rodet 1
1 Analyse et synthèse sonores [Paris]
STMS - Sciences et Technologies de la Musique et du Son
Abstract : This paper deals with the automatic generation of music audio summaries from signal analysis alone, without the use of any other information. The strategy employed here is to consider the audio signal as a succession of “states” at various scales, corresponding to the structure of a piece of music at those scales. This is, of course, only applicable to musical genres based on some kind of repetition. From the audio signal, we first derive dynamic features representing the time evolution of the energy content in various frequency bands. These features constitute the observations from which we derive a representation of the music in terms of “states”. Since human segmentation and grouping perform better upon subsequent hearings, this “natural” approach is followed here. The first pass of the proposed algorithm uses segmentation in order to create “templates”. The second pass uses these templates in order to propose a structure of the music using unsupervised learning methods (K-means and hidden Markov model). The audio summary is finally constructed by choosing a representative example of each state. Further refinements of the summary construction use overlap-add and tempo detection with beat alignment to improve the audio quality of the created summary.
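The state-based idea in the abstract can be illustrated with a heavily simplified sketch. It is not the authors' implementation: it uses static log band energies instead of the paper's dynamic features, a single K-means pass with no segmentation templates and no hidden Markov model, and it stops at picking one representative frame per state (no overlap-add or beat alignment). All function names, the frame sizes, the number of bands, and the farthest-first initialization are assumptions made for illustration.

```python
import numpy as np

def band_energies(signal, frame_len=1024, hop=512, n_bands=8):
    """Log energy per frame in n_bands equal-width frequency bands (assumed layout)."""
    n_frames = 1 + (len(signal) - frame_len) // hop
    window = np.hanning(frame_len)
    feats = np.empty((n_frames, n_bands))
    for i in range(n_frames):
        frame = signal[i * hop : i * hop + frame_len] * window
        power = np.abs(np.fft.rfft(frame)) ** 2
        feats[i] = [np.log(b.sum() + 1e-12) for b in np.array_split(power, n_bands)]
    return feats

def nearest_center(feats, centers):
    """Index of the closest center (squared Euclidean distance) for each frame."""
    d = ((feats[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return d.argmin(axis=1)

def kmeans_states(feats, k, n_iter=20):
    """Plain K-means; deterministic farthest-first initialization (an assumption)."""
    centers = [feats[0]]
    for _ in range(1, k):
        d = np.min([((feats - c) ** 2).sum(axis=1) for c in centers], axis=0)
        centers.append(feats[d.argmax()])
    centers = np.array(centers, dtype=float)
    for _ in range(n_iter):
        labels = nearest_center(feats, centers)
        for j in range(k):
            members = feats[labels == j]
            if len(members):  # leave an empty cluster's center unchanged
                centers[j] = members.mean(axis=0)
    return nearest_center(feats, centers), centers

def representative_frames(feats, labels, centers):
    """For each non-empty state, the frame index closest to that state's center."""
    reps = {}
    for j in range(len(centers)):
        idx = np.flatnonzero(labels == j)
        if idx.size:
            d = ((feats[idx] - centers[j]) ** 2).sum(axis=1)
            reps[j] = int(idx[d.argmin()])
    return reps
```

Given a mono signal `x` as a NumPy array, `labels, centers = kmeans_states(band_energies(x), k=3)` yields one state label per frame, and `representative_frames` then returns one candidate frame per state from which a summary excerpt could be cut.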
Keywords : Computer music

https://hal.archives-ouvertes.fr/hal-01161322
Contributor : Ircam Ircam
Submitted on : Monday, June 8, 2015 - 2:35:59 PM
Last modification on : Monday, January 27, 2020 - 6:00:28 PM
Long-term archiving on: Tuesday, April 25, 2017 - 4:51:22 AM

File

index.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01161322, version 1

Citation

Geoffroy Peeters, Amaury La Burthe, Xavier Rodet. Toward Automatic Music Audio Summary Generation from Signal Analysis. ISMIR, Oct 2002, Paris, France. pp.1-1. ⟨hal-01161322⟩
