Service interruption on Monday 11 July from 12:30 to 13:00: all the sites of the CCSD (HAL, EpiSciences, SciencesConf, AureHAL) will be inaccessible (network hardware connection).
Skip to Main content Skip to Navigation
Conference papers

Reducing Interference with Phase Recovery in DNN-based Monaural Singing Voice Separation

Abstract : State-of-the-art methods for monaural singing voice separation consist in estimating the magnitude spectrum of the voice in the short-term Fourier transform (STFT) domain by means of deep neural networks (DNNs). The resulting magnitude estimate is then combined with the mixture's phase to retrieve the complex-valued STFT of the voice, which is further synthesized into a time-domain signal. However, when the sources overlap in time and frequency, the STFT phase of the voice differs from the mixture's phase, which results in interference and artifacts in the estimated signals. In this paper, we investigate on recent phase recovery algorithms that tackle this issue and can further enhance the separation quality. These algorithms exploit phase constraints that originate from a sinusoidal model or from consistency , a property that is a direct consequence of the STFT redundancy. Experiments conducted on real music songs show that those algorithms are efficient for reducing interference in the estimated voice compared to the baseline approach.
Document type :
Conference papers
Complete list of metadata

Cited literature [31 references]  Display  Hide  Download
Contributor : Paul Magron Connect in order to contact the contributor
Submitted on : Friday, June 15, 2018 - 4:01:35 PM
Last modification on : Wednesday, November 3, 2021 - 8:08:59 AM
Long-term archiving on: : Monday, September 17, 2018 - 11:54:52 AM


Files produced by the author(s)


  • HAL Id : hal-01741278, version 2


Paul Magron, Konstantinos Drossos, Stylianos Ioannis Mimilakis, Tuomas Virtanen. Reducing Interference with Phase Recovery in DNN-based Monaural Singing Voice Separation. Interspeech, Sep 2018, Hyderabad, India. ⟨hal-01741278v2⟩



Record views


Files downloads