Singing voice separation with deep u-net convolutional networks, Proc. of ISMIR (International Society for Music Information Retrieval), 2017. ,
Monoaural audio source separation using deep convolutional neural networks, Proc. of LVA/ICA (International Conference on Latent Variable Analysis and Signal Separation), 2017. ,
Improving singing voice separation using deep u-net and waveu-net with data augmentation, 2019. ,
Modulating early visual processing by language, Proc. of NIPS (Annual Conference on Neural Information Processing Systems), 2017. ,
URL : https://hal.archives-ouvertes.fr/hal-01648683
Feature-wise transformations. Distill, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01841985
Onsets and frames: Dual-objective piano transcription, Proc. of ISMIR (International Society for Music Information Retrieval), 2018. ,
An improved relative self-attention mechanism for transformer with application to music generation, 2018. ,
Joint optimization of masks and deep recurrent neural networks for monaural source separation, IEEE/ACM TASLP (Transactions on Audio Speech and Language Processing), vol.23, issue.12, 2015. ,
Batch normalization: Accelerating deep network training by reducing internal covariate shift, Proc. of ICML (International Conference on Machine Learning), 2015. ,
Semi-blind source separation with multichannel variational autoencoder, 2018. ,
Dynamic layer normalization for adaptive neural acoustic modeling in speech recognition, CoRR, 2017. ,
Adam: A method for stochastic optimization, Proc. of ICLR (International Conference on Learning Representations), 2014. ,
Impact of phase estimation on single-channel speech separation based on time-frequency masking, The Journal of the Acoustical Society of America, vol.141, pp.4668-4679, 2017. ,
Film: Visual reasoning with a general conditioning layer, Proc. of AAAI (Conference on Artificial Intelligence), 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01648685
mir eval: a transparent implementation of common mir metrics, Proc. of ISMIR (International Society for Music Information Retrieval), 2014. ,
An Overview of Lead and Accompaniment Separation in Music, IEEE/ACM TASLP (Transactions on Audio Speech and Language Processing), vol.26, issue.8, 2018. ,
URL : https://hal.archives-ouvertes.fr/lirmm-01766781
Stylianos Ioannis Mimilakis, and Rachel Bittner. The MUSDB18 corpus for music separation, 2017. ,
U-net: Convolutional networks for biomedical image segmentation, Proc. of MICCAI (International Conference on Medical Image Computing and Computer Assisted Intervention), 2015. ,
Natural TTS synthesis by conditioning wavenet on mel spectrogram predictions, Proc. of ICASSP (International Conference on Acoustics, Speech and Signal Processing, 2018. ,
Wave-u-net: A multiscale neural network for end-to-end audio source separation, Proc. of ISMIR (International Society for Music Information Retrieval), 2018. ,
Visual reasoning with multihop feature modulation, Proc. of ECCV, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01927811
Between-class learning for image classification, Proc. of CVPR (Conference on Computer Vision and Pattern Recognition), 2018. ,
Wavenet: A generative model for raw audio, 2016. ,
, , 2017.
Performance measurement in blind audio source separation, IEEE/ACM TASLP (Transactions on Audio Speech and Language Processing), vol.14, issue.4, 2006. ,
URL : https://hal.archives-ouvertes.fr/inria-00544230
Midinet: A convolutional generative adversarial network for symbolicdomain music generation using 1d and 2d conditions, CoRR, 2017. ,