Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator, IEEE Transactions on Acoustics, Speech and Signal Processing, vol.32, issue.6, pp.1109-1121, 1984. ,
Speech enhancement for non-stationary noise environments, Signal processing, vol.81, issue.11, pp.2403-2418, 2001. ,
Minimum mean-square error estimation of discrete Fourier coefficients with generalized Gamma priors, IEEE Transactions on Audio, Speech, and Language Processing, vol.15, issue.6, pp.1741-1752, 2007. ,
A statistical model-based voice activity detection, IEEE Signal Processing Letters, vol.6, issue.1, pp.1-3, 1999. ,
Voice activity detection based on statistical likelihood ratio with adaptive thresholding, IEEE International Workshop on Acoustic Signal Enhancement (IWAENC), pp.1-5, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01349776
Acoustic environment identification and its applications to audio forensics, IEEE Transactions on Information Forensics and Security, vol.8, issue.11, pp.1827-1837, 2013. ,
Dynamic noise aware training for speech enhancement based on deep neural networks, Fifteenth Annual Conference of the International Speech Communication Association, 2014. ,
SNR-Aware convolutional neural network modeling for speech enhancement, pp.3768-3772, 2016. ,
Noise power spectral density estimation based on optimal smoothing and minimum statistics, IEEE Transactions on Speech and Audio Processing, vol.9, issue.5, pp.504-512, 2001. ,
Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging, IEEE Transactions on Speech and Audio Processing, vol.11, issue.5, pp.466-475, 2003. ,
A noise-estimation algorithm for highly non-stationary environments, Speech communication, vol.48, issue.2, pp.220-231, 2006. ,
Non-stationary noise power spectral density estimation based on regional statistics, IEEE International Conference on Acoustics, Speech and Signal Processing, pp.181-185, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01250892
MMSE based noise psd tracking with low complexity, IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), pp.4266-4269, 2010. ,
Unbiased MMSE-based noise power estimation with low complexity and low tracking delay, IEEE Transactions on Audio, Speech, and Language Processing, vol.20, issue.4, pp.1383-1393, 2012. ,
Supervised speech separation based on deep learning: An overview, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.26, issue.10, pp.1702-1726, 2018. ,
Neural network based spectral mask estimation for acoustic beamforming, Acoustics, Speech and Signal Processing, pp.196-200, 2016. ,
Robust MVDR beamforming using time-frequency masks for online/offline ASR in noise, Acoustics, Speech and Signal Processing, pp.5210-5214, 2016. ,
On timefrequency mask estimation for MVDR beamforming with application in robust speech recognition, Acoustics, Speech and Signal Processing, pp.3246-3250, 2017. ,
A speech enhancement algorithm by iterating single-and multi-microphone processing and its application to robust ASR, Acoustics, Speech and Signal Processing, pp.276-280, 2017. ,
Exploring practical aspects of neural mask-based beamforming for farfield speech recognition, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.6697-6701, 2018. ,
Single-channel speech separation with memory-enhanced recurrent neural networks, Acoustics, Speech and Signal Processing, pp.3709-3713, 2014. ,
Global SNR estimation of speech signals for unknown noise conditions using noise adapted nonlinear regression, Proc. Interspeech, pp.3842-3846, 2017. ,
Long short-term memory for speaker generalization in supervised speech separation, The Journal of the Acoustical Society of America, vol.141, issue.6, pp.4705-4714, 2017. ,
Long short-term memory, Neural computation, vol.9, issue.8, pp.1735-1780, 1997. ,
Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems, Speech communication, vol.12, issue.3, pp.247-251, 1993. ,
Getting started with the DARPA TIMIT CD-ROM: An acoustic phonetic continuous speech database, vol.107, 1988. ,
Keras, 2015. ,
Adam: A method for stochastic optimization, 2014. ,
Backpropagation through time: what it does and how to do it, Proceedings of the IEEE, vol.78, issue.10, pp.1550-1560, 1990. ,
Noise tracking using DFT domain subspace decompositions, IEEE Transactions on Audio, Speech, and Language Processing, vol.16, issue.3, pp.541-553, 2008. ,
Perceptual evaluation of speech quality (PESQ)-a new method for speech quality assessment of telephone networks and codecs, IEEE International Conference on Acoustics, Speech, and Signal Processing, vol.2, pp.749-752, 2001. ,