Going deeper with convolutions, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2014. ,
DOI : 10.1109/CVPR.2015.7298594
Neural network bottleneck features for language identification, Proc. IEEE Odyssey, pp.299-304, 2014. ,
Recent advances in deep learning for speech research at Microsoft, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.8604-8608, 2013. ,
DOI : 10.1109/ICASSP.2013.6639345
Deep Neural Network Approaches to Speaker and Language Recognition, IEEE Signal Processing Letters, vol.22, issue.10, p.1671, 2015. ,
DOI : 10.1109/LSP.2015.2420092
Robust language identification using convolutional neural network features, Proc. INTER- SPEECH, 2014. ,
I know that voice: Identifying the voice actor behind the voice, 2015 International Conference on Biometrics (ICB), pp.46-51, 2015. ,
DOI : 10.1109/ICB.2015.7139074
Analysis of cnnbased speech recognition system using raw speech as input, Proc. INTERSPEECH, 2015. ,
A deep convolutional neural network using heterogeneous pooling for trading acoustic invariance with phonetic confusion, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, pp.6669-6673, 2013. ,
DOI : 10.1109/ICASSP.2013.6638952
Unsupervised feature learning for audio classification using convolutional deep belief networks, Advances in neural information processing systems, pp.1096-1104, 2009. ,
Deepspeech: Scaling up end-to-end speech recognition, 2014. ,
Imagenet classification with deep convolutional neural networks, Advances in neural information processing systems, pp.1097-1105, 2012. ,
Very deep convolutional networks for large-scale image recognition, 1409. ,
Application of convolutional neural networks to speaker recognition in noisy conditions, Proc. INTERSPEECH, 2014. ,
Convoluted feelings convolutional and recurrent nets for detecting emotion from audio data ,
Speaker verification using adapted gaussian mixture models, Digital signal processing, vol.10, issue.1, pp.19-41, 2000. ,
Front-End Factor Analysis for Speaker Verification, Audio, Speech, and Language Processing, pp.788-798, 2011. ,
DOI : 10.1109/TASL.2010.2064307
DISTBIC: A speaker-based segmentation for audio data indexing, Speech Communication, vol.32, issue.1-2, pp.111-126, 2000. ,
DOI : 10.1016/S0167-6393(00)00027-3
Msr identity toolbox v1. 0: A matlab toolbox for speaker recognition research, Speech and Language Processing Technical Committee Newsletter, 2013. ,
Analysis of i-vector length normalization in speaker recognition systems, Proc. INTERSPEECH, pp.249-252, 2011. ,
Caffe, Proceedings of the ACM International Conference on Multimedia, MM '14, 2014. ,
DOI : 10.1145/2647868.2654889
Visualizing and understanding convolutional networks, Computer vision? ECCV 2014, pp.818-833, 2014. ,
The repere corpus: a multimodal corpus for person recognition, LREC, pp.1102-1107, 2012. ,
Feature warping for robust speaker verification IEEE Odyssey: The Speaker and Language Recognition Workshop, pp.213-218, 2001. ,
Inter dataset variability compensation for speaker recognition, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.4002-4006, 2014. ,
DOI : 10.1109/ICASSP.2014.6854353