T. Asami, R. Masumura, Y. Yamaguchi, H. Masataki, and Y. Aono, Domain adaptation of DNN acoustic models using knowledge distillation, International Conference on Acoustics, Speech and Signal Processing, 2017.

Y. Bengio, A. Courville, and P. Vincent, Representation learning: A review and new perspectives, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.35, pp.1798-1828, 2013.

F. Chollet, 2015.

J. S. Chung, A. Nagrani, and A. Zisserman, VoxCeleb2: Deep speaker recognition, INTERSPEECH, 2018.

X. Glorot and Y. Bengio, Understanding the difficulty of training deep feedforward neural networks, Proceedings of the International Conference on Artificial Intelligence and Statistics, 2010.

A. Gresse, M. Quillot, R. Dufour, V. Labatut, and J.-F. Bonastre, Similarity metric based on siamese neural networks for voice casting, International Conference on Acoustics, Speech and Signal Processing, 2019.

A. Gresse, M. Rouvier, R. Dufour, V. Labatut, and J.-F. Bonastre, Acoustic pairing of original and dubbed voices in the context of video game localization, INTERSPEECH, 2017.

G. Hinton, O. Vinyals, and J. Dean, Distilling the knowledge in a neural network, 2015.

N. M. Joy, S. R. Kothinti, S. Umesh, and B. Abraham, Generalized distillation framework for speaker normalization, INTERSPEECH, 2017.

J. Li, M. L. Seltzer, X. Wang, R. Zhao, and Y. Gong, Large-scale domain adaptation via teacher-student learning, 2017.

D. Lopez-Paz, L. Bottou, B. Schölkopf, and V. Vapnik, Unifying distillation and privileged information, International Conference on Learning Representations, 2016.

K. Markov and T. Matsui, Robust speech recognition using generalized distillation framework, INTERSPEECH, 2016.

N. Obin and A. Roebel, Similarity search of acted voices for automatic voice casting, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.24, pp.1642-1651, 2016.

N. Obin, A. Roebel, and G. Bachman, On automatic voice casting for expressive speech: Speaker recognition vs. speech classification, International Conference on Acoustics, Speech and Signal Processing, 2014.

D. Povey, A. Ghoshal, G. Boulianne, L. Burget, and O. Glembek, The Kaldi speech recognition toolkit, IEEE 2011 ASRU, 2011.

R. Price, K.-I. Iso, and K. Shinoda, Wise teachers train better DNN acoustic models, Speech, and Music Processing, 2016.

D. Snyder, D. Garcia-Romero, D. Povey, and S. Khudanpur, Deep neural network embeddings for text-independent speaker verification, INTERSPEECH, 2017.

D. Snyder, D. Garcia-Romero, G. Sell, D. Povey, and S. Khudanpur, X-vectors: Robust DNN embeddings for speaker recognition, 2018.

D. Snyder, P. Ghahremani, D. Povey, D. Garcia-Romero, Y. Carmiel, and S. Khudanpur, Deep neural network-based speaker embeddings for end-to-end speaker verification, Spoken Language Technology Workshop, 2016.

V. Vapnik and R. Izmailov, Learning using privileged information: similarity control and knowledge transfer, Journal of Machine Learning Research, vol.16, pp.2023-2049, 2015.

E. Variani, X. Lei, E. McDermott, I. L. Moreno, and J. Gonzalez-Dominguez, Deep neural networks for small footprint text-dependent speaker verification, ICASSP, 2014.

S. Watanabe, T. Hori, J. Le Roux, and J. R. Hershey, Student-teacher network learning with enhanced features, Acoustics, Speech and Signal Processing, 2017.