K. Amino and T. Arai, Speaker-dependent characteristics of the nasals, Forensic science international, vol.185, pp.21-28, 2009.

H. Aronowitz, Text-Dependent Speaker Verification Using a Small Development Set, Odyssey Speaker and Language Recognition Workshop, 2012.

B. Avinash, S. Guruprasad, and B. Ygnannarayana, Exploring subsegmental and suprasegmental features for a text-dependent speaker verification in distant speech signals, Annual Conference of the International Speech Communication Association (Interspeech), pp.1073-1076, 2010.

E. Bailly-bailliere, S. Bengio, F. Bimbot, M. Hamouz, J. Kittler et al., The BANCA database and evaluation protocol, Lecture Notes in Computer Science, vol.2688, pp.625-638, 2003.

M. F. Benzeghiba and H. Bourlard, User-customized password speaker verification using multiple reference and background models, Speech Communication, vol.48, pp.1200-1213, 2006.

K. Boakye and B. Peskin, Text-Constrained Speaker Recognition on a Text-Independent Task, Odyssey Speaker and Language Recognition Workshop, pp.1-6, 2004.

D. Boies, M. Hébert, and L. P. Heck, Study on the effect of lexical mismatch in text-dependent speaker verification, Odyssey Speaker and Language Recognition Workshop, pp.1-5, 2004.

J. F. Bonastre, P. Morin, and J. C. Junqua, Gaussian dynamic warping (GDW) method applied to text-dependent speaker detection and verification, European Conference on Speech Communication and Technology (Eurospeech), pp.2013-2016, 2003.
URL : https://hal.archives-ouvertes.fr/hal-02157173

P. M. Bousquet, A. Larcher, D. Matrouf, J. F. Bonastre, and O. Plchot, Variance-Spectra based Normalization for I-vector Standard and Probabilistic Linear Discriminant Analysis, Odyssey Speaker and Language Recognition Workshop, pp.1-8, 2012.
URL : https://hal.archives-ouvertes.fr/hal-01927751

P. M. Bousquet, D. Matrouf, and J. F. Bonastre, Intersession compensation and scoring methods in the i-vectors space for speaker recognition, Annual Conference of the International Speech Communication Association (Interspeech), pp.485-488, 2011.
URL : https://hal.archives-ouvertes.fr/hal-01313266

N. Brümmer and E. De-villiers, The speaker partitioning problem, Odyssey Speaker and Language Recognition Workshop, pp.1-8, 2010.

J. Campbell and A. L. Higgins, A YOHO speaker verification corpus LDC94s16, 1994.

J. P. Campbell, Testing with the YOHO CD-ROM voice verification corpus, IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.341-344, 1995.

J. P. Campbell and D. A. Reynolds, Corpora for the evaluation of speaker recognition systems, IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.829-832, 1999.

J. P. Campbell, W. Shen, W. M. Campbell, R. Schwartz, J. F. Bonastre et al., Forensic speaker recognition. Signal processing magazine, IEEE, vol.26, pp.95-103, 2009.

D. Charlet and D. Jouvet, Optimizing feature set for speaker verification, Pattern Recognition Letters, vol.18, pp.873-879, 1997.
DOI : 10.1016/s0167-8655(97)00064-0

D. Charlet, D. Jouvet, and O. Collin, An alternative normalization scheme in HMM-based text-dependent speaker verification, Speech Communication, vol.31, pp.113-120, 2000.

S. Chatzis and T. Varvarigou, A Robust to Outliers Hidden Markov Model with Application in Text-Dependent Speaker Identification, International Conference on Signal Processing and Communications, pp.804-807, 2007.

C. W. Che, Q. Lin, and D. S. Yuk, An HMM approach to text-prompted speaker verification, IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.673-676, 1996.

K. Chen, D. Xie, and H. Chi, A modified HME architecture for textdependent speaker identification, IEEE Transactions on Neural Networks, vol.7, pp.1309-1313, 1996.

W. Chen, Q. Hong, and X. Li, GMM-UBM for text-dependent speaker recognition, International Conference on Audio, Language and Image Processing, pp.432-435, 2012.
DOI : 10.1109/icalip.2012.6376656

G. Chollet, J. L. Cochard, A. Constantinescu, C. Jaboulet, and P. Langlais, Swiss French PolyPhone and PolyVar: telephone speech databases to model inter-and intra-speaker variability, 1996.

G. Doddington, The Effect of Target/Non-Target Age Difference on Speaker Recognition Performance, Odyssey Speaker and Language Recognition Workshop, pp.1-5, 2012.

G. R. Doddington, Speaker recognition evaluation methodology-an overview and perspective, Workshop on Speaker Recognition and its Commercial and Forensic Applications (RLA2C), pp.20-23, 1998.
DOI : 10.1016/s0167-6393(99)00080-1

C. Dong, Y. Dong, J. Li, and H. Wang, Support Vector Machines Based Text Dependent Speaker Verification Using HMM superverctors, Odyssey Speaker and Language Recognition Workshop, pp.1-7, 2008.

B. Dumas, C. Pugin, J. Hennebert, D. Petrovska-delacrétaz, A. Humm et al., MyIdea-Multimodal biometrics database, description of acquisition protocols, Biometrics on the Internet, vol.275, pp.59-62, 2005.

T. Dutta, Text dependent speaker identification based on spectrograms, Image and Vision Computing, pp.238-243, 2007.
DOI : 10.1109/cisp.2008.560

T. Dutta, Dynamic Time Warping Based Approach to Text-Dependent Speaker Identification Using Spectrograms, Congress on Image and Signal Processing, pp.354-360, 2008.
DOI : 10.1109/cisp.2008.560

, RUSTEN: Russian Switched Telephone Network speech database (STC, ELDA-Evaluations and Language resources Distribution Agency, vol.0050, 2003.

K. R. Farrell, Text-dependent speaker verification using data fusion, IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP, INSTITUTE OF ELECTRICAL ENGINEERS INC (IEE), pp.349-349, 1995.
DOI : 10.1109/icassp.1995.479545

K. R. Farrell, R. P. Ramachandran, and R. J. Mammone, An analysis of data fusion methods for speaker verification, IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.1129-1132, 1998.

M. Faundez-zanuy, J. Fierrez-aguilar, J. Ortega-garcia, and J. Gonzalezrodriguez, Multimodal biometric databases: An overview, IEEE Aerospace and Electronic Systems Magazine, vol.21, pp.29-37, 2006.
DOI : 10.1109/maes.2006.1703234

B. Fauve, Tackling Variabilities in Speaker Verification with a Focus on Short Durations, 2009.

J. Fierrez, J. Galbally, J. Ortega-garcia, M. Freire, F. Alonso-fernandez et al., BiosecurID: a multimodal biometric database, Analysis & Applications, vol.13, pp.235-246, 2010.
DOI : 10.1007/s10044-009-0151-4

J. Fierrez, J. Ortega-garcia, D. Torre-toledano, and J. Gonzalez-rodriguez, Biosec baseline corpus: A multimodal biometric database, Pattern Recognition, vol.40, pp.1389-1392, 2007.
DOI : 10.1016/j.patcog.2006.10.014
URL : https://repositorio.uam.es/bitstream/10486/662302/5/biosec_fierrez_PR_2007_ps.pdf

R. Finan, A. Sapeluk, and R. Damper, Comparison of multilayer and radial basis function neural networks for text-dependent speaker recognition, IEEE International Conference on Neural Networks, IEEE, pp.1992-1997, 1996.

M. Forsyth, Discriminating observation probability (DOP) HMM for speaker verification, Speech communication, vol.17, pp.117-129, 1995.
DOI : 10.1016/0167-6393(95)00020-o

N. A. Fox, B. A. O'mullane, and R. B. Reilly, The Realistic Multi-modal VALID database and Visual Speaker Identification Comparison Experiments, International Conference of Audio and Video-Based Person Authentication, pp.777-786, 2005.

S. Furui, Cepstral analysis technique for automatic speaker verification, IEEE Transactions on Acoustics, Speech, and Signal Processing, vol.29, pp.254-272, 1981.
DOI : 10.1109/tassp.1981.1163530
URL : http://t2r2.star.titech.ac.jp/rrws/file/CTT100418590/ATD100000413/

S. Furui, Comparison of speaker recognition methods using statistical features and dynamic features, IEEE Transactions on Acoustics, Speech and Signal Processing, vol.29, pp.342-350, 1981.
DOI : 10.1109/tassp.1981.1163605
URL : http://t2r2.star.titech.ac.jp/rrws/file/CTT100418591/ATD100000413/

D. Garcia-romero and C. Y. Espy-wilson, Analysis of i-vector length normalization in speaker recognition systems, Annual Conference of the International Speech Communication Association (Interspeech), pp.249-252, 2011.

S. Garcia-salicetti, C. Beumier, G. Chollet, B. Dorizzi, J. Jardins et al., BIOMET: A Multimodal Person Authentication Database Including Face, Voice, Fingerprint, Hand and Signature Modalities, Lecture Notes in Computer Science, pp.845-853, 2003.
DOI : 10.1007/3-540-44887-x_98

J. S. Garofolo, L. F. Lamel, W. M. Fisher, J. G. Fiscus, D. S. Pallett et al., Timit acoustic-phonetic continuous speech corpus linguistic data consortium, 1993.

Y. Gu and T. Thomas, An implementation and evaluation of an on-line speaker verification system for field trials, Annual Conference of the International Speech Communication Association (Interspeech), pp.125-128, 1998.

T. Hasan, R. Saeidi, J. H. Hansen, and D. A. Van-leeuwen, Duration Mismatch Compensation for I-Vector Based Speaker Recognition Systems, IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP, pp.7663-7667, 2013.
DOI : 10.1109/icassp.2013.6639154
URL : https://repository.ubn.ru.nl/bitstream/2066/116064/1/116064.pdf

M. Hébert, Heidelberg. chapter Text-dependent speaker recognition, pp.743-762, 2008.

M. Hébert and D. Boies, T-norm for text-dependent commercial speaker verification applications: Effect of lexical mismatch, IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.729-732, 2005.

M. Hebert and L. P. Heck, Phonetic class-based speaker verification, European Conference on Speech Communication and Technology, pp.1665-1668, 2003.

L. Heck and D. Genoud, Integrating Speaker and Speech Recognizers: Automatic Identity Claim Capture for Speaker Verification, Odyssey Speaker and Language Recognition Workshop, pp.249-254, 2001.

J. Hennebert, H. Melin, D. Petrovska, and D. Genoud, POLYCOST: A telephone-speech database for speaker recognition, Speech Communication, vol.31, pp.265-270, 2000.
DOI : 10.1016/s0167-6393(99)00082-5

Y. Jiang, K. A. Lee, Z. Tang, B. Ma, A. Larcher et al., PLDA Modeling in I-vector and Supervector Space for Speaker Verification, Annual Conference of the International Speech Communication Association (Interspeech), pp.1680-1683, 2012.
DOI : 10.1186/preaccept-1667880097114310
URL : https://hal.archives-ouvertes.fr/hal-01927743

J. Kahn, N. Audibert, J. F. Bonastre, and S. Rossato, Inter and intraspeaker variability in French: an analysis of oral vowels and itsimplication for automatic speaker verification, International Congress of Phonetic Sciences (ICPhS), pp.1002-1005, 2011.

J. Kahn, N. Audibert, S. Rossato, and J. F. Bonastre, Intra-speaker variability effects on Speaker Verification performance, Odyssey Speaker and Language Recognition Workshop, pp.109-116, 2010.
URL : https://hal.archives-ouvertes.fr/hal-00959188

A. Kanagasundaram, R. Vogt, D. Dean, S. Sridharan, and M. Mason, I-vector Based Speaker Recognition on Short Utterances, in: Annual Conference of the International Speech Communication Association (Interspeech), pp.2341-2344, 2011.

Z. N. Karam, W. M. Campbell, and N. Dehak, Graph relational features for speaker recognition and mining, Statistical Signal Processing Workshop (SSP), IEEE, pp.525-528, 2011.
DOI : 10.1109/ssp.2011.5967749

I. Karlsson, Within-speaker variability in the VeriVox database. Gothenburg papers in theoretical linguistics, pp.93-96, 1999.

I. Karlsson, T. Banziger, J. Dankovicová, T. Johnstone, J. Lindberg et al., Speaker verification with elicited speaking styles in the VeriVox project, Speech Communication, vol.31, pp.121-129, 2000.

T. Kato and T. Shimizu, Improved speaker verification over the cellular phone network using phoneme-balanced and digit-sequence-preserving connected digit patterns, IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.57-60, 2003.

H. Kekre, T. Sarode, S. Natu, and P. Natu, Performance Comparison Of 2-D DCT On Full/Block Spectrogram And 1-D DCT On Row Mean Of Spectrogram For Speaker Identification, International Journal of Biometrics and Bioinformatics, p.100, 2010.

F. Kelly, A. Drygajlo, and N. Harte, Speaker verification with long-term ageing data, International Conference on Biometrics (ICB), pp.478-483, 2012.
DOI : 10.1109/icb.2012.6199796
URL : http://www.mee.tcd.ie/%7Esigmedia/pmwiki/uploads/Main.Publications/finnianICB2012.pdf

F. Kelly and N. Harte, Effects of long-term ageing on speaker verification, Biometrics and ID Management, pp.113-124, 2011.
DOI : 10.1007/978-3-642-19530-3_11
URL : http://www.mee.tcd.ie/%7Esigmedia/pmwiki/uploads/Main.Publications/fkelly_BioID2011.pdf

P. Kenny, G. Boulianne, P. Ouellet, and P. Dumouchel, Joint factor analysis versus eigenchannels in speaker recognition, IEEE Transactions on Audio, Speech, and Language Processing, vol.15, pp.1435-1447, 2007.

P. Kenny and P. Dumouchel, Disentangling speaker and channel effects in speaker verification, IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.37-40, 2004.

P. Kenny, T. Stafylakis, P. Ouellet, J. Alam, and P. Dumouchel, PLDA for Speaker Verification with Utterances of Arbitrary Duration, IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.7649-7653, 2013.

T. Kinnunen and H. Li, An overview of text-independent speaker recognition: From features to supervectors, Speech Communication, vol.52, pp.12-40, 2010.
URL : https://hal.archives-ouvertes.fr/hal-00587602

A. Larcher, J. F. Bonastre, B. Fauve, K. A. Lee, C. Lévy et al., ALIZE 3.0-Open Source Toolkit for State-ofthe-Art Speaker Recognition, Annual Conference of the International Speech Communication Association (Interspeech), pp.2768-2773, 2013.
URL : https://hal.archives-ouvertes.fr/hal-01927586

A. Larcher, J. F. Bonastre, and J. S. Mason, Reinforced temporal structure of acoustic models for speaker recognition, Digital Signal Processing, vol.23, pp.1910-1917, 2013.
URL : https://hal.archives-ouvertes.fr/tel-00453645

A. Larcher, J. F. Bonastre, and J. S. Mason, Reinforced temporal structure information for embedded utterance-based speaker recognition, Annual Conference of the International Speech Communication Association (Interspeech), pp.371-374, 2008.
URL : https://hal.archives-ouvertes.fr/hal-01312944

A. Larcher, P. M. Bousquet, K. A. Lee, D. Matrouf, H. Li et al., I-vectors in the context of phonetically-constrained short utterances for speaker verification, IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.4773-4776, 2012.
URL : https://hal.archives-ouvertes.fr/hal-01927733

A. Larcher, K. A. Lee, B. Ma, and H. Li, The RSR2015: Database for Text-Dependent Speaker Verification using Multiple Pass-Phrases, Annual Conference of the International Speech Communication Association (Interspeech), pp.1580-1583, 2012.
URL : https://hal.archives-ouvertes.fr/hal-01927726

A. Larcher, K. A. Lee, B. Ma, and H. Li, Phonetically-Constrained PLDA Modeling for Text-Dependent Speaker Verification with Multiple Short Utterances, IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.7673-7677, 2013.
DOI : 10.1109/icassp.2013.6639156
URL : https://hal.archives-ouvertes.fr/hal-01927589

A. D. Lawson, A. Staufer, B. Smolenski, B. Pokines, M. Leoanrd et al., Long term examination of intra-session and inter-session speaker variability, Annual Conference of the International Speech Communication Association (Interspeech), pp.2899-2902, 2009.

K. A. Lee, A. Larcher, H. Thai, B. Ma, and H. Li, Joint Application of Speech and Speaker Recognition for Automation and Security in Smart Home, Annual Conference of the International Speech Communication Association (Interspeech), pp.3317-3318, 2011.
URL : https://hal.archives-ouvertes.fr/hal-01927763

K. A. Lee, A. Larcher, C. H. You, B. Ma, and H. Li, Multi-session PLDA Scoring of I-vector for Partially Open-Set Speaker Detection, Annual Conference of the International Speech Communication Association (Interspeech), pp.3651-3655, 2013.
URL : https://hal.archives-ouvertes.fr/hal-01927584

K. A. Lee, B. Ma, and H. Li, Speaker verification makes its debut in smartphone, 2013.

D. A. Van-leeuwen and N. Brümmer, The distribution of calibrated likelihood-ratios in speaker recognition, Annual Conference of the International Speech Communication Association (Interspeech), pp.1619-1623, 2013.

Y. Lei and J. H. Hansen, The Role of Age in Factor Analysis for Speaker Identification, Annual Conference of the International Speech Communication Association (Interspeech), pp.2371-2374, 2009.

H. Li, B. Ma, and K. A. Lee, Spoken language recognition: from fundamentals to practice, Proceedings of the IEEE, vol.101, pp.1136-1159, 2013.
DOI : 10.1109/jproc.2012.2237151
URL : https://doi.org/10.1109/jproc.2012.2237151

Q. Li, J. Zheng, A. Tsai, and Q. Zhou, Robust endpoint detection and energy normalization for real-time speech and speaker recognition, IEEE Transactions on Speech and Audio Processing, vol.10, pp.146-157, 2002.

J. Luan, J. Hao, T. Kakino, and T. Ikumi, Template Compression and Distance Normalization for Reliable Text-dependent Speaker Verification, in: Odyssey Speaker and Language Recognition Workshop, IEEE, pp.1-4, 2006.
DOI : 10.1109/odyssey.2006.248141

M. I. Mandasari, M. Mclaren, and D. Van-leeuwen, Evaluation of ivector Speaker Recognition Systems for Forensic Application, Annual Conference of the International Speech Communication Association (Interspeech), pp.21-24, 2011.

S. Marcel, C. Mccool, P. Matejka, J. Cernocky, J. Kittler et al., On the Results of the First Mobile Biometry (MOBIO) Face and Speaker Verification Evaluation, Lecture Notes in Computer Science, pp.210-225, 2010.

A. F. Martin and C. S. Greenberg, NIST 2008 speaker recognition evaluation: performance across telephone and room microphone channels, Annual Conference of the International Speech Communication Association (Interspeech), pp.2579-2582, 2009.

A. F. Martin and C. S. Greenberg, The NIST 2010 speaker recognition evaluation, Annual Conference of the International Speech Communication Association (Interspeech), pp.2726-2729, 2010.

D. Martinez, O. Plchot, L. Burget, O. Glembek, and P. Matejka, Language Recognition in i-vectors Space, Annual Conference of the International Speech Communication Association (Interspeech), pp.861-864, 2011.

J. S. Mason, F. Deravi, C. C. Chibelushi, and S. Gandon, Project: DAVID (Digital Audio Visual Integrated Database), 1996.

T. Matsui and S. Furui, Concatenated phoneme models for text-variable speaker recognition, IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.391-394, 1993.
DOI : 10.1109/icassp.1993.319321

H. Meng, P. Ching, T. Lee, M. W. Mak, B. Mak et al., The multi-biometric, multi-device and multilingual (m3) corpus, in: International Workshop on Multimodal User Authentication, pp.1-8, 2006.

K. Messer, J. Matas, J. Kittler, J. Luettin, and G. Maitre, XM2VTSDB: The Extended M2VTS Database, International Conference of Audio and Video-Based Person Authentication, AVBPA, pp.965-966, 1999.

W. Mistretta and K. Farrell, Model adaptation methods for speaker verification, IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.113-116, 1998.
DOI : 10.1109/icassp.1998.674380

S. Nakagawa, Z. Wei, and M. Takahashi, Text-independent speaker recognition by combining speaker-specific GMM with speaker adapted syllablebased HMM, IEEE International Conference on Acoustics, Speech, and Signal Processing, p.81, 2004.
DOI : 10.1109/icassp.2004.1325927

M. Nosratighods, E. Ambikairajah, J. Epps, and M. J. Carey, A segment selection technique for speaker verification, Speech Communication, vol.52, pp.753-761, 2010.
DOI : 10.1016/j.specom.2010.04.007

J. Ortega-garcia, J. Fierrez, F. Alonso-fernandez, J. Galbally, M. R. Freire et al., The multiscenario multienvironment biosecure multimodal database (bmdb), IEEE transactions on Pattern Analysis and Machine intelligence, vol.32, pp.1097-1111, 2010.
DOI : 10.1109/tpami.2009.76
URL : https://hal.archives-ouvertes.fr/hal-01333456

J. Ortega-garcia, J. Gonzalez-rodriguez, and V. Marrero-aguiar, AHUMADA: A large speech corpus in Spanish for speaker characterization and identification, Speech communication, vol.31, pp.255-264, 2000.

S. Pigeon and L. Vandendorpe, The M2VTS multimodal face database (release 1.00). Lecture Notes in Computer Science, pp.403-409, 1206.

J. Prazak and J. Silovsky, Speaker Diarization Using PLDA-based Speaker Clustering, International Conference on Intelligent Data Acquisition and Advanced Computing Systems, pp.347-350, 2011.

S. J. Prince and J. H. Elder, Probabilistic linear discriminant analysis for inferences about identity, International Conference on Computer Vision, IEEE, pp.1-8, 2007.

M. A. Przybocki, A. F. Martin, and A. N. Le, NIST Speaker Recognition Evaluation Chronicles-Part 2, Odyssey Speaker and Language Recognition Workshop, pp.1-6, 2006.

V. Ramasubramanian, A. Das, and V. Kumar, Text-Dependent SpeakerRecognition Using One-Pass Dynamic Programming Algorithm, IEEE International Conference on Acoustics, Speech, and Signal Processing, p.1, 2006.

D. A. Reynolds, T. F. Quatieri, and R. B. Dunn, Speaker Verification Using Adapted Gaussian Mixture Models, Digital Signal Processing, vol.10, pp.19-41, 2000.

A. E. Rosenberg, C. Lee, and S. Gokcen, Connected word talker verification using whole word Hidden Markov Models, IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.381-384, 1991.

A. E. Rosenberg, O. Siohan, and S. Parthasarathy, Small group speaker identification with common password phrases, Speech communication, vol.31, pp.131-140, 2000.

M. Schmidt and H. Gish, Speaker identification via support vector classifiers, IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.105-108, 1996.

M. Senoussaoui, P. Kenny, N. Brummer, E. De-villiers, and P. Dumouchel, Mixture of PLDA models in I-vector space for gender independent speaker recognition, Annual Conference of the International Speech Communication Association (Interspeech), pp.25-28, 2011.

J. Silovsky, J. Prazak, P. Cerva, J. Zdansky, and J. Nouza, Pldabased clustering for speaker diarization of broadcast streams, Annual Conference of the International Speech Communication Association (Interspeech), pp.2909-2912, 2011.

T. Stafylakis, P. Kenny, P. Ouellet, J. Perez, M. Kockmann et al., Text-dependent speaker recognition using PLDA with uncertainty propagation, Annual Conference of the International Speech Communication Association (Interspeech), pp.3684-3688, 2013.

S. Steininger, S. Rabold, O. Dioubina, and F. Schiel, Development of user-state conventions for the multimodal corpus in SmartKom, LREC Workshop on" Multimodal Resources, 2002.

A. Stolcke, A. Mandal, and E. Shriberg, Speaker Recognition with Region-Constrained MLLR Transforms, IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.4397-4400, 2012.

D. Sturim, D. Reynolds, R. Dunn, and T. Quatieri, Speaker verification using text-constrained Gaussian mixture models, IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.677-680, 1999.

A. Subramanya, Z. Zhang, A. C. Surendran, P. Nguyen, M. Narasimhan et al., A generative-discriminative framework using ensemble methods for text-dependent speaker verification, IEEE International Conference on Acoustics, Speech, and Signal Processing, p.25, 2007.

D. T. Toledano, D. Hernandez-lopez, C. Esteve-elizalde, J. Fierrez, J. Ortega-garcia et al., BioSec Multimodal Biometric Database in Text-Dependent Speaker Recognition, 2008.

O. Toledo-ronen, H. Aronowitz, R. Hoory, J. Pelecanos, and D. Nahamoo, Towards Goat Detection in Text-Dependent Speaker Verification, Annual Conference of the International Speech Communication Association (Interspeech), pp.9-12, 2011.

R. Vogt and S. Sridharan, Explicit modelling of session variability for speaker verification, Computer Speech & Language, vol.22, pp.17-38, 2008.

R. J. Vogt, C. J. Lustri, and S. Sridharan, Factor analysis modelling for speaker verification with short utterances, in: Odyssey Speaker and Language Recognition Workshop, IEEE, pp.1-4, 2008.

R. J. Vogt, J. Pelecanos, N. Scheffer, S. Kajarekar, and S. Sridharan, Within-session variability modelling for factor analysis speaker verification, Annual Conference of the International Speech Communication Association (Interspeech), pp.1563-1566, 2009.
DOI : 10.1109/icassp.2006.1660166

M. Wagner, C. Summerfield, T. Dunstone, R. Summerfield, and J. Moss, An evaluation of "commercial off-the-shelf" speaker verification systems, Odyssey Speaker and Language Recognition Workshop, pp.1-8, 2006.

Y. W. Wong, S. I. Chang, K. P. Seng, L. M. Ang, S. W. Chin et al., A new multi-purpose audio-visual UNMC-VIER database with multiple variabilities, Pattern Recognition Letters, vol.32, pp.1503-1510, 2011.
DOI : 10.1016/j.patrec.2011.06.011

R. H. Woo, A. Park, and T. J. Hazen, The MIT Mobile Device Speaker Verification Corpus: Data Collection and Preliminary Experiments, Odyssey Speaker and Language Recognition Workshop, 2006.
DOI : 10.1109/odyssey.2006.248083
URL : http://groups.csail.mit.edu/sls/publications/2006/Odyssey_2006_Woo.pdf

S. C. Woo, C. P. Lim, and R. Osman, Text-dependent speaker recognition using the fuzzy ARTMAP neural network, Proceedings of IEEE Region 10 International Conference on Electrical and Electronic Technology, TENCON, 2000.

D. Wu, . Baojieli, and H. Jiang, Speech recognition, technologies and applications-Normalization and transformation techniques for robust speaker recognition. I-Tech, 2008.
DOI : 10.5772/6388
URL : https://www.intechopen.com/chapter/pdf-download/5887

J. Xu, Y. Zhang, Z. J. Yan, and Q. Huo, An i-vector based approach to acoustic sniffing for irrelevant variability normalization based acoustic model training and speech recognition, Annual Conference of the International Speech Communication Association (Interspeech), pp.1701-1704, 2011.

B. Yegnanarayana, S. M. Prasanna, J. M. Zachariah, and C. S. Gupta, Combining evidence from source, suprasegmental and spectral features for a fixed-text speaker verification system, IEEE Transactions on Speech and Audio Processing, vol.13, pp.575-582, 2005.
DOI : 10.1109/tsa.2005.848892

N. B. Yoma and T. F. Pegoraro, Robust speaker verification with state duration modeling, Speech Communication, vol.38, pp.77-88, 2002.
DOI : 10.1016/s0167-6393(01)00044-9

C. You, K. A. Lee, and H. Li, GMM-SVM kernel with a Bhattacharyyabased distance for speaker recognition, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, pp.1300-1312, 2010.

S. J. Young, The general use of tying in phoneme-based HMM speech recognisers, IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.569-572, 1992.

S. J. Young, Springer Handbook of Speech Processing, chapter HMMs and Related Speech Recognition Technologies, pp.539-557, 2008.

T. F. Zheng, The voiceprint recognition activities over China, in: Oriental COCOSDA, 2005.