M. Chan, D. Estève, C. Escriba, and E. Campo, A review of smart homes - present state and future challenges, Computer Methods and Programs in Biomedicine, vol.91, issue.1, pp.55-81, 2008.

M. Vacher, S. Caffiau, F. Portet, B. Meillon, C. Roux et al., Evaluation of a context-aware voice interface for Ambient Assisted Living: qualitative user study vs. quantitative system evaluation, ACM Transactions on Accessible Computing, vol.7, issue.2, p.36, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01138090

M. Weiser, The computer for the 21st century, Scientific American, vol.265, issue.3, pp.66-75, 1991.

G. M. Youngblood and D. J. Cook, Data mining for hierarchical model creation, IEEE Transactions on Systems, Man, and Cybernetics, Part C, vol.37, issue.4, pp.561-572, 2007.

T. van Kasteren, A. Noulas, G. Englebienne, and B. Kröse, Accurate activity recognition in a home setting, Proceedings of UbiComp '08, 2008.

D. J. Cook and M. Schmitter-Edgecombe, Assessing the quality of activities in a smart environment, Methods of Information in Medicine, vol.48, issue.5, pp.480-485, 2009.

N. Zouba, F. Bremond, M. Thonnat, A. Anfosso, E. Pascual et al., A computer system to monitor older adults at home: Preliminary results, Gerontechnology Journal, vol.8, issue.3, pp.129-139, 2009.
URL : https://hal.archives-ouvertes.fr/inria-00455131

J. Cumin, G. Lefebvre, F. Ramparany, and J. L. Crowley, A Dataset of Routine Daily Activities in an Instrumented Home, UCAmI 2017 - 11th International Conference on Ubiquitous Computing and Ambient Intelligence, vol.10586, pp.413-425, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01639673

T. L. van Kasteren, G. Englebienne, and B. J. Kröse, Transferring knowledge of activity recognition across sensor networks, Proceedings of the 8th International Conference on Pervasive Computing, pp.283-300, 2010.

H. Alemdar, H. Ertan, O. D. Incel, and C. Ersoy, ARAS human activity datasets in multiple homes with multiple residents, 7th International Conference on Pervasive Computing Technologies for Healthcare and Workshops, pp.232-235, 2013.
DOI : 10.4108/icst.pervasivehealth.2013.252120
URL : http://eudl.eu/pdf/10.4108/icst.pervasivehealth.2013.252120

J. Barker, R. Marxer, E. Vincent, and S. Watanabe, The CHiME challenges: Robust speech recognition in everyday environments, New Era for Robust Speech Recognition - Exploiting Deep Learning, pp.327-344, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01383263

E. Principi, S. Squartini, F. Piazza, D. Fuselli, and M. Bonifazi, A distributed system for recognizing home automation commands and distress calls in the Italian language, Proceedings of Interspeech, pp.2049-2053, 2013.

M. Ravanelli, L. Cristoforetti, R. Gretter, M. Pellin, A. Sosi et al., The DIRHA-English corpus and related tasks for distant-speech recognition in domestic environments, Proceedings of 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), pp.275-282, 2015.

N. Bertin, E. Camberlein, E. Vincent, R. Lebarbenchon, S. Peillon et al., A French corpus for distant-microphone speech processing in real homes, Proceedings of Interspeech, pp.2781-2785, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01343060

G. Dekkers, S. Lauwereins, B. Thoen, M. W. Adhana, H. Brouckxon et al., The SINS database for detection of daily activities in a home environment using an acoustic sensor network, Proceedings of the Detection and Classification of Acoustic Scenes and Events, pp.32-36, 2017.

S. Intille, K. Larson, E. Tapia, J. Beaudin, P. Kaushik et al., Using a Live-In Laboratory for Ubiquitous Computing Research, 2006.

A. Fleury, M. Vacher, F. Portet, P. Chahuara, and N. Noury, A multimodal corpus recorded in a health smart home, Proceedings of LREC Workshop Multimodal Corpora and Evaluation, pp.99-105, 2010.
URL : https://hal.archives-ouvertes.fr/hal-00486922

M. Vacher, B. Lecouteux, P. Chahuara, F. Portet, B. Meillon et al., The Sweet-Home speech and multimodal corpus for home automation interaction, Proceedings of 9th edition of the Language Resources and Evaluation Conference (LREC), pp.4499-4506, 2014.
URL : https://hal.archives-ouvertes.fr/hal-00953006

P. Lago, F. Lang, C. Roncancio, C. Jiménez-Guarín, R. Mateescu et al., The ContextAct@A4H real-life dataset of daily-living activities: Activity recognition using model checking, CONTEXT, ser. LNCS, vol.10257, pp.175-188, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01551418

F. Aman, Automatic speech recognition for ageing voices in the context of assisted living, 2014.
URL : https://hal.archives-ouvertes.fr/tel-01347155

S. Takahashi, T. Morimoto, S. Maeda, and N. Tsuruta, Dialogue experiment for elderly people in home health care system, Text, Speech and Dialogue, pp.418-423, 2003.

S. Möller, F. Gödde, and M. Wolters, Corpus analysis of spoken smart-home interactions with older users, Proceedings of the 6th International Conference on Language Resources and Evaluation, 2008.

K. Woo, T. Yang, K. Park, and C. Lee, Robust voice activity detection algorithm for estimating noise spectrum, IET Electronics Letters, vol.36, issue.2, pp.180-181, 2000.

S. Mousazadeh and I. Cohen, AR-GARCH in Presence of Noise: Parameter Estimation and its Application to Voice Activity Detection, IEEE Transactions on Audio Speech and Language Processing, vol.19, issue.4, pp.916-926, 2011.

A. Misra, Speech/nonspeech segmentation in web videos, Proceedings of Interspeech, ISCA, 2012.

T. Ng, B. Zhang, L. Nguyen, S. Matsoukas, X. Zhou et al., Developing a speech activity detection system for the DARPA RATS program, Proceedings of Interspeech, 2012.

F. Eyben, F. Weninger, S. Squartini, and B. Schuller, Real-life Voice Activity Detection with LSTM Recurrent Neural Networks and an Application to Hollywood Movies, Proceedings of IEEE ICASSP, 2013.
DOI : 10.1109/icassp.2013.6637694

F. Eyben, F. Weninger, F. Groß, and B. Schuller, Recent developments in openSMILE, the Munich open-source multimedia feature extractor, Proceedings of the 21st ACM International Conference on Multimedia (ACM MM), pp.835-838, 2013.

M. Pitt, L. Dilley, K. Johnson, S. Kiesling, W. Raymond et al., Buckeye corpus of conversational speech, 2007.

J. Garofolo, L. Lamel, W. Fisher, J. Fiscus, D. Pallett et al., TIMIT acoustic-phonetic continuous speech corpus, 1993.

A. S. Crandall and D. J. Cook, Tracking systems for multiple smart home residents, Human behavior recognition technologies: Intelligent applications for monitoring and security, pp.111-129, 2013.
DOI : 10.4018/978-1-4666-3682-8.ch006
URL : http://www.eecs.wsu.edu/~cook/pubs/hbrt11.1.pdf

S. A. Mehdi and K. Berns, A survey of human location estimation in a home environment, The 23rd IEEE International Symposium on Robot and Human Interactive Communication, pp.135-140, 2014.

T. Miyazaki and Y. Kasama, Multiple human tracking using binary infrared sensors, Sensors, vol.15, issue.6, p.13459, 2015.
DOI : 10.3390/s150613459
URL : http://www.mdpi.com/1424-8220/15/6/13459/pdf

D. Bahdanau, K. Cho, and Y. Bengio, Neural machine translation by jointly learning to align and translate, 2014.

A. Milan, S. H. Rezatofighi, A. Dick, I. Reid, and K. Schindler, Online multi-target tracking using recurrent neural networks, Thirty-First AAAI Conference on Artificial Intelligence, 2017.

C. R. Wren and E. M. Tapia, Toward scalable activity recognition for sensor networks, Location- and Context-Awareness, 2006.

P. Chahuara, F. Portet, and M. Vacher, Location of an Inhabitant for Domotic Assistance Through Fusion of Audio and Non-Visual Data, Pervasive Health, pp.1-4, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00953556

A. Benmansour, A. Bouchachia, and M. Feham, Multioccupant activity recognition in pervasive smart home environments, ACM Computing Surveys, vol.48, issue.3, pp.1-36, 2015.
DOI : 10.1145/2835372
URL : http://dl.acm.org/ft_gateway.cfm?id=2835372&type=pdf

S. Sivasankaran, E. Vincent, and D. Fohr, Keyword-based speaker localization: Localizing a target speaker in a multi-speaker environment, Proceedings of Interspeech, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01817519

M. Vacher, E. Vincent, M. Bobillier-Chaumon, T. Joubert, F. Portet et al., The VocADom Project: Speech Interaction for Well-being and Reliance Improvement, MobileHCI 2018 - Workshop Designing Speech and Language Interactions for Mobiles and Wearables, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01830217

T. Desot, S. Raimondo, A. Mishakova, F. Portet, and M. Vacher, Towards a French Smart-Home Voice Command Corpus: Design and NLU Experiments, Proceedings of 21st International Conference on Text, Speech and Dialogue TSD 2018, pp.509-517, 2018.
DOI : 10.1007/978-3-030-00794-2_55
URL : https://hal.archives-ouvertes.fr/hal-01802758

A. Mishakova, F. Portet, T. Desot, and M. Vacher, Learning Natural Language Understanding Systems from Unaligned Labels for Voice Command in Smart Homes, 2019.
URL : https://hal.archives-ouvertes.fr/hal-02013174