Towards a French Smart-Home Voice Command Corpus: Design and NLU Experiments

Abstract : Despite growing interest in smart-homes, semantically annotated large voice command corpora for Natural Language development (NLU) are scarce, especially for languages other than English. In this paper, we present an approach to generate customizable synthetic corpora of semantically-annotated French commands for a smart-home. This corpus was used to train three NLU models-a triangular CRF, an attention-based RNN and the Rasa framework-evaluated using a small corpus of real users interacting with a smart home. While the attention model performs best on another large French dataset, on the small smart home corpus the models vary performance across to intent, slot and slot value classification. To the best of our knowledge, no other French corpus of semantically annotated voice commands is currently publicly available
Document type :
Conference papers
Liste complète des métadonnées
Contributor : Michel Vacher <>
Submitted on : Wednesday, September 12, 2018 - 11:22:41 AM
Last modification on : Monday, February 11, 2019 - 4:36:02 PM
Document(s) archivé(s) le : Thursday, December 13, 2018 - 1:50:15 PM


Files produced by the author(s)




Thierry Desot, Stefania Raimondo, Anastasia. Mishakova, François Portet, Michel Vacher. Towards a French Smart-Home Voice Command Corpus: Design and NLU Experiments. 21st International Conference on Text, Speech and Dialogue TSD 2018, Sep 2018, Brno, Czech Republic. pp.509-517, ⟨10.1007/978-3-030-00794-2_55⟩. ⟨hal-01802758⟩



Record views


Files downloads