Serial order: A parallel, distributed processing approach Advances in Connectionist Theory: Speech, 1989. ,
DOI : 10.1016/s0166-4115(97)80111-2
Finding Structure in Time, Cognitive Science, vol.49, issue.2, pp.179-211, 1990. ,
DOI : 10.1007/BF00308682
Long Short-Term Memory, Neural Computation, vol.4, issue.8, pp.1735-1780, 1997. ,
DOI : 10.1016/0893-6080(88)90007-X
Recurrent neural network based language model, 11th Annual Conference of the International Speech Communication Association, pp.1045-1048, 2010. ,
Extensions of recurrent neural network language model, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.5528-5531, 2011. ,
DOI : 10.1109/ICASSP.2011.5947611
A unified architecture for natural language processing, Proceedings of the 25th international conference on Machine learning, ICML '08, pp.160-167, 2008. ,
DOI : 10.1145/1390156.1390177
Natural language processing (almost) from scratch, J. Mach. Learn. Res, pp.12-2493, 2011. ,
Investigation of recurrent-neuralnetwork architectures and learning methods for spoken language understanding, 2013. ,
Is it time to switch to word embedding and recurrent neural networks for spoken language understanding? In: InterSpeech, 2015. ,
Learning long-term dependencies with gradient descent is difficult, IEEE Transactions on Neural Networks, vol.5, issue.2, pp.157-166, 1994. ,
DOI : 10.1109/72.279181
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.41.7128
Learning Phrase Representations using RNN Encoder???Decoder for Statistical Machine Translation, Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), p.1078, 2014. ,
DOI : 10.3115/v1/D14-1179
URL : https://hal.archives-ouvertes.fr/hal-01433235
Bidirectional lstm-crf models for sequence tagging, 2015. ,
Neural architectures for named entity recognition. arXiv preprint, 2016. ,
DOI : 10.18653/v1/n16-1030
URL : http://arxiv.org/abs/1603.01360
End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF, Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) ,
DOI : 10.18653/v1/P16-1101
URL : http://arxiv.org/abs/1603.01354
Glove: Global Vectors for Word Representation, Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp.1532-1543, 2014. ,
DOI : 10.3115/v1/D14-1162
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.645.8863
Practical very large scale CRFs, Proceedings the 48th Annual Meeting of the Association for Computational Linguistics (ACL), pp.504-513, 2010. ,
Models cascade for tree-structured named entity detection, Proceedings of International Joint Conference of Natural Language Processing (IJCNLP), 2011. ,
Improving recurrent neural networks for sequence labelling, p.2555, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01489976
Conditional random fields: Probabilistic models for segmenting and labeling sequence data, Proceedings of the Eighteenth International Conference on Machine Learning (ICML), pp.282-289, 2001. ,
New recurrent neural network variants for sequence labeling, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01489955
Dropout: A simple way to prevent neural networks from overfitting, Journal of Machine Learning Research, vol.15, pp.1929-1958, 2014. ,
Feature-rich part-of-speech tagging with a cyclic dependency network, Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology , NAACL '03, pp.173-180, 2003. ,
DOI : 10.3115/1073445.1073478
Guided learning for bidirectional sequence classification, Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, pp.760-767, 2007. ,
Spoken language understanding, IEEE Signal Processing Magazine, vol.25, issue.3, pp.50-58, 2008. ,
DOI : 10.1109/MSP.2008.918413
URL : https://hal.archives-ouvertes.fr/hal-01314884
Expanding the scope of the ATIS task, Proceedings of the workshop on Human Language Technology , HLT '94, pp.43-48, 1994. ,
DOI : 10.3115/1075812.1075823
Results of the french evalda-media evaluation campaign for literal understanding, pp.2054-2059, 2006. ,
URL : https://hal.archives-ouvertes.fr/hal-01160167
Efficient estimation of word representations in vector space, p.3781, 2013. ,
A Fast and Accurate Dependency Parser using Neural Networks, Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp.740-750, 2014. ,
DOI : 10.3115/v1/D14-1082
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.654.8984
Linguistic regularities in continuous space word representations In: Human Language Technologies: Conference of the North American Chapter, the Association of Computational Linguistics, pp.746-751, 2013. ,
Practical Recommendations for Gradient-Based Training of Deep Architectures, p.5533, 2012. ,
DOI : 10.1162/089976602317318938
Backpropagation through time: what it does and how to do it, Proceedings of IEEE, pp.1550-1560, 1990. ,
DOI : 10.1109/5.58337
Named entity recognition with bidirectional lstm-cnns, p.8308, 2015. ,
Bidirectional recurrent neural networks, IEEE Transactions on Signal Processing, vol.45, issue.11, pp.2673-2681, 1997. ,
DOI : 10.1109/78.650093
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.331.9441
Generative and discriminative algorithms for spoken language understanding, Proceedings of the International Conference of the Speech Communication Assosiation (Interspeech), pp.1605-1608, 2007. ,
Using Recurrent Neural Networks for Slot Filling in Spoken Language Understanding, Speech, and Language Processing, 2015. ,
DOI : 10.1109/TASLP.2014.2383614
Text Chunking Using Transformation-Based Learning, Proceedings of the 3rd Workshop on Very Large Corpora, pp.84-94, 1995. ,
DOI : 10.1007/978-94-017-2390-9_10
URL : http://arxiv.org/abs/cmp-lg/9505040
Neural Probabilistic Language Models, JOURNAL OF MACHINE LEARNING RESEARCH, vol.3, pp.1137-1155, 2003. ,
DOI : 10.1007/3-540-33486-6_6
URL : https://hal.archives-ouvertes.fr/hal-01434258
Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification, 2015 IEEE International Conference on Computer Vision (ICCV), pp.1026-1034, 2015. ,
DOI : 10.1109/ICCV.2015.123
URL : http://arxiv.org/pdf/1502.01852
Etude des reseaux de neurones recurrents pour etiquetage de sequences, Actes de la 23eme conf ? ©rence sur le Traitement Automatique des Langues Naturelles, 2016. ,
Building a large annotated corpus of english: The penn treebank, COMPUTATIONAL LINGUISTICS, vol.19, pp.313-330, 1993. ,
A Step Beyond Local Observations with a Dialog Aware Bidirectional GRU Network for Spoken Language Understanding, Interspeech 2016, 2016. ,
DOI : 10.21437/Interspeech.2016-1301
URL : https://hal.archives-ouvertes.fr/hal-01351733
Discriminative Reranking for Spoken Language Understanding, IEEE Transactions on Audio, Speech, and Language Processing, vol.20, issue.2, pp.526-539, 2011. ,
DOI : 10.1109/TASL.2011.2162322
URL : https://hal.archives-ouvertes.fr/hal-01478984
Hypotheses selection criteria in a reranking framework for spoken language understanding, Conference of Empirical Methods for Natural Language Processing, pp.1104-1115, 2011. ,
Comparing Stochastic Approaches to Spoken Language Understanding in Multiple Languages, IEEE Transactions on Audio, Speech, and Language Processing, vol.19, issue.6, p.99, 2010. ,
DOI : 10.1109/TASL.2010.2093520
URL : https://hal.archives-ouvertes.fr/hal-00746965
In: Large Margin Rank Boundaries for Ordinal Regression, 2000. ,
Optimizing crfs for slu tasks in various languages using modified training criteria, Proceedings of the International Conference of the Speech Communication Assosiation (Interspeech), 2009. ,
A post-processing system to yield reduced word error rates: Recognizer Output Voting Error Reduction (ROVER), 1997 IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings, pp.347-352, 1997. ,
DOI : 10.1109/ASRU.1997.659110
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.23.5624