Building a Treebank for French, Treebanks, pp.165-187, 2003. ,
Contextual string embeddings for sequence labeling, Proceedings of the 27th International Conference on Computational Linguistics, pp.1638-1649, 2018. ,
Le corpus sequoia : annotation syntaxique et exploitation pour l'adaptation d'analyseur par pont lexical (the sequoia corpus : Syntactic annotation and use for a parser lexical domain adaptation method), Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, vol.2, pp.321-334, 2012. ,
, , 2019.
Introduction to deep learning, 2019. ,
, , 2019.
,
XNLI : evaluating cross-lingual sentence representations, Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pp.2475-2485, 2018. ,
Semi-supervised sequence learning, Advances in Neural Information Processing Systems 28 : Annual Conference on Neural Information Processing Systems, pp.3079-3087, 2015. ,
RobBERT : a Dutch RoBERTa-based Language Model, 2020. ,
, Multilingual bert, 2018.
BERT : pre-training of deep bidirectional transformers for language understanding, Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics : Human Language Technologies, NAACL-HLT 2019, vol.1, pp.4171-4186, 2019. ,
Deep biaffine attention for neural dependency parsing, 5th International Conference on Learning Representations, 2017. ,
Learning word vectors for 157 languages, Proceedings of the Eleventh International Conference on, p.62, 2018. ,
, Language Resources and Evaluation, European Language Resources Association (ELRA), 2018.
Bag of tricks for efficient text classification, Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, vol.2, pp.427-431, 2017. ,
Universal language model fine-tuning for text classification, Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, ACL 2018, vol.1, pp.328-339, 2018. ,
What does bert learn about the structure of language ?, 57th Annual Meeting of the Association for Computational Linguistics (ACL), 2019. ,
URL : https://hal.archives-ouvertes.fr/hal-02131630
, Fasttext.zip : Compressing text classification models, 2016.
Adam : A method for stochastic optimization, 2014. ,
ALBERT : A lite BERT for self-supervised learning of language representations, 2019. ,
Flaubert : Unsupervised language model pre-training for french, 2019. ,
URL : https://hal.archives-ouvertes.fr/hal-02784776
, Roberta : A robustly optimized BERT pretraining approach, pp.1907-11692, 2019.
, CamemBERT : a Tasty French Language Model, 2019.
Advances in pretraining distributed word representations, Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018. ,
Distributed representations of words and phrases and their compositionality, Advances in Neural Information Processing Systems 26 : 27th Annual Conference on Neural Information Processing Systems, pp.3111-3119, 2013. ,
, , p.63
, Faculty of Mathematics and Physics, 2018.
Asynchronous Pipeline for Processing Huge Corpora on Medium to Low Resource Infrastructures, 7th Workshop on the Challenges in the Management of Large Corpora (CMLC-7), p.2148693, 2019. ,
Glove : Global vectors for word representation, Éds., Proceedings of the, 2014. ,
, Conference on Empirical Methods in Natural Language Processing, pp.1532-1543, 2014.
Deep contextualized word representations, Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics : Human Language Technologies, vol.1, pp.2227-2237, 2018. ,
How multilingual is multilingual bert ?, Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, 2019. ,
Language models are unsupervised multitask learners, 2019. ,
Exploring the limits of transfer learning with a unified text-to-text transformer, 2019. ,
Annotation référentielle du corpus arboré de Paris 7 en entités nommées (referential named entity annotation of the paris 7 french treebank), 2012. ,
, Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, vol.2, pp.535-542, 2012.
Evaluating contextualized embeddings on 54 languages in POS tagging, lemmatization and dependency parsing, 2019. ,
Neural architectures for nested NER through linearization, Proceedings of the 57th Conference of the Association for Computational Linguistics, ACL 2019, vol.1, pp.5326-5331, 2019. ,
BERT rediscovers the classical NLP pipeline, Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pp.4593-4601, 2019. ,
, Multilingual is not enough : Bert for finnish, 2019.
, CCNet : Extracting High Quality Monolingual Datasets from Web Crawl Data, 2019.
A broad-coverage challenge corpus for sentence understanding through inference, Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics : Human Language Technologies, NAACL-HLT 2018, vol.1, pp.1112-1122, 2018. ,
Huggingface's transformers : State-of-the-art natural language processing, 2019. ,