D. Bassano, S. Laaha, I. Maillochon, and W. U. Dressler, Early acquisition of verb grammar and lexical development : Evidence from periphrastic constructions in French and Austrian German, First Language, vol.24, issue.1, pp.33-70, 2004.
URL : https://hal.archives-ouvertes.fr/halshs-01064074

D. Bickerton, Language and Species, 1990.

C. Champaud, The development of verb forms in French children at around two years of age : some comparisons with Romance and non-Romance languages, First Lisbon Meeting on Child Language, pp.1-9, 1994.

K. Demuth and A. Tremblay, Prosodically-conditioned variability in children's production of French determiners, Journal of Child Language, vol.35, pp.99-127, 2008.

P. Denis and B. Sagot, Coupling an annotated corpus and a morphosyntactic lexicon for state-of-the-art POS tagging with less human effort, 2009.

J.-L. Dessalles, Une histoire naturelle de la parole, 2000.

H. Goad and M. Buckley, Prosodic structure in child French : Evidence for the foot, Catalan Journal of Linguistics, vol.5, pp.109-142, 2006.

B. Guillaume and G. Perrier, Annotation sémantique du French Treebank à l'aide de la réécriture modulaire de graphes, pp.293-306, 2012.

C. Hamann, S. Ohayon, S. Dubé, U. H. Frauenfelder, L. Rizzi, M. Starke, and P. Zesiger, Aspects of grammatical development in young French children with SLI, Developmental Science, vol.6, pp.151-159, 2003.

H. K. Hartline, H. G. Wagner, and F. Ratliff, Inhibition in the eye of Limulus, Journal of General Physiology, vol.39, pp.651-673, 1956.

H. H. , Aspects of the evolution of the early lexicon in the interactions mother-child : Case study of two dizygotic twin children between 15 and 26 months, 2005.

J. Chung, C. Gulcehre, K. Cho, and Y. Bengio, Empirical evaluation of gated recurrent neural networks on sequence modeling, Workshop on Deep Learning and Representation Learning at the 28th Annual Conference on Advances in Neural Information Processing Systems (NIPS'14), 2014.

R. Kadlec, M. Schmid, and J. Kleindienst, Improved deep learning baselines for Ubuntu corpus dialogs, Workshop on Machine Learning for Spoken Language Understanding and Interaction at the 29th Annual Conference on Neural Information Processing Systems (NIPS'15), 2015.

J. Li, M. Galley et al., A diversity-promoting objective function for neural conversation models, Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics : Human Language Technologies (NAACL'16), pp.110-119, 2016.

X. Li, Y.-N. Chen, L. Li, J. Gao, and A. Celikyilmaz, End-to-end task-completion neural dialogue systems, Proceedings of the Eighth International Joint Conference on Natural Language Processing, pp.733-743, 2017.

C.-W. Liu, R. Lowe, I. Serban, M. Noseworthy, L. Charlin, and J. Pineau, How not to evaluate your dialogue system : An empirical study of unsupervised evaluation metrics for dialogue response generation, Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing (EMNLP'16), pp.2122-2132, 2016.

R. Lowe, N. Pow, I. Serban, and J. Pineau, The Ubuntu dialogue corpus : A large dataset for research in unstructured multi-turn dialogue systems, Proceedings of the 16th Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL'15), pp.285-294, 2015.

R. T. Lowe, N. Pow, I. V. Serban, L. Charlin, C.-W. Liu, and J. Pineau, Training end-to-end dialogue systems with the Ubuntu dialogue corpus, Dialogue & Discourse, vol.8, issue.1, pp.31-65, 2017.

T. Mikolov, K. Chen, G. Corrado, and J. Dean, Efficient estimation of word representations in vector space, Proceedings of the International Conference on Learning Representations (ICLR'13), pp.1-12, 2013.

Y. Chen, L. Xu, K. Liu, D. Zeng et al., Event Extraction via Dynamic Multi-Pooling Convolutional Neural Networks, 53rd Annual Meeting of the Association for Computational Linguistics and 7th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2015), pp.167-176, 2015.

S. Duan, R. He, and W. Zhao, Exploiting Document Level Information to Improve Event Detection via Recurrent Neural Networks, Eighth International Joint Conference on Natural Language Processing, pp.352-361, 2017.

X. Feng, L. Huang, D. Tang, H. Ji, B. Qin, and T. Liu, A Language-Independent Neural Network for Event Detection, 54th Annual Meeting of the Association for Computational Linguistics (ACL 2016), pp.66-71, 2016.

K. Gimpel and N. A. Smith, Softmax-Margin CRFs: Training Log-Linear Models with Cost Functions, Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL HLT 2010), pp.733-736, 2010.

J. S. , L. Y. Qin, T. , and M. Q. Dong-b, SRCB Entity Discovery and Linking (EDL) and Event Nugget Systems for TAC 2017, Text Analysis Conference, 2017.

D. Kodelja, R. Besançon, O. Ferret, H. Le Borgne, and E. Boros, CEA LIST Participation to the TAC 2017 Event Nugget Track, Text Analysis Conference, 2017.

Q. Le and T. Mikolov, Distributed Representations of Sentences and Documents, 31st International Conference on Machine Learning, pp.1188-1196, 2014.

P. Makarov and S. Clematide, UZH at TAC KBP 2017: Event Nugget Detection via Joint Learning with Softmax-Margin Objective, Text Analysis Conference, 2017.

C. D. Manning, M. Surdeanu, J. Bauer, J. Finkel, S. J. Bethard et al., The Stanford CoreNLP Natural Language Processing Toolkit, 52nd Annual Meeting of the Association for Computational Linguistics (ACL 2014), system demonstrations, pp.55-60, 2014.

T. Mikolov, I. Sutskever, K. Chen, G. S. Corrado, and J. Dean, Distributed Representations of Words and Phrases and Their Compositionality, 26th International Conference on Neural Information Processing Systems (NIPS 2013), pp.3111-3119, 2013.

T. H. Nguyen, K. Cho, and R. Grishman, Joint Event Extraction via Recurrent Neural Networks, 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL HLT 2016), pp.300-309, 2016.

T. H. Nguyen and R. Grishman, Event Detection and Domain Adaptation with Convolutional Neural Networks, 53rd Annual Meeting of the Association for Computational Linguistics and 7th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2015), pp.365-371, 2015.

T. H. Nguyen, R. Grishman, and A. Meyers, New York University 2016 System for KBP Event Nugget: A Deep Learning Approach, Text Analysis Conference, 2016.

N. Reimers and I. Gurevych, Reporting Score Distributions Makes a Difference: Performance Study of LSTM-networks for Sequence Tagging, 2017 Conference on Empirical Methods in Natural Language Processing, pp.338-348, 2017.

S. Argamon, C. Whitelaw, P. Chase, S. R. Hota, N. Garg, and S. Levitan, Stylistic text classification using functional lexical features, Journal of the Association for Information Science and Technology, vol.58, issue.6, pp.802-822, 2007.

M. Baroni and S. Bernardini, BootCaT : Bootstrapping corpora and terms from the web, Proceedings of the Language Resources and Evaluation Conference (LREC), pp.1313-1316, 2004.

D. Biber and S. Conrad, Register, genre, and style, 2009.

D. Biber and E. Finegan, Sociolinguistic perspectives on register, 1994.

A. Borzeix and B. Fraenkel, Langage et travail (communication, cognition, action), 2005.

P. Charaudeau, Le discours d'information médiatique : la construction du miroir social, 1997.

L.-A. Cougnon and C. Fairon, SMS Communication : A linguistic approach, vol.61, 2014.

O. de Vel, A. Anderson, M. Corney, and G. Mohay, Mining e-mail content for author identification forensics, ACM Sigmod Record, vol.30, issue.4, pp.55-64, 2001.

J. Eisenstein, What to do about bad language on the internet, Proceedings of North American Chapter of the Association for Computational Linguistics : Human Language Technologies (HLT-NAACL), pp.359-369, 2013.

H. J. Escalante, T. Solorio, and M. Montes-y-Gómez, Local histograms of character n-grams for authorship attribution, Proceedings of the Annual Meeting of the Association for Computational Linguistics : Human Language Technologies (ACL-HLT), pp.288-298, 2011.

F. Gadet, Niveaux de langue et variation intrinsèque, Palimpsestes, vol.10, pp.17-40, 1996.

P. Gianfortoni, D. Adamson, and C. P. Rosé, Modeling of stylistic variation in social media with stretchy patterns, Proceedings of the Workshop on Algorithms and Resources for Modelling of Dialects and Language Varieties, pp.49-59, 2011.

G. Hirst and O. Feiguina, Bigrams of syntactic labels for authorship discrimination of short texts, Literary and Linguistic Computing, vol.22, issue.4, pp.405-417, 2007.

F. Iqbal, H. Binsalleeh, B. C. M. Fung, and M. Debbabi, A unified data mining solution for authorship analysis in anonymous textual communications, Information Sciences, vol.231, pp.98-112, 2013.

C. Kobus, F. Yvon, and G. Damnati, Normalizing SMS : are two metaphors better than one ?, Proceedings of the International Conference on Computational Linguistics (COLING), pp.441-448, 2008.

M. Koppel and J. Schler, Exploiting stylistic idiosyncrasies for authorship attribution, Proceedings of IJCAI Workshop on Computational Approaches to Style Analysis and Synthesis, vol.69, pp.72-80, 2003.

Q. Le and T. Mikolov, Distributed representations of sentences and documents, Proceedings of the International Conference on Machine Learning (ICML), pp.1188-1196, 2014.

G. Lecorvé, G. Gravier, and P. Sébillot, On the use of web resources and natural language processing techniques to improve automatic speech recognition systems, Proceedings of the Language Resources and Evaluation Conference (LREC), pp.592-599, 2008.

P. Bojanowski, E. Grave, A. Joulin, and T. Mikolov, Enriching word vectors with subword information, Transactions of the Association for Computational Linguistics, vol.5, pp.135-146, 2017.

C. Buciluǎ, R. Caruana, and A. Niculescu-Mizil, Model compression, Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining, pp.535-541, 2006.

W. Chan, N. Jaitly, Q. V. Le, and O. Vinyals, Listen, attend and spell : A neural network for large vocabulary conversational speech recognition, ICASSP, 2016.

Y. S. Chan, H. T. Ng, and Z. Zhong, NUS-PT : Exploiting parallel texts for word sense disambiguation in the English all-words tasks, Proceedings of the 4th International Workshop on Semantic Evaluations, SemEval '07, pp.253-256, 2007.

K. Cho, B. van Merriënboer, D. Bahdanau, and Y. Bengio, On the properties of neural machine translation : Encoder-decoder approaches, Proceedings of SSST-8, Eighth Workshop on Syntax, Semantics and Structure in Statistical Translation, pp.103-111, 2014.

P. Edmonds and S. Cotton, Senseval-2 : Overview, The Proceedings of the Second International Workshop on Evaluating Word Sense Disambiguation Systems, SENSEVAL '01, pp.1-5, 2001.

G. Hinton, O. Vinyals, and J. Dean, Distilling the knowledge in a neural network, 2015.

S. Hochreiter and J. Schmidhuber, Long short-term memory, Neural Computation, vol.9, issue.8, pp.1735-1780, 1997.

E. Hovy, M. Marcus, M. Palmer, L. Ramshaw, and R. Weischedel, OntoNotes : The 90% solution, Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume : Short Papers, NAACL-Short '06, pp.57-60, 2006.

I. Iacobacci, M. T. Pilehvar, and R. Navigli, Embeddings for word sense disambiguation : An evaluation study, Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, vol.1, pp.897-907, 2016.

N. Ide, C. Baker, C. Fellbaum, C. Fillmore, and R. Passonneau, MASC : the manually annotated sub-corpus of American English, Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08), 2008.

M. Kågebäck and H. Salomonsson, Word sense disambiguation using a bidirectional LSTM, 5th Workshop on Cognitive Aspects of the Lexicon (CogALex), 2016.

S. W. Kim and H.-E. Kim, Transferring knowledge to smaller network with class-distance loss, 2017.

D. P. Kingma and J. Ba, Adam : A method for stochastic optimization, 2014.

T. Mikolov, I. Sutskever, K. Chen, G. S. Corrado, and J. Dean, Distributed representations of words and phrases and their compositionality, Advances in Neural Information Processing Systems, vol.26, pp.3111-3119, 2013.

D. de Kok, J. Ma, C. Dima, and E. Hinrichs, PP attachment : Where do we stand ?, p.311, 2017.

S. Delecraz, A. Nasr, F. Bechet, and B. Favre, Correcting prepositional phrase attachments using multimodal corpora, Proceedings of the 15th International Conference on Parsing Technologies, pp.72-77, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01693292

F. Faghri, D. J. Fleet, J. R. Kiros, and S. Fidler, VSE++ : Improved visual-semantic embeddings, 2017.

H. Fang, S. Gupta, F. N. Iandola et al., From captions to visual concepts and back, IEEE Conference on Computer Vision and Pattern Recognition, pp.1473-1482, 2015.

K. He, X. Zhang, S. Ren, and J. Sun, Deep residual learning for image recognition, Proceedings of the IEEE conference on computer vision and pattern recognition, pp.770-778, 2016.

A. Karpathy and L. Fei-Fei, Deep visual-semantic alignments for generating image descriptions, IEEE Conference on Computer Vision and Pattern Recognition, pp.3128-3137, 2015.

M. P. Marcus, B. Santorini, and M. A. Marcinkiewicz, Building a large annotated corpus of English : The Penn Treebank, Computational Linguistics, vol.19, issue.2, pp.313-330, 1993.

S. Mirroshandel and A. Nasr, Integrating selectional constraints and subcategorization frames in a dependency parser, Computational Linguistics, 2016.

J. Peyre, I. Laptev, C. Schmid, and J. Sivic, Weakly-supervised learning of visual relations, IEEE International Conference on Computer Vision, pp.5189-5198, 2017.

B. A. Plummer, L. Wang, C. M. Cervantes, J. C. Caicedo, J. Hockenmaier, and S. Lazebnik, Flickr30k entities : Collecting region-to-phrase correspondences for richer image-to-sentence models, International Journal of Computer Vision, vol.123, issue.1, pp.74-93, 2017.

G. Rakshit, S. Sontakke, P. Bhattacharyya, and G. Haffari, Prepositional attachment disambiguation using bilingual parsing and alignments, 2016.

A. M. Schaerlaekens, The Two-Word Sentence in Child Language Development : A Study Based on Evidence Provided by Dutch-Speaking Triplets, 1973.

C. E. Snow, Mothers' speech to children learning language, Child Development, vol.43, issue.2, pp.549-565, 1972.

M. J. Spivey, M. K. Tanenhaus, K. M. Eberhard, and J. C. Sedivy, Eye movements and spoken language comprehension : Effects of visual context on syntactic ambiguity resolution, Cognitive Psychology, vol.45, issue.4, pp.447-481, 2002.

O. Vinyals, A. Toshev, S. Bengio, and D. Erhan, Show and tell : A neural image caption generator, IEEE Conference on Computer Vision and Pattern Recognition, pp.3156-3164, 2015.

P. Young, A. Lai, M. Hodosh, and J. Hockenmaier, From image descriptions to visual denotations : New similarity metrics for semantic inference over event descriptions, Transactions of the Association for Computational Linguistics, vol.2, 2014.

S. Bengio and G. Heigold, Word embeddings for speech recognition, Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014.

T. Bluche, J. Louradour, and R. Messina, Scan, Attend and Read : End-to-End Handwritten Paragraph Recognition with MDLSTM Attention, Proceedings of the 14th International Conference on Document Analysis and Recognition (ICDAR'17), 2017.

P. Bojanowski, E. Grave, A. Joulin, and T. Mikolov, Enriching word vectors with subword information, Transactions of the Association for Computational Linguistics, vol.5, issue.1, pp.135-146, 2017.

M. Bollmann, J. Bingel, and A. Søgaard, Learning attention for historical text normalization by learning to pronounce, Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (ACL'17), pp.332-344, 2017.

K. Cho, B. van Merriënboer, D. Bahdanau, and Y. Bengio, On the Properties of Neural Machine Translation : Encoder-Decoder Approaches, Proceedings of the 8th Workshop on Syntax, Semantics and Structure in Statistical Translation (SSST'14), pp.103-111, 2014.

K. Cho, B. van Merriënboer, C. Gulcehre, D. Bahdanau, F. Bougares, H. Schwenk, and Y. Bengio, Learning phrase representations using RNN encoder-decoder for statistical machine translation, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01433235

F. Cloppet, V. Eglin, V. C. Kieu, D. Stutzmann, and N. Vincent, ICFHR2016 Competition on Classification of Medieval Handwritings in Latin Script, Proceedings of the 15th International Conference on Frontiers in Handwriting Recognition (ICFHR'16), pp.590-595, 2016.

A. Fischer, A. Keller, V. Frinken, and H. Bunke, Lexicon-free handwritten word spotting using character HMMs, PRL, vol.33, issue.7, pp.934-942, 2012.

D. Garrette and H. Alpert-Abrams, An unsupervised model of orthographic variation for historical document transcription, Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics : Human Language Technologies (NAACL-HLT'16), pp.467-472, 2016.

E. Granell, E. Chammas, L. Likforman-Sulem, C.-D. Martínez-Hinarejos, C. Mokbel, and B.-I. Cîrstea, Transcription of Spanish historical handwritten documents with deep neural networks, Journal of Imaging, vol.4, issue.1, p.15, 2018.

E. Grosicki and H. El Abed, ICDAR 2011-French Handwriting Recognition Competition, Proceedings of the 11th International Conference on Document Analysis and Recognition (ICDAR'11), pp.1459-1463, 2011.

J. Lladós, M. Rusiñol, A. Fornés, D. Fernández, and A. Dutta, On the influence of word representations for handwritten word spotting in historical documents, IJPRAI, vol.26, issue.05, pp.1263002-1263003, 2012.

V. Nair and G. E. Hinton, Rectified Linear Units Improve Restricted Boltzmann Machines, Proceedings of the 27th international conference on machine learning (ICML'10), pp.807-814, 2010.

H. Nakayama and N. Nishida, Zero-resource machine translation by multimodal encoder-decoder network with multimedia pivot, Machine Translation, vol.31, pp.49-64, 2017.

S. J. Pan and Q. Yang, A survey on transfer learning, IEEE Transactions on knowledge and data engineering, vol.22, issue.10, pp.1345-1359, 2010.

I. Pratikakis, K. Zagoris, G. Barlas, and B. Gatos, Proceedings of the 15th International Conference on Frontiers in Handwriting Recognition (ICFHR'16), pp.619-623, 2016.

V. Romero, A. Fornés, N. Serrano, J. A. Sánchez, A. H. Toselli, V. Frinken et al., The ESPOSALLES database : An ancient marriage license corpus for off-line HWR, Pattern Recognition, vol.46, pp.1658-1669, 2013.

J. A. Sanchez, V. Romero, A. H. Toselli, M. Villegas, and E. Vidal, ICDAR2017 Competition on Handwritten Text Recognition on the READ Dataset, Proceedings of the 14th International Conference on Document Analysis and Recognition (ICDAR'17), pp.1383-1388, 2017.

C. Vania and A. Lopez, From Characters to Words to in Between : Do We Capture Morphology ?, Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (ACL'17), pp.2016-2027, 2017.

O. Vinyals, A. Toshev, S. Bengio, and D. Erhan, Show and tell : A neural image caption generator, Computer Vision and Pattern Recognition (CVPR), 2015 IEEE Conference on, pp.3156-3164, 2015.

E. Bick and T. Didriksen, CG-3 - Beyond classical constraint grammar, Proceedings of the 20th Nordic Conference of Computational Linguistics, pp.31-39, 2015.

P. Bojanowski, E. Grave, A. Joulin, and T. Mikolov, Enriching word vectors with subword information, 2016.

M. L. Forcada, M. Ginestí-Rosell, J. Nordfalk, J. O'Regan, S. Ortiz-Rojas, J. A. Pérez-Ortiz et al., Apertium: a free/open-source platform for rule-based machine translation, Machine Translation, vol.25, issue.2, pp.127-144, 2011.

R. Hemon, Breton Grammar, Evertype, 2007.

E. Kiperwasser and Y. Goldberg, Simple and accurate dependency parsing using bidirectional LSTM feature representations, TACL, vol.4, pp.313-327, 2016.

L. T. , , 2016.

T. Lynn and J. Foster, Universal Dependencies for Irish, Proceedings of CLTW 2016, 2016.

J. Nivre, M.-C. de Marneffe, F. Ginter, Y. Goldberg, J. Hajič, C. D. Manning et al., Universal Dependencies v1: A Multilingual Treebank Collection, Proceedings of Language Resources and Evaluation Conference (LREC'16), 2016.

J. Nivre, J. Hall, J. Nilsson, A. Chanev, G. Eryiğit, S. Kübler et al., MaltParser: A language-independent system for data-driven dependency parsing, Natural Language Engineering, vol.13, issue.2, pp.95-135, 2007.

J. Bjerva, J. Bos, R. van der Goot, and M. Nissim, The meaning factory: Formal semantics for recognizing textual entailment and determining semantic similarity, SemEval@COLING, pp.642-646, 2014.

R. Collobert, J. Weston, L. Bottou, M. Karlen, K. Kavukcuoglu, and P. Kuksa, Natural language processing (almost) from scratch, J. Mach. Learn. Res, vol.12, pp.2493-2537, 2011.

E. , Colloque International Francophone sur l'Ecrit et le Document, CORIA 2016-Conférence en Recherche d'Informations et Applications-13th French Information Retrieval Conference, pp.235-250, 2016.

K. Greff, R. K. Srivastava, J. Koutník, B. R. Steunebrink, and J. Schmidhuber, LSTM: A search space odyssey, 2015.

H. He, K. Gimpel, and J. Lin, Multi-perspective sentence similarity modeling with convolutional neural networks, Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp.1576-1586, 2015.

Y. Kim, Convolutional neural networks for sentence classification, Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, pp.1746-1751, 2014.

R. Kiros, Y. Zhu, R. Salakhutdinov, R. S. Zemel, A. Torralba, R. Urtasun, and S. Fidler, Skip-thought vectors, Proceedings of the 28th International Conference on Neural Information Processing Systems, NIPS'15, pp.3294-3302, 2015.

A. Lai and J. Hockenmaier, Illinois-LH: A denotational and distributional approach to semantics, Proceedings of the 8th International Workshop on Semantic Evaluation, pp.329-334, 2014.

Q. Li and J. Racine, Nonparametric estimation of distributions with categorical and continuous data, Journal of Multivariate Analysis, vol.86, issue.2, pp.266-292, 2003.

M. Marelli, S. Menini, M. Baroni, L. Bentivogli, R. Bernardi, and R. Zamparelli, A SICK cure for the evaluation of compositional distributional semantic models, Proceedings of the Ninth International Conference on Language Resources and Evaluation, LREC, pp.216-223, 2014.

J. Mueller and A. Thyagarajan, Siamese recurrent architectures for learning sentence similarity, Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, AAAI'16, pp.2786-2792, 2016.

B. Rychalska, K. Pakulska, K. Chodorowska, W. Walczak, and P. Andruszkiewicz, Samsung Poland NLP team at SemEval-2016 task 1: Necessity for diversity; combining recursive autoencoders, WordNet and ensemble methods to measure semantic similarity, SemEval@NAACL-HLT, 2016.

A. Severyn, M. Nicosia, and A. Moschitti, Learning semantic textual similarity with structural representations, ACL, 2013.

K. S. Tai, R. Socher, and C. D. Manning, Improved semantic representations from tree-structured long short-term memory networks, 2015.

F. Benamara, N. Hatout, P. Muller, and S. Ozdowska (eds.), Actes de TALN, 2007.

J. F. Cai, W. S. Lee, and Y. W. Teh, NUS-ML : Improving word sense disambiguation using topic features, Proceedings of the 4th International Workshop on Semantic Evaluations, SemEval '07, pp.249-252, 2007.

Y. S. Chan, H. T. Ng, and Z. Zhong, NUS-PT : exploiting parallel texts for word sense disambiguation in the English all-words tasks, Proceedings of the 4th International Workshop on Semantic Evaluations, pp.253-256, 2007.

M. Hadj Salah, H. Blanchon et al., Amélioration de la traduction automatique d'un corpus annoté, Traitement automatique des langues naturelles, 2015.

P. Koehn, H. Hoang, A. Birch, C. Callison-Burch, M. Federico, N. Bertoldi, B. Cowan et al., Moses : Open source toolkit for statistical machine translation, Proceedings of the 45th annual meeting of the ACL on interactive poster and demonstration sessions, pp.177-180, 2007.

M. Laignelet and F. Rioult, Repérer automatiquement les segments obsolescents à l'aide d'indices sémantiques et discursifs, Actes de TALN 2009, 2009.

P. Langlais and A. Patry, Enrichissement d'un lexique bilingue par analogie, Actes de TALN 2007, pp.101-110, 2007.

G. A. Miller, Wordnet : a lexical database for english, Communications of the ACM, vol.38, issue.11, pp.39-41, 1995.
DOI : 10.1145/219717.219748

M. Nasiruddin, A. Tchechmedjiev, H. Blanchon, and D. Schwab, Création rapide et efficace d'un système de désambiguïsation lexicale pour une langue peu dotée, pp.2015-2037, 2015.

A. Novischi, M. Srikanth, and A. Bennett, LCC-WSD : System description for English coarse grained all words task at SemEval, Proceedings of the 4th International Workshop on Semantic Evaluations, SemEval '07, pp.223-226, 2007.

D. Schwab, Cours master MoSIG, 2017.

V. Seretan and E. Wehrli, Collocation translation based on sentence alignment and parsing, Actes de TALN 2007, pp.401-410, 2007.

L. Vial, B. Lecouteux, and D. Schwab, Uniformisation de corpus anglais annotés en sens, 2017.

R. Weischedel, M. Palmer et al., OntoNotes release 5.0, LDC2013T19, Web Download, Philadelphia : Linguistic Data Consortium, 2013.

, base de connaissances pour la désambiguïsation d'entités nommées, Actes de la 23e conférence sur le Traitement Automatique des Langues Naturelles, pp.290-303

K. Bollacker, C. Evans, P. Paritosh, T. Sturge, and J. Taylor, Freebase: A Collaboratively Created Graph Database for Structuring Human Knowledge, Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data, pp.1247-1250, 2008.

S. Cucerzan, Large-scale named entity disambiguation based on Wikipedia data, Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pp.708-716, 2007.

D. M. Mcnamee, P. Rao-d, . Gerber-a.-&-finin-t.-;-fang-w, J. Zhang, W. D. et al., Entity Disambiguation by Knowledge and Text Jointly Embedding, Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp.457-466, 2009.

J. Lehmann, R. Isele, M. Jakob, A. Jentzsch, D. Kontokostas, P. N. Mendes, S. Hellmann et al., DBpedia - A large-scale, multilingual knowledge base extracted from Wikipedia, Semantic Web Journal, vol.6, issue.2, pp.167-195, 2015.

X. Ling, S. Singh, and D. S. Weld, Design Challenges for Entity Linking, Transactions of the Association for Computational Linguistics (TACL), vol.3, pp.315-328, 2015.

T. Mikolov, I. Sutskever, K. Chen, G. S. Corrado, and J. Dean, Distributed Representations of Words and Phrases and their Compositionality, Advances in Neural Information Processing Systems, vol.26, pp.3111-3119, 2013.

J. G. Moreno, R. Besançon, R. Beaumont, E. D'hondt, A.-L. Ligozat, S. Rosset, X. Tannier, and B. Grau, Combining Word and Entity Embeddings for Entity Linking, The Semantic Web, vol.10249, 2017.

A. Moro, A. Raganato, and R. Navigli, Entity Linking meets Word Sense Disambiguation: a Unified Approach, Transactions of the Association for Computational Linguistics, vol.2, pp.231-244, 2014.

M. Nickel, V. Tresp, and H.-P. Kriegel, Factorizing YAGO: scalable machine learning for linked data, Proceedings of the 21st international conference on World Wide Web, pp.271-280, 2012.

B. Perozzi, R. Al-Rfou, and S. Skiena, DeepWalk: Online Learning of Social Representations, Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp.701-710, 2014.

S. Riedel, L. Yao, A. McCallum, and B. M. Marlin, Relation Extraction with Matrix Factorization and Universal Schemas, Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL HLT 2013), pp.74-84, 2013.

W. Shen, J. Wang, and J. Han, Entity Linking with a Knowledge Base: Issues, Techniques, and Solutions, Transactions on Knowledge & Data Engineering, vol.27, issue.2, pp.443-460, 2015.

F. M. Suchanek, G. Kasneci, and G. Weikum, Yago: A Core of Semantic Knowledge, Proceedings of the 16th International Conference on World Wide Web, pp.697-706, 2007.

R. Usbeck, A.-C. Ngonga Ngomo, M. Röder, D. Gerber, S. A. Coelho, S. Auer, and A. Both, AGDISTIS - Graph-Based Disambiguation of Named Entities using Linked Data, The Semantic Web-ISWC, vol.8796, pp.457-471, 2014.

D. Vrandečić and M. Krötzsch, Wikidata: A Free Collaborative Knowledgebase, Communications of the ACM, vol.57, issue.10, pp.78-85, 2014.

J. Weston, A. Bordes, O. Yakhnenko, and N. Usunier, Connecting Language and Knowledge Bases with Embedding Models for Relation Extraction, Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing (EMNLP 2013), pp.1366-1371, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00880455

I. Yamada, H. Shindo, H. Takeda, and Y. Takefuji, Joint Learning of the Embedding of Words and Entities for Named Entity Disambiguation, Proceedings of the 20th SIGNLL Conference on Computational Natural Language Learning, pp.250-259, 2016.

R. Blin, Evaluation des traductions automatiques en français des titres de presse japonais, 2014.

R. Blin, Automatic Evaluation of Alignments without using a Gold-Corpus - Example with French-Japanese Aligned Corpora, 2018.

Y. Chen, L. Wang, C. Boitet, and X. Shi, On-going Cooperative Research towards Developing Economy-Oriented Chinese-French SMT Systems with a New SMT Framework, Actes de la 21e conférence sur le Traitement Automatique des Langues Naturelles, pp.401-406, 2014.
URL : https://hal.archives-ouvertes.fr/hal-02014321

F. Cromieres, C. Chu, T. Nakazawa, and S. Kurohashi, Kyoto University Participation to WAT 2016, Proceedings of the 3rd Workshop on Asian Translation (WAT2016), pp.166-174, 2016.

J. Du and A. Way, Pre-reordering for Neural Machine Translation : Helpful or Harmful ?, The Prague Bulletin of Mathematical Linguistics, vol.108, pp.171-182, 2017.

M. Johnson, M. Schuster, Q. V. Le, M. Krikun, Y. Wu, Z. Chen, N. Thorat et al., Google's Multilingual Neural Machine Translation System : Enabling Zero-Shot Translation, 2016.

G. Klein, Y. Kim, Y. Deng, J. Senellart, and A. M. Rush, OpenNMT : Open-Source Toolkit for Neural Machine Translation, Proceedings of ACL 2017, System Demonstrations, pp.67-72, 2017.

T. Nakazawa, H. Mino, I. Goto, G. Neubig, S. Kurohashi, and E. Sumita, Overview of the 2nd Workshop on Asian Translation, Proceedings of the 2nd Workshop on Asian Translation (WAT2015), pp.1-28, 2015.

T. Nakazawa, M. Yaguchi, K. Uchimoto, M. Utiyama, E. Sumita, S. Kurohashi, and H. Isahara, ASPEC : Asian Scientific Paper Excerpt Corpus, Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016), 2016.

G. Neubig and K. Duh, On the Elements of an Accurate Tree-to-String Machine Translation System, The 52nd Annual Meeting of the Association for Computational Linguistics, 2014.

G. Neubig, Y. Nakata, and S. Mori, Pointwise Prediction for Robust, Adaptable Japanese Morphological Analysis, Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics : Human Language Technologies : Short Papers, vol.2, pp.529-533, 2011.

G. Neubig, T. Watanabe, and S. Mori, Inducing a Discriminative Parser to Optimize Machine Translation Reordering, Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, p.45, 2012.

K. Papineni, S. Roukos, T. Ward, and W.-J. Zhu, BLEU : a Method for Automatic Evaluation of Machine Translation, Proc. ACL, pp.311-318, 2002.

T. Takezawa, E. Sumita, F. Sugaya, H. Yamamoto, and S. Yamamoto, Toward a Broad-Coverage Bilingual Corpus for Speech Translation of Travel Conversations in the Real World, International Conference on Language Resources and Evaluation, pp.147-152, 2002.

J. Tiedemann and L. Nygaard, The OPUS corpus - parallel & free, Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004.

T. and T. , Investigating Phrase-Based and Neural-Based Machine Translation on Low-Resource Settings, 2017.

H. Uchino, S. Shirai, A. Yokoo, Y. Ooyama, and O. Furuse, ALTFLASH : A Japanese-English Machine Translation System for Market Flash Reports, The Transactions of the Institute of Electronics, D-II, vol.84, issue.6, pp.1167-1174, 2001.

Y. Wu, M. Schuster, Z. Chen, Q. V. Le, M. Norouzi, W. Macherey et al., Google's Neural Machine Translation System : Bridging the Gap between Human and Machine Translation, 2016.

M. Baroni, G. Dinu, and G. Kruszewski, Don't count, predict! A systematic comparison of context-counting vs. context-predicting semantic vectors, 52nd Annual Meeting of the Association for Computational Linguistics (ACL 2014), pp.238-247, 2014.

E. Bruni, N. K. Tran, and M. Baroni, Multimodal distributional semantics, Journal of Artificial Intelligence Research, vol.49, pp.1-47, 2014.

D. , cocor: A Comprehensive Solution for the Statistical Comparison of Correlations, PLOS ONE, vol.10, issue.4, pp.1-12, 2015.

J. Duchi, E. Hazan, and Y. Singer, Adaptive Subgradient Methods for Online Learning and Stochastic Optimization, Journal of Machine Learning Research, vol.12, pp.2121-2159, 2011.

M. Faruqui, J. Dodge, S. K. Jauhar, C. Dyer, E. Hovy, and N. A. Smith, Retrofitting Word Vectors to Semantic Lexicons, NAACL HLT 2015, pp.1606-1615, 2015.

W. Gale, K. Church, and D. Yarowsky, Work on statistical methods for word sense disambiguation, AAAI Fall Symposium on Probabilistic Approaches to Natural Language, pp.54-60, 1992.

G. Halawi, G. Dror, E. Gabrilovich, and Y. Koren, Large-scale Learning of Word Relatedness with Constraints, 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'12), pp.1406-1414, 2012.

F. Hill, R. Reichart, and A. Korhonen, SimLex-999: Evaluating semantic models with (genuine) similarity estimation, Computational Linguistics, vol.41, issue.4, pp.665-695, 2015.

D. Kiela, F. Hill, and S. Clark, Specializing Word Embeddings for Similarity or Relatedness, Conference on Empirical Methods in Natural Language Processing, pp.2044-2048, 2015.

O. Levy and Y. Goldberg, Dependency-Based Word Embeddings, 52nd Annual Meeting of the Association for Computational Linguistics (ACL 2014), pp.302-308, 2014.

B. Li, T. Liu, Z. Zhao, B. Tang, A. Drozd, A. Rogers, and X. Du, Investigating Different Syntactic Context Types and Context Representations for Learning Word Embeddings, 2017 Conference on Empirical Methods in Natural Language Processing, pp.2421-2431, 2017.

T. Mikolov, K. Chen, G. Corrado, and J. Dean, Efficient estimation of word representations in vector space, ICLR 2013, workshop track, 2013.

G. A. Miller, WordNet: An On-Line Lexical Database, International Journal of Lexicography, vol.3, issue.4, 1990.

N. Mrkšić, D. Ó Séaghdha, B. Thomson, M. Gašić et al., Counter-fitting Word Vectors to Linguistic Constraints, 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL HLT 2016), pp.142-148, 2016.

N. Mrkšić, I. Vulić, D. Ó Séaghdha, I. Leviant, R. Reichart et al., Semantic Specialization of Distributional Word Vector Spaces using Monolingual and Cross-Lingual Constraints, Transactions of the Association for Computational Linguistics, vol.5, pp.309-324, 2017.

M. J. , All-but-the-Top: Simple and Effective Postprocessing for Word Representations, Sixth International Conference on Learning Representations (ICLR 2018), poster session, 2018.

C. Napoles, M. R. Gormley, and B. Van Durme, Annotated Gigaword, NAACL Joint Workshop on Automatic Knowledge Base Construction and Web-scale Knowledge Extraction (AKBC-WEKEX), pp.95-100, 2012.

J. Pennington, R. Socher, and C. D. Manning, GloVe: Global Vectors for Word Representation, Conference on Empirical Methods in Natural Language Processing, pp.1532-1543, 2014.

S. H. , Dimensions of meaning, ACM/IEEE conference on Supercomputing, pp.787-796, 1992.

I. Vulić, N. Mrkšić, R. Reichart, D. Ó Séaghdha, S. Young, and A. Korhonen, Morph-fitting: Fine-Tuning Word Vector Spaces with Simple Language-Specific Rules, 55th Annual Meeting of the Association for Computational Linguistics, pp.56-68, 2017.

J. Wieting, M. Bansal, K. Gimpel, and K. Livescu, From Paraphrase Database to Compositional Paraphrase Model and Back, Transactions of the Association for Computational Linguistics, vol.3, pp.345-358, 2015.

W. Yin and H. Schütze, Learning Word Meta-Embeddings, 54th Annual Meeting of the Association for Computational Linguistics (ACL 2016), pp.1351-1360, 2016.

O. Bonami and D. Godard, Syntaxe des incises de citation, Actes du premier Congrès Mondial de Linguistique Française, pp.2395-2408, 2008.

B. , Traitement des incises en français : capture automatique et modèle prosodique, XXIVèmes Journées d'Étude sur la Parole, 2002.

M. Candito, B. Crabbé, P. Denis, and F. Guérin, Analyse syntaxique du français : des constituants aux dépendances, 16e Conférence sur le Traitement Automatique des Langues Naturelles-TALN 2009, 2009.

M. Candito, J. Nivre, P. Denis, and E. H. Anguiano, Benchmarking of statistical dependency parsers for french, Proceedings of the 23rd International Conference on Computational Linguistics : Posters, pp.108-116, 2010.
URL : https://hal.archives-ouvertes.fr/hal-00514815

C. Cerisara, O. Mella, and D. Fohr, Jtrans, an open-source software for semi-automatic text-to-speech alignment, Proceedings of the 10th Annual Conference of the International Speech Communication Association-Interspeech, 2009.

J. Chevelu, G. Lecorvé, and D. Lolive, ROOTS : a toolkit for easy, fast and consistent processing of large sequential annotated data collections, Language Resources and Evaluation Conference (LREC), 2014.

L. Danlos, B. Sagot, and R. Stern, Analyse discursive des incises de citation, Congrès Mondial de Linguistique Française-CMLF 2010, p.2, 2010.

D. Doukhan, A. Rilliard, S. Rosset, M. Adda-Decker, and C. d'Alessandro, Prosodic Analysis of a Corpus of Tales, International Speech Communication Association (ISCA), pp.3129-3132, 2011.

M. R. and A. , Prosodic analysis of storytelling discourse modes and narrative situations oriented to text-to-speech synthesis, Eighth ISCA Workshop on Speech Synthesis, 2013.

C. Schöch, D. Schlör, S. Popp, A. Brunner, U. Henny, and J. Calvo Tello, Straight talk ! automatic recognition of direct speech in nineteenth-century french novels, Digital Humanities 2016 : Conference Abstracts, pp.346-353, 2016.

A. Sini, D. Lolive, G. Vidal, M. Tahon, and É. Delais-Roussarie, Synpaflex-corpus : An expressive french audiobooks corpus dedicated to expressive speech synthesis, Language Resources and Evaluation Conference (LREC), 2018.

A. Abdelali, K. Darwish, N. Durrani, and H. Mubarak, Farasa : A fast and furious segmenter for arabic, The 2016 Conference of the North American Chapter of the Association for Computational Linguistics : Human Language Technologies, pp.11-16, 2016.

A. J. , Sentiment classification at the time of the tunisian uprising : Machine learning techniques applied to a new corpus for arabic language, Proceedings of the 2014 European Network Intelligence Conference, ENIC '14, pp.38-45, 2014.

R. Duwairi and M. El-Orfali, A study of the effects of preprocessing strategies on sentiment analysis for arabic text, J. Information Science, vol.40, issue.4, pp.501-513, 2014.

E. R. Kalamawy and M. E. Soliman-a, Niletmrg at semeval-2017 task 4 : Arabic sentiment analysis, Proceedings of the 11th International Workshop on Semantic Evaluation, pp.790-795, 2017.

G. M. , Character-aware neural networks for arabic named entity recognition for social media, Proceedings of the 6th Workshop on South and Southeast Asian Natural Language Processing (WSSANLP2016), pp.23-32, 2016.

L. S. Larkey, L. Ballesteros, and M. E. Connell, Improving stemming for arabic information retrieval : light stemming and co-occurrence analysis, SIGIR 2002 : Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp.275-282, 2002.

M. S. Bougares, F. , and E. , Sentiment analysis of tunisian dialects : Linguistic ressources and experiments, Proceedings of the Third Arabic Natural Language Processing Workshop, pp.55-61, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01592418

H. Mulki, H. Haddad, C. Bechikh Ali, and I. Babaoğlu, Tunisian dialect sentiment analysis : A natural language processing-based approach, Proceedings of the 11th International Workshop on Semantic Evaluation, pp.664-669, 2017.

P. R. and D. M. Singh-v, Analytical mapping of opinion mining and sentiment analysis research during 2000-2015, Inf. Process. Manage, vol.53, issue.1, pp.122-150, 2017.

Y. Samih, M. Attia, M. Eldesouki, A. Abdelali, H. Mubarak, L. Kallmeyer, and K. Darwish, A neural architecture for dialectal arabic segmentation, Proceedings of the Third Arabic Natural Language Processing Workshop, WANLP 2017@EACL, pp.46-54, 2017.

S. K. Liwicki, M. &. Ingold-r, and . Bui-m, Tunisian dialect and modern standard arabic dataset for sentiment analysis : Tunisian election context, Second International Conference on Arabic Computational Linguistics, ACLING 2016, pp.35-53, 2016.

R. Bawden, R. Sennrich, A. Birch, and B. Haddow, Evaluating Discourse Phenomena in Neural Machine Translation, Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018.

M. Carpuat and D. Wu, Context-Dependent Phrasal Translation Lexicons for Statistical Machine Translation, Proceedings of the 11th Machine Translation Summit, pp.73-80, 2007.

G. L. , Incorporating Pronoun Function into Statistical Machine Translation, 2016.

L. Guillou, C. Hardmeier, P. Nakov, S. Stymne, J. Tiedemann, Y. Versley et al., Findings of the 2016 WMT Shared Task on Cross-lingual Pronoun Prediction, Proceedings of the 1st Conference on Machine Translation, WMT'16, pp.525-542, 2016.

H. C. , Discourse in statistical machine translation. a survey and a case study, p.11, 2012.

C. Hardmeier and M. Federico, Modelling pronominal anaphora in statistical machine translation, Proceedings of the 7th International Workshop on Spoken Language Translation, IWSLT'10, pp.283-289, 2010.

C. Hardmeier, P. Nakov, S. Stymne, J. Tiedemann, Y. Versley, and M. Cettolo, Pronoun-Focused MT and Cross-Lingual Pronoun Prediction: Findings of the 2015 DiscoMT Shared Task on Pronoun Translation, Proceedings of the 2nd Workshop on Discourse in Machine Translation, DISCOMT'15, pp.1-16, 2015.

P. Isabelle, C. Cherry, and G. Foster, A Challenge Set Approach to Evaluating Machine Translation, Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, vol.17, pp.2476-2486, 2017.

S. Jean, S. Lauly, O. Firat, and K. Cho, Does Neural Machine Translation Benefit from Larger Context?, 2017.

E. Lefever and V. Hoste, SemEval-2013 Task 10: Cross-lingual Word Sense Disambiguation, Proceedings of the 2nd Joint Conference on Lexical and Computational Semantics, pp.158-166, 2013.

L. , Attention Strategies for Multi-Source Sequence-to-Sequence Learning, Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, ACL'17, pp.196-202, 2017.

L. Sanchez and S. , Pronominal anaphora and verbal tenses in machine translation, 2017.

S. Loáiciga, S. Stymne, P. Nakov, C. Hardmeier, J. Tiedemann et al., Findings of the 2017 DiscoMT Shared Task on Cross-lingual Pronoun Prediction, Proceedings of the 3rd Workshop on Discourse in Machine Translation, DISCOMT'17, pp.1-16, 2017.

O. Melamud, J. Goldberger, and I. Dagan, context2vec: Learning Generic Context Embedding with Bidirectional LSTM, Proceedings of the 20th SIGNLL Conference on Computational Natural Language Learning, CoNLL'16, pp.51-61, 2016.

M. , Machine Translation of Labeled Discourse Connectives, Proceedings of the 10th Biennial Conference of the Association for Machine Translation in the Americas, AMTA'12, pp.129-138, 2012.

T. Meyer and B. Webber, Implicitation of Discourse Connectives in (Machine) Translation, Proceedings of the 1st Workshop on Discourse in Machine Translation, DISCOMT'13, pp.19-26, 2013.

T. Mikolov, I. Sutskever, K. Chen, G. Corrado, and J. Dean, Distributed representations of words and phrases and their compositionality, Proceedings of the 26th Annual Conference on Neural Information Processing Systems, NIPS'13, pp.3111-3119, 2013.

K. Papineni, S. Roukos, T. Ward, and W.-J. Zhu, BLEU: A Method for Automatic Evaluation of Machine Translation, Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, ACL'02, pp.311-318, 2002.

J. Pennington, R. Socher, and C. D. Manning, GloVe: Global Vectors for Word Representation, Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, vol.14, pp.1532-1543, 2014.

A. Rios Gonzales, L. Mascarell, and R. Sennrich, Improving Word Sense Disambiguation in Neural Machine Translation with Sense Embeddings, Proceedings of the 2nd Conference on Machine Translation, WMT'17, pp.11-19, 2017.

S. R. , How Grammatical is Character-level Neural Machine Translation?, Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, EACL'17, pp.376-382, 2017.

L. Wang, Z. Tu, A. Way, and Q. Liu, Exploiting Cross-Sentence Context for Neural Machine Translation, Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, vol.17, pp.2816-2821, 2017.

, syntactic and semantic annotations of the clinical narrative, Journal of the American Medical Informatics Association, vol.20, issue.5, pp.922-930

W. W. Chapman, W. Bridewell, P. Hanbury, G. F. Cooper, and B. G. Buchanan, A simple algorithm for identifying negated findings and diseases in discharge summaries, Journal of Biomedical Informatics, vol.34, issue.5, 2001.

D. C. , Détection de l'incertitude et de la négation : un état de l'art, 19es REncontres jeunes Chercheurs en Informatique pour le TAL, pp.94-107, 2017.

C. Dalloux, V. Claveau, and N. Grabar, Détection de la négation : corpus français et apprentissage supervisé, SIIM 2017-Symposium sur l'Ingénierie de l'Information Médicale, pp.1-8, 2017.

D. , Detecting negation of medical problems in french clinical notes, Proceedings of the 2nd ACM SIGHIT International Health Informatics Symposium, 2012.

F. F. Lopez-a, H. Webber-b.-;-harkema, J. N. Dowling, and T. T. Chapman-w, Context : an algorithm for determining negation, experiencer, and temporal status from clinical reports, Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, vol.1, pp.839-851, 2009.

S. Hochreiter and J. Schmidhuber, Long short-term memory, Neural Comput, vol.9, issue.8, 1997.

J. Lafferty, A. McCallum, and F. Pereira, Conditional random fields : Probabilistic models for segmenting and labeling sequence data, Proceedings of the eighteenth international conference on machine learning, ICML, vol.1, 2001.

M. P. Deshpande-a.-&-nadkarni-p, Use of general-purpose negation detection to augment concept indexing of medical documents : a quantitative study using the umls, Journal of the American Medical Informatics Association : JAMIA, vol.8, issue.6, 2001.

D. Q. Nguyen, D. Q. Nguyen, D. D. Pham, and S. B. Pham, A robust transformation-based learning approach using ripple down rules for part-of-speech tagging, AI Communications, vol.29, issue.3, pp.409-422, 2015.

W. Packard, E. M. Bender, J. Read, S. Oepen, and R. Dridan, Simple Negation Scope Resolution through Deep Parsing : A Semantic Solution to a Semantic Problem, ACL, 2014.

Y. Peng, X. Wang, L. Lu, M. Bagheri, R. Summers, and Z. Lu, Negbio : a high-performance tool for negation and uncertainty detection in radiology reports, AMIA 2018 Informatics Summit, 2018.

J. Read, E. Velldal, L. Øvrelid, and S. Oepen, Uio 1 : Constituent-based discriminative ranking for negation resolution, Proceedings of the First Joint Conference on Lexical and Computational Semantics, vol.1, 2012.

H. Schmid, Probabilistic part-of-speech tagging using decision trees, Proceedings of International Conference on New Methods in Language Processing, p.45, 1994.

Ö. Uzuner, B. R. South, S. Shen, and S. L. DuVall, 2010 i2b2/va challenge on concepts, assertions, and relations in clinical text, Journal of the American Medical Informatics Association, vol.18, issue.5, pp.552-556, 2011.

B. , Pragmatic neural language modelling in machine translation, Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics : Human Language Technologies, pp.820-829, 2015.

Y. Bengio, R. Ducharme, and P. Vincent, A neural probabilistic language model, 2001.

B. , Quick training of probabilistic neural nets by importance sampling, Proceedings of the conference on Artificial Intelligence and Statistics (AISTATS), 2003.

B. , Adaptive importance sampling to accelerate training of a neural probabilistic language model, IEEE Trans. Neural Networks, vol.19, issue.4, pp.713-722, 2008.

C. C. Mikolov, T. Schuster, M. Ge, Q. Brants, T. Koehn et al., One billion word benchmark for measuring progress in statistical language modeling, INTERSPEECH 2014, 15th Annual Conference of the International Speech Communication Association, pp.2635-2639, 2014.

W. Chen, D. Grangier, and M. Auli, Strategies for training large vocabulary neural language models, Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, vol.1, pp.1975-1985, 2016.

C. X. Liu-x, M. J. Gales, and . C. Woodland-p, Recurrent neural network language model training with noise contrastive estimation for speech recognition, ICASSP, pp.5411-5415, 2015.

J. Devlin, R. Zbib, Z. Huang, T. Lamar, R. Schwartz, and J. Makhoul, Fast and robust neural network joint models for statistical machine translation, Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, vol.1, pp.1370-1380, 2014.

D. C. , Notes on noise contrastive estimation and negative sampling, CoRR, abs/1410.8251, 2014.

G. M. Hyvärinen-a, Noise-contrastive estimation : A new estimation principle for unnormalized statistical models, Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, AISTATS 2010, pp.297-304, 2010.

G. M. Hyvärinen-a, Noise-contrastive estimation of unnormalized statistical models, with applications to natural image statistics, Journal of Machine Learning Research, vol.13, pp.307-361, 2012.

G. M. Hyvärinen-a, Estimation of unnormalized statistical models without numerical integration, Proceedings of the Workshop on Information Theoretic Methods in Science and Engineering, 2013.

S. Jean, K. Cho, R. Memisevic, and Y. Bengio, On using very large target vocabulary for neural machine translation, Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, vol.1, pp.1-10, 2015.

J. S. Vishwanathan-s, N. Satish, . J. Anderson-m, and . Dubey-p, Blackout : Speeding up recurrent neural network language models with very large vocabularies, 2015.

R. Jozefowicz, O. Vinyals, M. Schuster, N. Shazeer, and Y. Wu, Exploring the limits of language modeling, 2016.

D. Kingma and J. Ba, Adam : A method for stochastic optimization, 2014.

L. Oparin-i, A. Allauzen, and G. , Structured output layer neural network language model, pp.5524-5527, 2011.

M. O. Dagan-i.-&-goldberger-j, PMI matrix approximations with applications to neural language modeling, 2016.

M. O. Dagan-i.-&-goldberger-j, A simple language model based on pmi matrix approximations, Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, pp.1861-1866, 2017.

T. Mikolov, M. Karafiát, L. Burget, J. Černocký, and S. Khudanpur, Recurrent neural network based language model, INTERSPEECH 2010, 11th Annual Conference of the International Speech Communication Association, pp.1045-1048, 2010.

A. Mnih and G. Hinton, A scalable hierarchical distributed language model, Advances in Neural Information Processing Systems, vol.21, pp.1081-1088, 2009.

M. A. Teh-y, A fast and simple algorithm for training neural probabilistic language models, Proceedings of the 29th International Conference on Machine Learning, ICML 2012, 2012.

M. , Hierarchical probabilistic neural network language model, Proceedings of the Tenth International Workshop on Artificial Intelligence and Statistics, pp.246-252, 2005.

H. Schwenk, Continuous space language models, Comput. Speech Lang, vol.21, issue.3, pp.492-518, 2007.
DOI : 10.1016/j.csl.2006.09.003
URL : https://hal.archives-ouvertes.fr/hal-01454941

V. A. Zhao, Y. , and F. V. Chiang-d, Decoding with large-scale neural language models improves translation, Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, pp.1387-1392, 2013.

Z. B. Vaswani, A. , and M. J. Knight-k, Simple, fast noise-contrastive estimation for large rnn vocabularies, Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics : Human Language Technologies, pp.1217-1222, 2016.

D. M. Blei, A. Y. Ng, and M. I. Jordan, Latent dirichlet allocation, J. Mach. Learn. Res, vol.3, pp.993-1022, 2003.

C. J. , Unleashing the killer corpus : experiences in creating the multi-everything AMI meeting corpus, Language Resources and Evaluation, vol.41, pp.181-190, 2007.

F. R. Frampton, M. Ehlen, P. , and P. M. Peters-s, Modelling and detecting decisions in multi-party dialogue, Proceedings of the 9th SIGdial Workshop on Discourse and Dialogue, SIGdial '08, pp.156-163, 2008.

G. M. Mckeown-k and F. , Discourse segmentation of multi-party conversation, Proceedings of the 41st Annual Meeting on Association for Computational Linguistics, vol.1, 2003.

G. M. and C. A. Armstrong-s, Exploiting structural meeting-specific features for topic segmentation, TALN/RECITAL, pp.15-24, 2007.

H. Q. and C. , Analyzing feature trajectories for event detection, Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '07, pp.207-214, 2007.

D. D. Lee and H. S. Seung, Learning the parts of objects by nonnegative matrix factorization, Nature, vol.401, pp.788-791, 1999.

M. Purver, J. Dowding, J. Niekrasz, P. Ehlen, S. Noorbaloochi, and S. Peters, Detecting and summarizing action items in multi-party dialogue, Proc. of the 8th SIGdial Workshop on Discourse and Dialogue, 2007.

P. M. Griffiths-t, . P. Körding-k, and J. B. Tenenbaum, Identifying relevant phrases to summarize decisions in spoken meetings, Proceedings of the 21st International Conference on Computational Linguistics and the 44th Annual Meeting of the Association for Computational Linguistics, ACL-44, pp.45-50, 2006.

R. K. and F. , Packing the Meeting Summarization Knapsack, 2008.

S. H. Hurst-m.-&-maykov-a, Event detection and tracking in social streams, Proceedings of the International Conference on Weblogs and Social Media, 2009.

T. G. Stolcke, A. Voss-l, J. Dowding, . Favre-b, . Fernandez-r et al., The calo meeting speech recognition and understanding system, IEEE Spoken Language Technology Workshop, pp.69-72, 2008.
URL : https://hal.archives-ouvertes.fr/hal-01194293

T. G. Stolcke, A. Voss-l, . Peters-s, . Hakkani-tur-d, J. Dowding et al., The calo meeting assistant system, IEEE Transactions on Audio, Speech, and Language Processing, vol.18, pp.1601-1611, 2010.
URL : https://hal.archives-ouvertes.fr/hal-01194269

P. Auer, Code-Switching in Conversation : Language, Interaction and Identity, 1998.

B. E. Jelinek-f, J. Lafferty, D. M. Magerman, and . Mercer-r.-&-roukos-s, Towards history-based grammars : Using richer models for probabilistic parsing, Proceedings of the Workshop on Speech and Natural Language, HLT'91, pp.134-139, 1992.

C. M. , Discriminative training methods for hidden markov models : Theory and experiments with perceptron algorithms, Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing, pp.1-8, 2002.

I. L. and W. D. De-bot-k, Multidisciplinary Approaches to Code Switching, 2009.

M. Lui and T. Baldwin, langid.py : An off-the-shelf language identification tool, Proceedings of the ACL 2012 System Demonstrations, pp.25-30, 2012.

M. G. Alghamdi, F. Ghoneim, M. Hawwari, A. Rey-villamizar-n, and D. M. Solorio-t, Overview for the second shared task on language identification in code-switched data, Proceedings of the Second Workshop on Computational Approaches to Code Switching, pp.40-49, 2016.

M. , Duelling Languages : Grammatical Structure in Codeswitching, 1997.

J. Nivre, Ž. Agić et al., Universal dependencies 2.1, LINDAT/CLARIN digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, 2017.

R. , Jee haan, i'd like both, por favor : Elicitation of a code-switched corpus of hindi-english and spanish-english human-machine dialog, 18th Annual Conference of the International Speech Communication Association, pp.47-51, 2017.

S. , Crowdsourcing universal part-of-speech tags for code-switching, 18th Annual Conference of the International Speech Communication Association, pp.77-81, 2017.

M. Straka, J. Hajič, and J. Straková, UDPipe : trainable pipeline for processing CoNLL-U files performing tokenization, morphological analysis, POS tagging and parsing, Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016), 2016.

T. Y. Miyao and Y. &. , Learning with lookahead : Can history-based models rival globally optimized models ?, Proceedings of the Fifteenth Conference on Computational Natural Language Learning, CoNLL'11, pp.238-246, 2011.

G. Wisniewski, N. Pécheux, S. Gahbiche-Braham, and F. Yvon, Cross-lingual part-of-speech tagging through ambiguous learning, Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp.1779-1785, 2014.

A. B. and G. C. Shen-r.-&-kabadjov-m, Agile corpus annotation in practice : an overview of manual and automatic annotation of CVs, Proc of LAW, pp.29-37, 2010.

B. P. Bigi-b, L. Prevot, R. S. Seinturier, and J. , Annotation schemes, annotation tools and the question of interoperability : from typed feature structures to XML schemas, Proc of International Conference on Global Interoperability for Language Resource, 2010.

D. S. Biswas and P. Choudhury-m.-&-bali-k, Complex linguistic annotation-no easy way out ! a case from Bangla and Hindi POS labeling tasks, Proc of LAW, pp.10-18, 2009.

D. F. Lee and J. Y. Szolovits-p, NeuroNER : an easy-to-use program for named-entity recognition based on neural networks, Proc of EMNLP, 2017.

F. , Influence of pre-annotation on POS-tagged corpus development, Proc of LAW, pp.56-63, 2010.

G. K. Pereira-f, M. Mandel, and C. S. White-p, Semi-automated named entity annotation, Proc of LAW, pp.53-56, 2007.

G. C. , Controlled propagation of concept annotations in textual corpora, Proc of LREC, 2016.

L. , Banner : an executable survey of advances in biomedical named entity recognition, Proc of Pacific Symposium on Biocomputing, vol.8, pp.275-281, 1993.

M. S. Burga, A. &. Ferraro-g, and . Wanner-l, How does the granularity of an annotation scheme influence dependency parsing performance ?, Proc of COLING, posters, pp.839-852, 2012.

J. Pennington, R. Socher, and C. D. Manning, GloVe : global vectors for word representation, Proc of EMNLP, vol.12, pp.1532-1543, 2014.

S. T. Zukerman-i, A. J. Yepes, and C. L. Verspoor-k, Impact of corpus diversity and complexity on NER performance, Proc of Australasian Language Technology Association Workshop, pp.91-95, 2013.

. Stubbs-a and . Kotfila-c.-&-uzuner-o, Automated systems for the de-identification of longitudinal clinical narratives : Overview of 2014 i2b2/UTHealth shared task track 1, J Biomed Inform, vol.58, pp.11-19, 2015.

V. A. , Improving corpus annotation productivity : a method and experiment with interactive tagging, Proc of LREC, pp.2097-2102, 2012.

W. A. Thomas-j, Semantic annotation, Corpus Annotation, pp.53-65, 1997.

Y. S. De-castilho-r and G. I. Biemann-c, Automatic annotation suggestions and custom annotation layers in WebAnno, Proc of ACL, System Demonstrations, pp.91-96, 2014.

A. , Investigation of the relationship between product involvement and brand commitment, 2012.

A. R. and K. S. , Applying support vector machines to imbalanced datasets, Machine learning : ECML, pp.39-50, 2004.

. E. Batista-g, . L. Bazzan-a, and . C. Monard-m, Balancing training data for automated annotation of keywords : a case study, WOB, pp.10-18, 2003.

. E. Batista-g, . C. Prati-r, and . C. Monard-m, A study of the behavior of several methods for balancing machine learning training data, ACM Sigkdd Explorations Newsletter, vol.6, issue.1, pp.20-29, 2004.

J. Bergstra and Y. Bengio, Random search for hyper-parameter optimization, Journal of Machine Learning Research, vol.13, pp.281-305, 2012.

G. V. , An overview of classification algorithms for imbalanced datasets, International Journal of Emerging Technology and Advanced Engineering, vol.2, issue.4, pp.42-47, 2012.

H. A. , Sentiment analysis with recurrent neural network and unsupervised neural language model, 2017.

H. H. Bai, Y. , and G. E. Li-s, Adasyn : Adaptive synthetic sampling approach for imbalanced learning, IEEE International Joint Conference on, pp.1322-1328, 2008.

H. M. , Research frontiers in marketing : Dialogues and directions, pp.184-187, 1978.

B. Krawczyk, M. Woźniak, and G. Schaefer, Cost-sensitive decision tree ensembles for effective imbalanced classification, Applied Soft Computing, vol.14, pp.554-562, 2014.

L. P. Zuidema-w, Compositional distributional semantics with long short term memory, 2015.

L. M. Patin-g, Détecter les intentions d'achat dans les forums de discussion du domaine automobile : une approche robuste à l ?, 2014.

L. S. , W. Z. Zhou-g, and . Y. Lee-s, Semi-supervised learning for imbalanced sentiment classification, IJCAI proceedings-international joint conference on artificial intelligence, vol.22, p.1826, 2011.

O. O. , Knowing your customers to serve them better : Enduring involvement approach, Global Research Journal of Business Management, vol.2, issue.2, pp.5-14, 2014.

S. B. Wijayanto, H. &. Notodiputro-k, and . Sartono-b, Synthetic over sampling methods for handling class imbalanced problems : A review, IOP Conference Series : Earth and Environmental Science, vol.58, p.12031, 2017.

S. J. and R. D. Dias-l, Consumer behavior today, 2014.

Z. Su, H. Xu, D. Zhang, and Y. Xu, Chinese sentiment classification using a neural network tool – word2vec, Multisensor Fusion and Information Integration for Intelligent Systems (MFI), 2014 International Conference on, pp.1-6, 2014.

. M. Ting-k, An instance-weighting method to induce cost-sensitive trees, IEEE Transactions on Knowledge and Data Engineering, vol.14, issue.3, 2002.

V. , Conceptualisation et mesure de l'implication, Recherche et Applications en Marketing, vol.4, issue.1, pp.57-78, 1989.

. M. Weiss-g, Mining with rarity : a unifying framework, ACM Sigkdd Explorations Newsletter, vol.6, issue.1, pp.7-19, 2004.

. M. Weiss-g and . Mccarthy-k.-&-zabar-b, Cost-sensitive learning vs. sampling : Which is best for handling unbalanced classes with unequal error costs ?, vol.7, pp.35-41, 2007.

Z. C. , W. G. Zhou, and Y. &. , A new approach for imbalanced data classification based on minimize loss learning, 2017 IEEE Second International Conference on, pp.82-87, 2017.

D. J. Arya, E. H. Hiebert, and P. D. Pearson, The effects of syntactic and lexical complexity on the comprehension of elementary science texts, International Electronic Journal of Elementary Education, vol.4, issue.1, p.107, 2011.

P. Bojanowski, E. Grave, A. Joulin, and T. Mikolov, Enriching word vectors with subword information, 2016.

B. J. Tsang-v and J. D. Shein-f.-&-hirst-g, Building readability lexicons with unannotated corpora, Proceedings of the First Workshop on Predicting and Improving Text Readability for target reader populations, pp.33-39, 2012.

Conseil de l'Europe, Cadre européen commun de référence pour les langues, 2001.

D. , Towards automatic lexical simplification in spanish : an empirical study, Proceedings of the First Workshop on Predicting and Improving Text Readability for target reader populations, pp.8-16, 2012.

T. François, M. Billami, N. Gala, and D. Bernhard, Bleu, contusion, ecchymose : tri automatique de synonymes en fonction de leur difficulté de lecture et compréhension, JEP-TALN-RECITAL 2016, vol.2, pp.15-28, 2016.

T. François, N. Gala, P. Watrin, and C. Fairon, FLELex : a graded lexical resource for french foreign learners, LREC, pp.3766-3773, 2014.

N. Gala, T. François, D. Bernhard, and C. Fairon, A model to predict lexical complexity and to grade words (un modèle pour prédire la complexité lexicale et graduer les mots), Proceedings of TALN 2014, vol.1, pp.91-102, 2014.

S. K. Jauhar and L. Specia, Uow-shef : Simplex-lexical simplicity ranking based on contextual and psycholinguistic features, Proceedings of the First Joint Conference on Lexical and Computational Semantics, vol.1, pp.477-481, 2012.

K. P. and L. , Statistical estimation of word acquisition with application to readability prediction, Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, vol.2, pp.900-909, 2009.

L. B. Sprenger-charolles-l.-&-colé-p, MANULEX : A grade-level lexical database from french elementary school readers, Behavior Research Methods, Instruments, & Computers, vol.36, issue.1, pp.156-166, 2004.

N. B. Pallier, C. , and F. L. Matos-r, Une base de données lexicales du français contemporain sur internet : LEXIQUE TM //a lexical database for contemporary french : LEXIQUE TM. L'année psychologique, vol.101, pp.447-462, 2001.

R. L. Baeza-yates-r, . Dempere-marco-l, and . Saggion-h, Frequent words improve readability and short words improve understandability for people with dyslexia, IFIP Conference on Human-Computer Interaction, pp.203-219, 2013.

S. M. , A Comparison of Techniques to Automatically Identify Complex Words, 51st Annual Meeting of the Association for Computational Linguistics Proceedings of the Student Research Workshop, pp.103-109, 2013.

The transliteration problem has attracted the interest of specialists in several languages (Guellil, 2017). This recent interest is linked to the growing use of the Internet (Saâdane & Semmar, 2012). Three approaches are commonly cited in the literature for performing transliteration: 1) rule-based, 2) statistical, and 3) hybrid, combining the two.

[...] (Kaur & Singh; Habash, 2007) proposed a set of rules for converting Arabizi into Arabic. They reported a number of exceptions and challenges related to the handling of vowels. Rosca & Breuel (2016) took the statistical approach, presenting a neural-network model that performs transliteration between several language pairs, including Arabic and English.

(Guellil; Darwish, 2013)

All of these works follow the same general idea: generate a set of possible transliterations, called candidates, and then select the best candidate using a transliteration model or some other method. To that end, Darwish (2013) manually built a set of 3452 Arabizi words (extracted from Twitter) transliterated into Arabic. Part of this Arabizi-Arabic corpus was used in the work of [...] (Saâdane, 2013).
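The generate-then-select idea described above can be sketched as follows. This is only a minimal illustration under stated assumptions: the letter table `CHAR_MAP`, the helper names and the frequency-based selection are hypothetical and are not the rules or the model used in the cited works.

```python
from itertools import product

# Hypothetical, deliberately tiny Arabizi -> Arabic letter table
# (an assumption for illustration, not the rules of the cited works).
CHAR_MAP = {
    "b": ["ب"],
    "k": ["ك"],
    "r": ["ر"],
    "h": ["ه", "ح"],
    "t": ["ت", "ط"],
    "a": ["ا", ""],  # a vowel may be written or dropped
    "i": ["ي", ""],
}


def generate_candidates(word):
    """Return every Arabic spelling that CHAR_MAP allows for an Arabizi word."""
    options = [CHAR_MAP.get(ch, [ch]) for ch in word.lower()]
    return {"".join(parts) for parts in product(*options)}


def best_candidate(word, corpus_counts):
    """Pick the candidate seen most often in a reference Arabic corpus.

    Returns None when no candidate is attested in corpus_counts.
    """
    attested = [c for c in generate_candidates(word) if corpus_counts.get(c, 0) > 0]
    if not attested:
        return None
    # Ties on frequency are broken by string order to keep the result deterministic.
    return max(attested, key=lambda c: (corpus_counts[c], c))
```

Here `corpus_counts` stands in for word frequencies taken from an Arabic corpus; a real system would replace the frequency lookup with a proper transliteration or language model.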

Finally, the work of (Guellil, [...], 2013) [...]

Arabic transliteration is presented in the Habash-Soudi-Buckwalter (HSB) scheme (Habash, 2007).

Code switching: the presence of several languages or dialects within the same message. © ATALA, p.510, 2018.

Analysis of results and error cases

From Table 2, we observe that the size of the Arabic corpus influences the results: the larger the corpus, the better the results. For testing, we used three corpora containing respectively 50 [...] (2017) is 45.35%, whereas in our case it reaches 74.76% with simple search and the complete Arabic corpus (that is, 100% of it). We obtain a precision of 75.11% on our Test_300 corpus, which is the best result obtained. This is understandable, given that our approach is based on transliterating messages extracted from social media and that the transliteration goes from Arabizi to Arabic and not [...]

Nevertheless, on analysing the transliterated corpus, we identified the following errors: 1) Omission of certain vowels where they should appear. For example, the word 'bik' is transliterated as '[...]' instead of '[...]'. [...] 3) In some cases two transliterations are correct, and the choice depends on the context and meaning of the sentence. For example, the word 'raht' could be transliterated as '[...]' or as '[...]', and the word 'djabat' as '[...]' or as '[...]', depending on the meaning of the sentence. 4) Errors on words that take their meaning from French and are therefore, in most cases, not recognised by our corpus. For example, the word 'lafichage' is transliterated as '[...]' instead of '[...]', and the word 'elsemastar' as '[...]' instead of '[...]'. All of these errors have two main causes: 1) the failure to take into account [...]

References

M. Al-badrashiny, R. Eskander, N. Habash, and O. Rambow, Automatic Transliteration of Romanized Dialectal Arabic, 2014.

R. Cotterell, A. Renduchintala, N. Saphra, and C. Callison-Burch, An Algerian Arabic-French code-switched corpus, Paper presented at the Workshop on Free/Open-Source Arabic Corpora and Corpora Processing Tools Workshop Programme, 2014.

K. Darwish, Arabizi detection and conversion to Arabic, 2013.

I. Guellil and F. Azouaou, ASDA: Analyseur Syntaxique du Dialecte Algérien dans un but d'analyse sémantique, 2017.

I. Guellil, F. Azouaou, and M. Abbas, Comparison between Neural and Statistical translation after transliteration of Algerian Arabic Dialect, WiNLP: Women & Underrepresented Minorities in Natural Language Processing, 2017.

I. Guellil, F. Azouaou, M. Abbas, and S. Fatiha, Arabizi transliteration of Algerian Arabic dialect into Modern Standard Arabic. Paper presented at the Social MT 2017/First workshop on Social Media and User Generated Content Machine Translation, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01570289

I. Guellil and F. Azouaou, Neural Vs Statistical Translation of Algerian Arabic Dialect written with Arabizi and Arabic letter, The 31st Pacific Asia Conference on Language, Information and Computation PACLIC, 2017.

I. Guellil and F. Azouaou, Bilingual Lexicon for Algerian Arabic Dialect Treatment in Social Media, WiNLP: Women & Underrepresented Minorities in Natural Language Processing, 2017.

N. Habash, A. Soudi, and T. Buckwalter, On Arabic transliteration, Arabic Computational Morphology, pp.15-22, 2007.

G. S. Josan and G. S. Lehal, A Punjabi to Hindi Machine Transliteration System, Computational Linguistics and Chinese Language Processing, vol.15, pp.77-102, 2010.

K. Kaur and P. Singh, Review of Machine Transliteration Techniques, International Journal of Computer Applications, issue.20, p.107, 2014.

J. May, Y. Benjira, and A. Echihabi, An Arabizi-English social media statistical machine translation system, 2014.

K. Meftouh, S. Harrat, S. Jamoussi, M. Abbas, and K. Smaïli, Machine translation experiments on PADIC: A parallel Arabic dialect corpus, Pacific Asia Conference on Language, Information and Computation, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01261587

H. Saâdane and N. Habash, A conventional orthography for Algerian Arabic, Proceedings of the Second Workshop on Arabic Natural Language Processing, pp.69-79, 2015.

H. Saâdane and N. Semmar, Utilisation de la translittération arabe pour l'amélioration de l'alignement de mots à partir de corpus parallèles français-arabe (Using Arabic Transliteration to Improve Word Alignment from French-Arabic Parallel Corpora), Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, vol.2, 2012.

H. Saâdane, D. Nouvel, H. Seffih, and C. Fluhr, Une approche linguistique pour la détection des dialectes arabes, 2017.

M. Rosca and T. Breuel, Sequence-to-sequence neural network models for transliteration, 2016.

H. Saâdane, M. Guidere, and C. Fluhr, La reconnaissance automatique des dialectes arabes à l'écrit, Paper presented at the Colloque International Traduction et Champs Connexes, Quelle Place Pour La Langue Arabe Aujourd'hui, 2013.

M. van der Wees, A. Bisazza, and C. Monz, A Simple but Effective Approach to Improve Arabizi-to-English Statistical Machine Translation, pp.43-2018, 2016.

From a linguistic point of view, Van de Velde (2000: 38) notes that any noun denoting a material thing can fill the semantic role of place in an utterance. Blidon (2008: 4) recalls that "in a way, any object is geographic if its treatment is"; an object that is geolocated and compared with geographic objects on criteria of location, layout or classification can therefore be considered a place. Such a place (shopping area, pedestrian route, cycle path, concentration camp) or located object (bench, bin, hundred-year-old oak) is designated by a descriptive denomination, one that is neither stable nor unique [...] places and located objects useful to human activity.

This broadened definition of place is also dictated by the contexts in which the texts are produced and by the objectives of their analysis ([...], 2006). Since these contexts and objectives are specific to each situation of production and analysis, the elements that it is relevant to analyse as places or located objects vary from one situation to another. In the analysis of the life stories of Spanish Republicans, the various camps are important places for the analysis, both because of how often they are mentioned in the accounts and because of the events associated with them. For example, the camp at Argelès-sur-Mer has several designations, which vary with the speaker and the moment of the narrative and which reflect its different uses: concentration camp, internment camp, regrouping camp, the famous internment camp of the Spanish Republicans.

Identifying a place name in a text, together with the information attached to that place, often aims at representing this located information, that is, at defining the cartographic objects that correspond to the places and their properties in the text. Each cartographic object is characterised by its layout (implantation), its symbolisation and its position.

[...] (Olmedo; Santos, 2012), of films (Caquard & Fiset, 2013), of geopolitical analyses [...]

Identification of NPr and Nc places. Proper-noun (NPr) places were identified using purpose-built gazetteers and the ANNIE tool from GATE, which recognises and annotates occurrences of gazetteer entries in texts. The gazetteers were built from BDNyme (footnote 7), the toponym database of the IGN, and from the collaborative resource GeoNames (footnote 8), which provides both endonyms and exonyms, useful for CoRR, where places may be designated in French [...]
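Gazetteer-based annotation of this kind can be illustrated with a short sketch. It is not ANNIE's implementation or API: the function name, the greedy longest-match strategy and the `max_len` parameter are assumptions chosen for the example.

```python
def annotate_places(tokens, gazetteer, max_len=4):
    """Greedy longest-match lookup of gazetteer entries in a token list.

    Returns (start, end, surface) spans, with end exclusive.
    Matching is case-insensitive; entries may span several tokens.
    """
    entries = {tuple(entry.lower().split()) for entry in gazetteer}
    spans = []
    i = 0
    while i < len(tokens):
        match = None
        # Try the longest window first so that multi-word names win.
        for n in range(min(max_len, len(tokens) - i), 0, -1):
            window = tuple(t.lower() for t in tokens[i:i + n])
            if window in entries:
                match = (i, i + n, " ".join(tokens[i:i + n]))
                break
        if match:
            spans.append(match)
            i = match[1]
        else:
            i += 1
    return spans
```

For instance, with a gazetteer holding `Argelès-sur-Mer` and `Perpignan`, the token list of "Le camp situé à Argelès-sur-Mer près de Perpignan" yields one span per place name. A real gazetteer built from BDNyme or GeoNames would also need to handle the endonym and exonym variants of each entry.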

For common-noun (Nc) places, the identification method relies on machine learning from manually annotated corpus extracts. The machine learning tool chosen is the Stanford Named Entity Recognizer (NER) (footnote 9) (Finkel, 2005). The two corpora, CoRR and CoMP, were each split into a training corpus and a validation corpus, and the Stanford NER was trained on the training corpus before being integrated into a purpose-built GATE processing pipeline. A lexicon of generic words [...]

References

A. J., « De quoi Europe est-il le nom ? Enjeux et usages argumentatifs de la polyréférentialité », vol.17, 2016.

J. Bertin, Sémiologie graphique. Les diagrammes, les réseaux, les cartes, 1967.

D. Blank and A. Henrich, A Depth-First Branch-and-Bound Algorithm for Geocoding Historic Itinerary Tables, Proceedings of the 10th Workshop on Geographic Information Retrieval, 2016.

M. Blidon, « Jalons pour une géographie des homosexualités », L'Espace géographique, vol.2, pp.175-189, 2008.

C. Brando, C. Dominguès, and M. Capeyron, Evaluation of NER systems for the recognition of place mentions in French thematic corpora, Proceedings of the 10th Workshop on Geographic Information Retrieval, 2016.

B. L. and S. Laroche, Représenter l'ambiance sonore de la ville. Retours sur un atelier pratique, in Nicolas Rémy (dir.), Ambiances, tomorrow, Proceedings of 3rd International Congress on Ambiances, vol.1, pp.271-276, 2016.

R. Brunet, R. Ferras, and H. Théry, Les mots de la géographie, dictionnaire critique (première édition : 1992), Collection Dynamiques du territoire, Reclus, La Documentation française, 1993.

C. G. Tian and Y. , Towards Geo-referencing Infrastructure for Local News, Proceedings of the 10th Workshop on Geographic Information Retrieval, 2016.

C. , Nom propre et dénomination évènementielle : quelles différences en langue et en discours ? Corela, 2009.

S. Caquard and W. Cartwright, Narrative Cartography: From Mapping Stories to the Narrative of Maps and Mapping, The Cartographic Journal, vol.51, issue.2, pp.101-106, 2014.

S. Caquard and J.-P. Fiset, How can we map stories? A cybercartographic application for narrative cartography, Journal of Maps, vol.10, issue.1, pp.18-25, 2014.

G. Cislaru, Le nom de pays comme outil de représentation sociale, Mots. Les langages du politique, vol.86, 2008.

J. Dubois, M. Giacomo, L. Guespin, C. Marcellesi, J.-B. Marcellesi, and J.-P. Mével, Dictionnaire de linguistique et des sciences du langage, Larousse, 1994.

M. Ehrmann, Les entités nommées, de la linguistique au TAL : statut théorique et méthodes de désambiguïsation, 2008.

J. R. Finkel, T. Grenager, and C. Manning, Incorporating non-local information into information extraction systems by Gibbs sampling, Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics, pp.363-370, 2005.

M. Gaio, C. Sallaberry, and V. T. Nguyen, Typage de noms toponymiques à des fins d'indexation géographique, TAL, vol.53, issue.2, pp.143-176, 2012.

I. Hirt, Cartographies autochtones. Éléments pour une analyse critique, L'Espace géographique, pp.171-186, 2009.

K. Jonasson, Le nom propre. Constructions et interprétations, 1994.

G. Kleiber, Noms propres et noms communs : un problème de dénomination, Meta, vol.41, issue.4, pp.567-589, 1996.

S. Lardon and V. Piveteau, « Méthodologie de diagnostic pour le projet de territoire : une approche par les modèles spatiaux », Géocarrefour, vol.80, 2005.

J. L. Leidner and M. D. Lieberman, Detecting geographical references in the form of place names and associated spatial natural language, SIGSPATIAL Special, vol.3, issue.2, pp.5-11, 2011.

S. Leroy, Le nom propre en français, L'essentiel français, 2004.

M. S. , Les Récits Migratoires Sont-Ils Encore Possibles Dans le Domaine Des Refugee Studies? Analyse Critique et Expérimentation de Cartographies Créatives, ACME: An International Journal for Critical Geographies, p.7, 2016.

M. S. and A. , Cartographies traverses, des espaces où l'on ne finit jamais d'arriver, 2016.

M. J. , Langages, p.66, 1982.

S. Mustière, N. Abadie, N. Aussenac-Gilles, M.-N. Bessagnet, M. Kamel, E. Kergosien et al., GéOnto: Enrichissement d'une taxonomie de concepts topographiques, Proceedings of Spatial Analysis and GEOmatics (SAGEO), 2009.

D. Nouvel, M. Ehrmann, and S. Rosset, Les entités nommées pour le traitement automatique des langues, 2015.

O. E. , Femmes de Marrakech. Pour une cartographie émotionnelle des récits des femmes de Sidi Youssef Ben Ali, Marrakech, Maroc. », in FOURNIER M, 2016.

R. Panckhurst, « Le discours électronique médié : bilan et perspectives », in A. Piolat (Éd.), Lire, écrire, communiquer et apprendre avec Internet, pp.345-366, 2006.

P. D. Fattori and F. Holzinger-f, Journées d'études "Comment cartographier les récits documentaires et fictionnels ?, pp.16-17, 2012.

M.-A. Paveau, Le toponyme, désignateur souple et organisateur mémoriel. L'exemple du nom de bataille, Mots. Les langages du politique, vol.86, 2008.

R. S. Purves and C. Derungs, From Space to Place: Place-Based Explorations of Text, International Journal of Humanities and Arts Computing, vol.9, pp.74-94, 2015.

R. J. , Ce qui fait lieu. Vers une éthique chorographique. Thèse en aménagement de l'espace et urbanisme. École doctorale Ville, 2017.

F. Récanati, La sémantique des noms propres : remarques sur la notion de « désignateur rigide », Langue française, vol.57, pp.106-118, 1983.

M. Rosemberg and F. Troin, Cartographie du Marseille d'un héros de roman policier (Total Khéops de J.-C. Izzo), M@ppemonde, vol.121, 2017.

C. de Runz, Imperfection, temps et espace : modélisation, analyse et visualisation dans un SIG archéologique, 2008.

R. Santos, P. Murrieta-Flores, and B. Martins, An Automated Approach for Geocoding Tabular Itineraries, Proceedings of the 11th Workshop on Geographic Information Retrieval, 2017.

D. E. Souza, D. A. Silva-e, and . Ahlers-d, Factorization Models for Spatiotemporal Retrieval, Proceedings of the 11th Workshop on Geographic Information Retrieval, 2017.

D. Van de Velde, Existe-t-il des noms propres de temps ?, Lexique 15 : Les noms propres : nature et détermination, Presses universitaires du Septentrion, 2000.

Y. L. Liu-x, L. M. , and P. P. Lu-f, A Holistic Framework of Geographical Semantic Web Aligning, Proceedings of the 10th Workshop on Geographic Information Retrieval, 2016.

References

M. Baroni, S. Bernardini, A. Ferraresi, and E. Zanchetta, The WaCky wide web: a collection of very large linguistically processed web-crawled corpora, Language Resources and Evaluation, 2009.

P. Bojanowski, E. Grave, A. Joulin, and T. Mikolov, Enriching word vectors with subword information, 2016.

J. Daiber, M. Jakob, C. Hokamp, and P. N. Mendes, Improving efficiency and accuracy in multilingual entity extraction, 9th International Conference on Semantic Systems (I-SEMANTICS), 2013.

DBpedia, 2007.

V. Franzoni, M. Mencacci, P. Mengoni, and A. Milani, Heuristics for semantic path search in Wikipedia, 14th International Conference on Computational Science and Its Applications (ICCSA), 2014.

Freebase, 2007.

GeoNames, 2006.

G. Gravier, G. Adda, N. Paulsson, M. Carré, A. Giraudel, and O. Galibert, The ETAPE corpus for the evaluation of speech-based TV content processing in the French language, 8th International Conference on Language Resources and Evaluation (LREC), 2012.
URL : https://hal.archives-ouvertes.fr/hal-00712591

B. Hachey, J. Nothman, and W. Radford, Cheap and easy entity evaluation, 52nd Annual Meeting of the Association for Computational Linguistics (ACL), 2014.

P.-S. Huang, X. He, J. Gao, L. Deng, A. Acero, and L. Heck, Learning deep structured semantic models for web search using clickthrough data, 22nd ACM International Conference on Information & Knowledge Management (CIKM), 2013.

M. Lafourcade, Jeux de mots, 2007.

M. Lafourcade, Making people play for Lexical Acquisition with the JeuxDeMots prototype, 7th International Symposium on Natural Language Processing (SNLP'07), 2007.

D. Liben-Nowell and J. Kleinberg, The link-prediction problem for social networks, Journal of the American Society for Information Science and Technology, 2007.

LinkedMDB, 2009.

L. Lü, C.-H. Jin, and T. Zhou, Similarity index based on local paths for link prediction of complex networks, Phys. Rev. E, 2009.

V. Lux-Pogodalla and A. Polguère, Construction of a French lexical network: Methodological issues, 2011.

T. Mikolov, I. Sutskever, K. Chen, G. S. Corrado, and J. Dean, Distributed representations of words and phrases and their compositionality, 2013.

A. Moro, A. Raganato, and R. Navigli, Entity linking meets word sense disambiguation: a unified approach, 2014.

MusicBrainz, 2000.

R. Navigli and S. P. Ponzetto, BabelNet: The automatic construction, evaluation and application of a wide-coverage multilingual semantic network, Artificial Intelligence, 2012.

A. G. Nuzzolese, A. L. Gentile, V. Presutti, A. Gangemi, D. Garigliotti, and R. Navigli, The 1st Open Knowledge Extraction Challenge, 12th European Semantic Web Conference (ESWC), 2015.

J. Plu, Knowledge Extraction in Web Media: At The Frontier of NLP, Machine Learning and Semantics, 25th World Wide Web Conference, 2016.

B. Sagot and D. Fišer, Building a free French wordnet from multilingual resources, Ontolex, 2008.

M. Song, G. Heo, and Y. Ding, SemPathFinder: Semantic path analysis for discovering publicly unknown knowledge, Journal of Informetrics, 2015.

R. Stern, B. Sagot, and F. Béchet, A joint named entity recognition and entity linking system, Proceedings of the Workshop on Innovative Hybrid Approaches to the Processing of Textual Data, 2012.

Wikidata, 2012.

Barbieri et al. (2017) obtained an F-score (harmonic mean) of 65% when predicting the 5 most used emojis in 40 million tweets using LSTMs (Hochreiter & Schmidhuber, 1997). Finally, we previously addressed the multi-label classification approach on corpora of private instant messages [...]

There are still few resources for emojis, most of them being in fact learned word-embedding models (Eisner, 2016). These have recently been studied with several approaches that differ in what the embedding is built from, for instance by considering only the Unicode descriptions of the emojis as a group of metadata [...]

This work is therefore close to that of Pohl et al. (2017), except that we propose a methodology for obtaining groups within a specific type of [...]

Vector representation of emojis and word embeddings

According to emoji usage metrics on Twitter (footnote 4), the most used emojis are those representing emotions or feelings, such as [...]. We therefore seek to verify whether the use of facial emojis implicitly follows an existing categorisation of facial expressions. To do so, we observe the use of 63 facial emojis that resemble the human face, thereby excluding cats, demons, aliens and others. These 63 emojis were taken from three emoji classes in the Unicode classification: face neutral, face positive and face negative.

To obtain a finer-grained grouping of these emojis, we build word embeddings of emojis (so-called emoji embeddings) on a corpus of tweets.

Corpus of tweets. Our corpus consists of 695,031 tweets from the North American continent on all topics, collected using the Twitter streaming API (footnote 5). To ensure a monolingual corpus, all of these tweets were first filtered by a language detector based on NLTK's stop-word lists (footnote 6) and their ratio of [...]
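A stop-word-based language filter of the kind mentioned above might look like the sketch below. It is a rough illustration only: the small inline `STOPWORDS` sets stand in for NLTK's stop-word lists, and the `min_ratio` threshold is a hypothetical parameter, since the exact ratio criterion is not recoverable from the text.

```python
# Tiny stand-in stop-word sets (assumption); a real filter would load
# the full per-language lists shipped with NLTK.
STOPWORDS = {
    "english": {"the", "and", "is", "in", "to", "of", "a", "it", "that", "this"},
    "french": {"le", "la", "les", "et", "est", "de", "un", "une", "que", "des"},
    "spanish": {"el", "la", "los", "y", "es", "de", "un", "una", "que", "en"},
}


def stopword_ratios(text):
    """Share of tokens that are stop words, computed for each language."""
    tokens = text.lower().split()
    if not tokens:
        return {lang: 0.0 for lang in STOPWORDS}
    return {
        lang: sum(tok in words for tok in tokens) / len(tokens)
        for lang, words in STOPWORDS.items()
    }


def detect_language(text, min_ratio=0.2):
    """Return the language with the highest stop-word ratio, or None if too low."""
    ratios = stopword_ratios(text)
    lang = max(ratios, key=ratios.get)
    return lang if ratios[lang] >= min_ratio else None
```

A tweet would then be kept only when `detect_language(tweet)` returns the target language. Very short tweets or tweets made only of emojis carry no stop words, so they come back as `None` and need a separate decision.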

In our corpus we treat emojis as words like any other, although they are not affected by the lemmatisation performed with WordNet (Miller, 1995).

Vector representations. To represent the emojis we used Word2Vec (Mikolov, [...], 2010).
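To make concrete what treating emojis as ordinary words means for an embedding model, the sketch below builds (center, context) training pairs in which an emoji is just another token. It assumes the skip-gram variant of Word2Vec and a hypothetical window size; the text does not say which variant or settings were used.

```python
def skipgram_pairs(tokens, window=2):
    """Return the (center, context) pairs a skip-gram model would train on.

    Emojis are kept as plain tokens, so they receive contexts
    (and later vectors) exactly like words do.
    """
    pairs = []
    for i, center in enumerate(tokens):
        lo = max(0, i - window)
        hi = min(len(tokens), i + window + 1)
        pairs.extend((center, tokens[j]) for j in range(lo, hi) if j != i)
    return pairs
```

With `window=1`, the tokens `["so", "happy", "😂"]` produce pairs such as `("😂", "happy")`, which is how an emoji ends up close to the words it co-occurs with in the learned vector space.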

References

W. Ai, X. Lu, X. Liu, N. Wang, G. Huang, and Q. Mei, Untangling emoji popularity through semantic embeddings, ICWSM, pp.2-11, 2017.

F. Barbieri, M. Ballesteros, and H. Saggion, Are emojis predictable?, Proceedings of the 15th Conference of the European Chapter, vol.2, pp.105-111, 2017.

F. Barbieri, F. Ronzano, and H. Saggion, What does this emoji mean? A vector space skip-gram model for Twitter emojis, Language Resources and Evaluation Conference, 2016.

B. Eisner, T. Rocktäschel, I. Augenstein, M. Bošnjak, and S. Riedel, emoji2vec: Learning emoji representations from their description, 2016.

P. Ekman, Basic emotions, in T. [...], pp.45-60, 1999.

B. Felbo, A. Mislove, A. Søgaard, I. Rahwan, and S. Lehmann, Using millions of emoji occurrences to learn any-domain representations for detecting sentiment, emotion and sarcasm, 2017.

G. Guibon, M. Ochs, and P. Bellot, Prédiction automatique d'emojis sentimentaux, COnférence en Recherche d'Information et Applications (CORIA), 2017.

S. Hochreiter and J. Schmidhuber, Long short-term memory, Neural Computation, vol.9, issue.8, pp.1735-1780, 1997.

R. Jakobson, Closing statements: Linguistics and poetics, Style in Language, 1960.

C. Kelly, Do you know what I mean > :( : A linguistic study of the understanding of emoticons and emojis in text messages, 2015.

R. Kelly and L. Watts, Characterising the inventive appropriation of emoji as relationally meaningful in mediated close personal relationships, Experiences of Technology Appropriation: Unanticipated Users, Usage, Circumstances, and Design, 2015.

J. Li, M.-T. Luong, and D. Jurafsky, A hierarchical neural autoencoder for paragraphs and documents, 2015.

L. van der Maaten and G. Hinton, Visualizing data using t-SNE, Journal of Machine Learning Research, vol.9, pp.2579-2605, 2008.

J. MacQueen, Some methods for classification and analysis of multivariate observations, Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, vol.1, pp.281-297, 1967.

T. Mikolov, K. Chen, G. Corrado, and J. Dean, Efficient estimation of word representations in vector space, 2013.

T. Mikolov, I. Sutskever, K. Chen, G. S. Corrado, and J. Dean, Distributed representations of words and phrases and their compositionality, Advances in Neural Information Processing Systems, pp.3111-3119, 2013.

G. A. Miller, WordNet: a lexical database for English, Communications of the ACM, vol.38, issue.11, 1995.

A. Y. Ng, M. I. Jordan, and Y. Weiss, On spectral clustering: Analysis and an algorithm, Advances in Neural Information Processing Systems, pp.849-856, 2002.

U. Pavalanathan and J. Eisenstein, Emoticons vs. emojis on Twitter: A causal inference approach, 2015.

H. Pohl, C. Domin, and M. Rohs, Beyond just text: Semantic emoji similarity modeling to support expressive communication, ACM Transactions on Computer-Human Interaction (TOCHI), vol.24, issue.1, p.6, 2017.

R. Řehůřek and P. Sojka, Software framework for topic modelling with large corpora, Proceedings of the LREC 2010 Workshop on New Challenges for NLP Frameworks, 2010.

A. Rosenberg and J. Hirschberg, V-measure: A conditional entropy-based external cluster evaluation measure, Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, 2007.

S. Wijeratne, L. Balasuriya, A. Sheth, and D. Doran, EmojiNet: An open service and API for emoji sense discovery, ICWSM, pp.437-447, 2017.

R. Xie, Z. Liu, R. Yan, and M. Sun, Neural emoji recommendation in dialogue systems, 2016.

References

A. Balahur and M. Turchi, [...], Proceedings of the 3rd Workshop in Computational Approaches to Subjectivity and Sentiment Analysis, pp.52-60, 2012.

A. Balahur, M. Turchi, R. Steinberger, J. M. Perea-Ortega et al., Resource creation and evaluation for multilingual sentiment analysis in social media texts, [...].

C. Banea, R. Mihalcea, and J. Wiebe, A bootstrapping method for building subjectivity lexicons for languages with scarce resources, Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC-08), pp.8-1086, 2008.

C. Brun, J. Perez, and C. Roux, XRCE at SemEval-2016 Task 5: Feedbacked ensemble modeling on syntactico-semantic knowledge for aspect based sentiment analysis, Proceedings of the 10th International Workshop on Semantic Evaluation, SemEval@NAACL-HLT 2016, pp.277-281, 2016.

C. Dyer, V. Chahuneau, and N. A. Smith, A simple, fast, and effective reparameterization of IBM Model 2, Proceedings of NAACL, 2013.

G. Ganu, N. Elhadad, and A. Marian, Beyond the stars: Improving rating predictions using review text content, Proceedings of the 12th International Workshop on the Web and Databases, 2009.

U. Germann, Yawat: Yet another word alignment tool, Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Demo Session, HLT-Demonstrations '08, pp.20-23, 2008.

H. Hamdan, P. Bellot, and F. Béchet, Lsislif: CRF and logistic regression for opinion target extraction and sentiment polarity analysis, Proceedings of the 9th International Workshop on Semantic Evaluation, pp.753-758, 2015.

P. Isabelle, C. Cherry, and G. Foster, A challenge set approach to evaluating machine translation, 2017.

A. Jimeno Yepes, A. Névéol, M. Neves, K. Verspoor, O. Bojar et al., Findings of the WMT 2017 biomedical translation shared task, Proceedings of the Second Conference on Machine Translation, pp.234-247, 2017.

S. Kiritchenko, X. Zhu, C. Cherry, and S. Mohammad, NRC-Canada-2014: Detecting aspects and sentiment in customer reviews, Proceedings of the 8th International Workshop on Semantic Evaluation, pp.437-442, 2014.

A. Kumar, S. Kohail, A. Kumar, A. Ekbal, and C. Biemann, IIT-TUDA at SemEval-2016 Task 5: Beyond sentiment lexicon: Combining domain dependency and distributional semantics features for aspect based sentiment analysis, Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval-2016), pp.1129-1135, 2016.

J. Lafferty, A. McCallum, and F. C. N. Pereira, Conditional random fields: Probabilistic models for segmenting and labeling sequence data, Proceedings of the Eighteenth International Conference on Machine Learning, ICML '01, pp.282-289, 2001.

B. Liu, Synthesis Lectures on Human Language Technologies, 2012.

R. Mihalcea, C. Banea, and J. Wiebe, Learning multilingual subjective language via cross-lingual projections, Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, pp.976-983, 2007.

F. J. Och and H. Ney, A systematic comparison of various statistical alignment models, Computational Linguistics, vol.29, issue.1, pp.19-51, 2003.

M. Pontiki, D. Galanis, H. Papageorgiou, I. Androutsopoulos, S. Manandhar, M. Al-Smadi et al., SemEval-2016 Task 5: Aspect based sentiment analysis, Proceedings of the 10th International Workshop on Semantic Evaluation, SemEval '16, 2016.

M. Pontiki, D. Galanis, H. Papageorgiou, S. Manandhar, and I. Androutsopoulos, SemEval-2015 Task 12: Aspect based sentiment analysis, Proceedings of the 9th International Workshop on Semantic Evaluation, pp.486-495, 2015.

M. Pontiki, D. Galanis, J. Pavlopoulos, H. Papageorgiou, I. Androutsopoulos, and S. Manandhar, SemEval-2014 Task 4: Aspect based sentiment analysis, International Workshop on Semantic Evaluation (SemEval), 2014.

S. Ruder, P. Ghaffari, and J. G. Breslin, INSIGHT-1 at SemEval-2016 Task 5: Deep Learning for Multilingual Aspect-based Sentiment Analysis, 2016.

Z. Toh and W. Wang, DLIREC: Aspect term extraction and term polarity classification system, Proceedings of the 8th International Workshop on Semantic Evaluation, pp.235-240, 2014.

J. Wagner, P. Arora, S. Cortes, U. Barman, D. Bogdanova, J. Foster, and L. Tounsi, DCU: Aspect-based polarity classification for SemEval Task 4, Proceedings of the 8th International Workshop on Semantic Evaluation, pp.392-397, 2014.

X. Wan, Co-training for cross-lingual sentiment classification, Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, vol.1, pp.235-243, 2009.

G. Zhou, Z. Zeng, J. X. Huang, and T. He, Transfer learning for cross-lingual sentiment classification with weakly shared deep neural networks, Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '16, pp.245-254, 2016.

G. Angeli, M. J. Premkumar, and C. D. Manning, Leveraging linguistic structure for open domain information extraction, ACL

M. Banko, J. Cafarella, S. Soderland, M. Broadhead, and O. Etzioni, Open information extraction from the web, IJCAI

N. Béchet, P. Cellier, T. Charnois, and B. Crémilleux, Discovering linguistic patterns using sequence mining, CICLing

R. C. Bunescu and R. J. Mooney, A shortest path dependency kernel for relation extraction

L. Del Corro and R. Gemulla, ClausIE: Clause-based open information extraction, International Conference on World Wide Web

A. Fader, S. Soderland, and O. Etzioni, Identifying relations for open information extraction

O. Ferret, Language Production, Cognition, and the Lexicon, chapter Typing Relations in Distributional Thesauri

K. Fundel, R. Küffner, and R. Zimmer, Relex-relation extraction using dependency parse trees, Bioinformatics

K. Gábor, H. Zargayouna, D. Buscaldi, I. Tellier, and T. Charnois, Semantic annotation of the acl anthology corpus for the automatic analysis of scientific literature, LREC

K. Gábor, H. Zargayouna, and D. Buscaldi, Unsupervised relation extraction in specialized corpora using sequence mining, Advances in Intelligent Data Analysis XV (IDA)

K. Gábor, H. Zargayouna, I. Tellier, D. Buscaldi, and T. Charnois, A typology of semantic relations dedicated to scientific literature analysis, SAVE-SD Workshop at the th World Wide Web Conference

K. Gábor, H. Zargayouna, I. Tellier, D. Buscaldi, and T. Charnois, Exploring vector spaces for semantic relations, EMNLP

M. Hearst, Automatic acquisition of hyponyms from large text corpora, COLING
DOI : 10.3115/992133.992154
URL : http://dl.acm.org/ft_gateway.cfm?id=992154&type=pdf

J. R. Hobbs and E. Riloff, Information extraction, Handbook of Natural Language Processing

O. Levy, Y. Goldberg, and I. Dagan, Improving distributional similarity with lessons learned from word embeddings, Transactions of the ACL

M. D. Marneffe, B. Maccartney, and C. D. Manning, Generating typed dependency parses from phrase structure parses, LREC

M.-C. de Marneffe and C. D. Manning, Stanford typed dependencies manual, The Stanford NLP Group, revised for the Stanford Parser v

M.-C. de Marneffe and C. D. Manning, The Stanford typed dependencies representation, COLING Workshop on Cross-framework and Cross-domain Parser

T. Mikolov, W. Yih, and G. Zweig, Linguistic regularities in continuous space word representations, NAACL

R. J. Mooney and R. Bunescu, Mining knowledge from text using information extraction, SIGKDD Explor. Newsl

Exploitation de résultats d'analyse syntaxique pour extraction semi-supervisée des chemins de relations, Conférence sur le Traitement Automatique des Langues Naturelles-TALN

Y. Nakamura-Delloye and R. Stern, Extraction de relations et de patrons de relations entre entités nommées en vue de l'enrichissement d'une ontologie, TOTh : Terminologie & Ontologie : Théories et Applications

J. Nivre, An efficient algorithm for projective dependency parsing

M. Porumb, I. Barbantan, C. Lemnaru, and R. Potolea, Remed : Automatic relation extraction from medical documents, Proceedings of the International Conference on Information Integration and Web-based Applications & Services

D. Radev, P. Muthukrishnan, and V. Qazvinian, The ACL Anthology Network Corpus, ACL Workshop on Text and Citation Analysis for Scholarly Digital Libraries

E. Santus, A. Lenci, and Q. Lu, Chasing hypernyms in vector spaces with entropy

R. Srikant and R. Agrawal, Mining sequential patterns : Generalizations and performance improvements, EDBT

P. D. Turney, Measuring semantic similarity by latent relational analysis, IJCAI

P. D. Turney, Similarity of semantic relations. CoRR, abs/cs

P. D. Turney, Domain and function : A dual-space model of semantic relations and compositions, Journal of Artificial Intelligence Research

P. D. Turney and S. M. Mohammad, Experiments with three approaches to recognizing lexical entailment, Natural Language Engineering

D. Valsamou, Extraction d'Information pour les réseaux de régulation de la graine chez Arabidopsis Thaliana, Ecole doctorale Sciences et technologies de l'information

J. Weeds, D. Clarke, J. Reffin, D. Weir, and B. Keller, Learning to distinguish hypernyms and co-hyponyms

R. Yangarber, W. Lin, and R. Grishman, Unsupervised learning of generalized names

Y. Zhao and G. Karypis, Evaluation of hierarchical clustering algorithms for document datasets, CIKM

Y. Zhao and G. Karypis, Hierarchical clustering algorithms for document datasets, Data Mining and Knowledge Discovery

E. Bartenlian, M. Lacour, M. Labeau, A. Allauzen, G. Wisniewski, and F. Yvon, Adaptation au domaine pour l'analyse morpho-syntaxique, TALN 2017-24e conférence sur le Traitement Automatique des Langues Naturelles, Orléans, 2017.

S. Ben-David, J. Blitzer, K. Crammer, A. Kulesza et al., A theory of learning from different domains, Machine Learning, vol.79, pp.151-175, 2010.

E. Black, F. Jelinek, J. Lafferty, D. M. Magerman, R. Mercer, and S. Roukos, Towards history-based grammars : Using richer models for probabilistic parsing, Proceedings of the Workshop on Speech and Natural Language, HLT'91, pp.134-139, 1992.

C. Bosco, S. Montemagni, and M. Simi, Converting Italian treebanks : Towards an Italian Stanford dependency treebank, Proceedings of the 7th Linguistic Annotation Workshop and Interoperability with Discourse, pp.61-69, 2013.

B. , Détection et correction automatique d'erreurs d'annotation morpho-syntaxique du French Treebank, TALN 2012-19e conférence sur le Traitement Automatique des Langues Naturelles, vol.6, pp.113-137, 2008.

M. Candito and D. Seddah, Le corpus Sequoia : annotation syntaxique et exploitation pour l'adaptation d'analyseur par pont lexical, TALN 2012-19e conférence sur le Traitement Automatique des Langues Naturelles, 2012.

M. Dickinson and W. D. Meurers, Detecting errors in part-of-speech annotation, Proceedings of the Tenth Conference on European Chapter of the Association for Computational Linguistics, vol.1, pp.107-114, 2003.

D. Gusfield, Algorithms on Strings, Trees, and Sequences : Computer Science and Computational Biology, 1997.

J. Lipenkova and M. Souček, Converting Russian dependency treebank to Stanford typed dependencies representation, Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, vol.2, pp.143-147, 2014.

LINDAT/CLARIN digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics

F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel et al., Scikit-learn : Machine learning in Python, Journal of Machine Learning Research, vol.12, pp.2825-2830, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00650905

B. Plank, A. Johannsen, and A. Søgaard, Importance weighting and unsupervised domain adaptation of POS taggers : a negative result, Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, pp.968-973, 2014.

D. Seddah, B. Sagot, M. Candito, V. Mouilleron, and V. Combet, The French Social Media Bank : a treebank of noisy user generated content, The COLING 2012 Organizing Committee, pp.2441-2458, 2012.

H. Shimodaira, Improving predictive inference under covariate shift by weighting the log-likelihood function, Journal of Statistical Planning and Inference, vol.90, issue.2, pp.227-244, 2000.

V. , A non-projective greedy dependency parser with bidirectional LSTMs, Proceedings of the CoNLL 2017 Shared Task : Multilingual Parsing from Raw Text to Universal Dependencies, pp.152-162, 2017.

G. Wisniewski, N. Pécheux, E. Knyazeva, A. Allauzen, and F. Yvon, Apprentissage partiellement supervisé d'un étiqueteur morpho-syntaxique par transfert cross-lingue, Proceedings of TALN 2014, vol.1, pp.173-183, 2014.

Y. Zhang and J. Nivre, Transition-based Dependency Parsing with Rich Non-local Features, Proceedings of ACL 2011, the 49th Annual Meeting of the Association for Computational Linguistics : Human Language Technologies, pp.188-193, 2011.

A. N., A. Nasr, and R. Perrotin, Manuel d'annotation en actes de dialogue pour le corpus Datcha, 2017.

J. L. Austin, How to do things with words, 1975.

M. Baroni, S. Bernardini, A. Ferraresi, and E. Zanchetta, The WaCky wide web : a collection of very large linguistically processed web-crawled corpora, Language Resources and Evaluation, vol.43, pp.209-226, 2009.

M. Core and J. Allen, Coding dialogs with the DAMSL annotation scheme, AAAI fall symposium on communicative action in humans and machines, vol.56, 1997.

G. Damnati, A. Guerraz, and D. Charlet, Web chat conversations from contact centers : a descriptive study, LREC, 2016.

H. Hardy, K. Baker, H. Bonneau-Maynard, L. Devillers, S. Rosset, and T. Strzalkowski, Semantic and dialogic annotation for automated multilingual customer service, Eighth European Conference on Speech Communication and Technology, 2003.

E. Ivanovic, Dialogue act tagging for instant messaging chat sessions, Proceedings of the ACL Student Research Workshop, pp.79-84, 2005.

J. Lafferty, A. McCallum, and F. C. Pereira, Conditional random fields : Probabilistic models for segmenting and labeling sequence data, 2001.

C. Moldovan, V. Rus, and A. Graesser, Automated speech act classification for online chat, MAICS, vol.710, pp.23-29, 2011.

N. Okazaki, CRFsuite : a fast implementation of conditional random fields, 2007.

S. Salim, N. Hernandez, and E. Morin, Comparaison d'approches de classification automatique des actes de dialogue dans un corpus de conversations écrites en ligne sur différentes modalités, 23ème Conférence sur le Traitement Automatique des Langues Naturelles, 2016.

E. Shriberg, R. Dhillon, S. Bhagat, J. Ang, and H. Carvey, The ICSI meeting recorder dialog act (MRDA) corpus, technical report, International Computer Science Institute, 2004.

Z. Yang, D. Yang, C. Dyer, X. He, A. Smola, and E. Hovy, Hierarchical attention networks for document classification, Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics : Human Language Technologies, 2016.
