E. Agirre, C. Banea, C. Cardie, D. Cer, M. Diab et al., SemEval-2015 Task 2: Semantic Textual Similarity, English, Spanish and Pilot on Interpretability, Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015), pp.252-263, 2015.
DOI : 10.18653/v1/S15-2045

Y. Chen and A. Eisele, MultiUN v2: UN documents with multilingual alignments, LREC, pp.2500-2504, 2012.

R. Collobert and J. Weston, A unified architecture for natural language processing, Proceedings of the 25th international conference on Machine learning, ICML '08, pp.160-167, 2008.
DOI : 10.1145/1390156.1390177

J. Ferrero, F. Agnès, L. Besacier, and D. Schwab, Using Word Embedding for Cross-Language Plagiarism Detection, European Association for Computational Linguistics, 2017.

King Saud University corpus, 2012.

C. Lioma and R. Blanco, Part of Speech Based Term Weighting for Information Retrieval, European Conference on Information Retrieval, pp.412-423, 2009.

Meedan, Meedan's open source Arabic English, https://github.com/anastaw/meedan-memory, 2012.

T. Mikolov, K. Chen, G. Corrado, and J. Dean, Efficient estimation of word representations in vector space, In: ICLR: Proceedings of the International Conference on Learning Representations Workshop Track, arXiv:1301.3781, 2013.

T. Mikolov, I. Sutskever, K. Chen, G. S. Corrado, and J. Dean, Distributed representations of words and phrases and their compositionality, Advances in Neural Information Processing Systems, pp.3111-3119, 2013.

T. Mikolov, W. Yih, and G. Zweig, Linguistic regularities in continuous space word representations, HLT-NAACL, pp.746-751, 2013.

A. Mnih and G. E. Hinton, A scalable hierarchical distributed language model, Advances in Neural Information Processing Systems 21, pp.1081-1088, 2009.

H. M. Raafat, M. A. Zahran, and M. Rashwan, Arabase: a database combining different Arabic resources with lexical and semantic information, In KDIR/KMIS, pp.233-240, 2013.

M. K. Saad and W. Ashour, OSAC: Open source Arabic corpora, 6th ArchEng Int. Symposiums, EEECS, 2010.

G. Salton and C. Buckley, Term-weighting approaches in automatic text retrieval, Information Processing & Management, pp.513-523, 1988.
DOI : 10.1016/0306-4573(88)90021-0

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=

D. Schwab, Approche hybride - lexicale et thématique - pour la modélisation, la détection et l'exploitation des fonctions lexicales en vue de l'analyse sémantique de texte, 2005.

J. Tiedemann, Parallel data, tools and interfaces in OPUS, LREC, pp.2214-2218, 2012.

J. Turian, L. Ratinov, and Y. Bengio, Word representations: a simple and general method for semi-supervised learning, Proceedings of the 48th annual meeting of the association for computational linguistics, pp.384-394, 2010.

Wikiar, Arabic Wikipedia corpus, 2006.