D. Bahdanau, K. Cho, and Y. Bengio, Neural machine translation by jointly learning to align and translate, 2014.

M. Baroni, G. Dinu, and G. Kruszewski, Don't count, predict! A systematic comparison of context-counting vs. context-predicting semantic vectors, Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp.238-247, 2014.
DOI : 10.3115/v1/P14-1023

Y. Bengio, R. Ducharme, P. Vincent, and C. Janvin, Neural Probabilistic Language Models, J. Mach. Learn. Res, vol.3, pp.1137-1155, 2003.
DOI : 10.1007/3-540-33486-6_6
URL : https://hal.archives-ouvertes.fr/hal-01434258

A. Cardoso-cachopo, Improving Methods for Single-label Text Categorization, 2007.

R. Collobert and J. Weston, A unified architecture for natural language processing, Proceedings of the 25th international conference on Machine learning, ICML '08, pp.160-167, 2008.
DOI : 10.1145/1390156.1390177

M. Andrew, . Dai, V. Quoc, and . Le, Semisupervised sequence learning, Advances in Neural Information Processing Systems 28, pp.3061-3069, 2015.

Y. Dauphin and Y. Bengio, Stochastic ratio matching of rbms for sparse high-dimensional inputs, Advances in Neural Information Processing Systems 26, pp.1340-1348, 2013.

Z. Deng, K. Luo, and H. Yu, A study of supervised term weighting scheme for sentiment analysis, Expert Systems with Applications, vol.41, issue.7, pp.3506-3513, 2014.
DOI : 10.1016/j.eswa.2013.10.056

L. Dong, F. Wei, C. Tan, D. Tang, M. Zhou et al., Adaptive Recursive Neural Network for Target-dependent Twitter Sentiment Classification, Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pp.49-54, 2014.
DOI : 10.3115/v1/P14-2009

J. Gao, P. Pantel, M. Gamon, X. He, and L. Deng, Modeling Interestingness with Deep Neural Networks, Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp.2-13, 2014.
DOI : 10.3115/v1/D14-1002

Y. Goldberg, A primer on neural network models for natural language processing. CoRR, abs/1510, p.726, 2015.

K. Moritz, H. , and P. Blunsom, The Role of Syntax in Vector Space Models of Compositional Semantics, Proceedings of ACL, 2013.

M. Iyyer, V. Manjunatha, J. Boyd-graber, H. Daumé, and I. , Deep Unordered Composition Rivals Syntactic Methods for Text Classification, Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pp.1681-1691, 2015.
DOI : 10.3115/v1/P15-1162

R. Johnson and T. Zhang, Effective Use of Word Order for Text Categorization with Convolutional Neural Networks, Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp.103-112, 2015.
DOI : 10.3115/v1/N15-1011

N. Kalchbrenner, E. Grefenstette, and P. Blunsom, A Convolutional Neural Network for Modelling Sentences, Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp.655-665, 2014.
DOI : 10.3115/v1/P14-1062

Y. Kim and O. Zhang, Credibility Adjusted Term Frequency: A Supervised Term Weighting Scheme for Sentiment Analysis and Text Classification, Proceedings of the 5th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, pp.79-83, 2014.
DOI : 10.3115/v1/W14-2614

Y. Kim, Convolutional Neural Networks for Sentence Classification, Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp.1746-1751, 2014.
DOI : 10.3115/v1/D14-1181

M. Lan, C. Tan, and H. Low, Proposing a new term weighting scheme for text categorization, Proceedings of the 21st National Conference on Artificial Intelligence, pp.763-768, 2006.

V. Quoc, T. Le, and . Mikolov, Distributed representations of sentences and documents, Proceedings of the 31th International Conference on Machine Learning, ICML 2014, pp.21-26, 2014.

J. Li, Feature weight tuning for recursive neural networks. CoRR, abs/1412, 2014.

W. Ling, Y. Tsvetkov, S. Amir, R. Fermandez, C. Dyer et al., Not all contexts are created equal: Better word representations with variable attention Association for Computational Lin- guistics, Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp.1367-1372, 2015.

A. L. Maas, R. E. Daly, P. T. Pham, D. Huang, A. Y. Ng et al., Learning word vectors for sentiment analysis, Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, pp.142-150, 2011.

M. Mammadov, J. Yearwood, and L. Zhao, Proceedings, chapter A New Supervised Term Ranking Method for Text Categorization, AI 2010: Advances in Artificial Intelligence: 23rd Australasian Joint Conference, pp.102-111, 2010.

C. D. Manning, P. Raghavan, and H. Schütze, Introduction to Information Retrieval, chapter Scoring, term weighting, and the vector space model, 2008.

T. Mikolov, K. Chen, G. Corrado, and J. Dean, Efficient estimation of word representations in vector space, 2013.

T. Mikolov, W. Yih, and G. Zweig, Linguistic regularities in continuous space word representations, Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp.746-751, 2013.

J. Nam, J. Kim, E. Loza-mencía, I. Gurevych, and J. Fürnkranz, Large-Scale Multi-label Text Classification ??? Revisiting Neural Networks, Proceedings of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD-14), Part 2, pp.437-452, 2014.
DOI : 10.1007/978-3-662-44851-9_28

G. Paltoglou and M. Thelwall, A study of information retrieval weighting schemes for sentiment analysis Association for Computational Linguis- tics, Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics , ACL '10, pp.1386-1395, 2010.

B. Pang and L. Lee, Seeing stars, Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics , ACL '05, pp.115-124, 2005.
DOI : 10.3115/1219840.1219855

J. Pennington, R. Socher, and C. Manning, Glove: Global Vectors for Word Representation, Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp.1532-1543, 2014.
DOI : 10.3115/v1/D14-1162

X. Quan, W. Liu, and B. Qiu, Term Weighting Schemes for Question Categorization, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.33, issue.5, pp.1009-1021, 2011.
DOI : 10.1109/TPAMI.2010.154

R. Socher, A. Perelygin, J. Wu, J. Chuang, C. D. Manning et al., Recursive deep models for semantic compositionality over a sentiment treebank, Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, pp.1631-1642, 2013.

K. Tai, R. Socher, and C. D. Manning, Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks, Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pp.1556-1566, 2015.
DOI : 10.3115/v1/P15-1150

J. Turian, L. Ratinov, and Y. Bengio, Word representations: A simple and general method for semi-supervised learning, Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, ACL '10, pp.384-394, 2010.

L. Van-der-maaten and G. E. Hinton, Visualizing high-dimensional data using t-sne, Journal of Machine Learning Research, vol.9, pp.2579-2605, 2008.

S. Wang and C. D. Manning, Baselines and bigrams: Simple, good sentiment and topic classification, Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Short Papers, pp.90-94, 2012.

P. Wang, J. Xu, B. Xu, C. Liu, H. Zhang et al., Semantic clustering and convolutional neural network for short text categorization Association for Computational Linguis- tics, Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, pp.352-357, 2015.

K. Xu, J. Ba, R. Kiros, K. Cho, A. C. Courville et al., Show, attend and tell: Neural image caption generation with visual attention, Proceedings of the 32nd International Conference on Machine Learning, ICML 2015, pp.6-11, 2015.

D. Matthew and . Zeiler, ADADELTA: an adaptive learning rate method. CoRR, abs/1212, 2012.

Y. Zhang and B. Wallace, A sensitivity analysis of (and practitioners' guide to) convolutional neural networks for sentence classification, 2015.