M. Baroni, S. Bernardini, A. Ferraresi, and E. Zanchetta, The wacky wide web: A collection of very large linguistically processed webcrawled corpora. Language Resources and Evaluation, vol.43, pp.209-226, 2009.

Y. Bengio, R. Ducharme, P. Vincent, and C. Jauvin, A neural probabilistic language model, Journal of Machine Learning Research, vol.3, pp.1137-1155, 2003.

S. Bergsma, D. Lin, and R. Goebel, Discriminative learning of selectional preference from unlabeled text, Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp.59-68, 2008.

K. W. Church and P. Hanks, Word association norms, mutual information & lexicography, Computational Linguistics, vol.16, issue.1, pp.22-29, 1990.

S. Clark and D. Weir, Class-based probability estimation using a semantic hierarchy, Proceedings of the second meeting of the North American Chapter of the Association for Computational Linguistics on Language technologies, pp.95-102, 2001.

R. Collobert and J. Weston, A unified architecture for natural language processing: Deep 33 neural networks with multitask learning, Proceedings of the 25th international conference on Machine learning, pp.160-167, 2008.

K. Erk, S. Padó, and U. Padó, A flexible, corpus-driven model of regular and inverse selectional preferences, Computational Linguistics, vol.36, issue.4, pp.723-763, 2010.

K. Erk, A simple, similarity-based model for selectional preferences, Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, pp.216-223, 2007.

D. Gildea and D. Jurafsky, Automatic labeling of semantic roles, vol.28, pp.245-288, 2002.

E. H. Huang, R. Socher, C. D. Manning, and A. Y. Ng, Improving word representations via global context and multiple word prototypes, Annual Meeting of the Association for Computational Linguistics (ACL), 2012.

D. Daniel, H. S. Lee, and . Seung, Algorithms for non-negative matrix factorization, Advances in Neural Information Processing Systems 13, pp.556-562, 2000.

H. Li and N. Abe, Generalizing case frames using a thesaurus and the MDL principle, Computational linguistics, vol.24, issue.2, pp.217-244, 1998.

T. Li and C. Ding, The relationships among various nonnegative matrix factorization methods for clustering, Data Mining, 2006. ICDM'06. Sixth International Conference on, pp.362-371, 2006.

C. Dong, J. Liu, and . Nocedal, On the limited memory BFGS method for large scale optimization, Mathematical programming, vol.45, issue.1-3, pp.503-528, 1989.

D. Mccarthy and J. Carroll, Disambiguating nouns, verbs, and adjectives using automatically acquired selectional preferences, Computational Linguistics, vol.29, issue.4, pp.639-654, 2003.

T. Mikolov, K. Chen, G. Corrado, and J. Dean, Efficient estimation of word representations in vector space, 2013.

A. Mnih and G. Hinton, Three new graphical models for statistical language modelling, Proceedings of the 24th international conference on Machine learning, pp.641-648, 2007.

J. Nivre, J. Hall, and J. Nilsson, Maltparser: A data-driven parser-generator for dependency parsing, Proceedings of LREC-2006, pp.2216-2219, 2006.

D. Séaghdha and A. Korhonen, Modelling selectional preferences in a lexical hierarchy, Proceedings of the First Joint Conference on Lexical and Computational Semantics, vol.1, pp.170-179, 2012.

D. Séaghdha, Latent variable models of selectional preference, Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pp.435-444, 2010.

S. Padó, U. Padó, and K. Erk, Flexible, corpus-based modelling of human plausibility judgements, Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL), pp.400-409, 2007.

P. Resnik, Selectional constraints: An information-theoretic model and its computational realization, Cognition, vol.61, pp.127-159, 1996.

A. Ritter, M. , and O. E. , A latent dirichlet allocation method for selectional preferences, Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pp.424-434, 2010.

M. Rooth, S. Riezler, and D. Prescher, Inducing a semantically annotated lexicon via em-based clustering, Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics, pp.104-111, 1999.

E. Shutova, S. Teufel, and A. Korhonen, Statistical metaphor processing, Computational Linguistics, vol.39, issue.2, pp.301-353, 2013.

K. Toutanova, D. Klein, C. Manning, and Y. Singer, Feature-rich part-ofspeech tagging with a cyclic dependency network, Proceedings of HLT-NAACL 2003, pp.252-259, 2003.

M. Tsubaki, K. Duh, M. Shimbo, and Y. Matsumoto, Modeling and learning semantic co-compositionality through prototype projections and neural networks, Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, pp.130-140, 2013.

T. Van-de-cruys, A non-negative tensor factorization model for selectional preference induction, 2009.
URL : https://hal.archives-ouvertes.fr/inria-00546045

, Proceedings of the Workshop on Geometrical Models of Natural Language Semantics, pp.83-90

D. Yu, L. Deng, and F. Seide, The deep tensor neural network with applications to large vocabulary speech recognition, IEEE Transactions on Audio, Speech, and Language Processing, vol.21, pp.388-396, 2013.