J. Lafferty, A. Mccallum, and F. Pereira, Conditional random fields: Probabilistic models for segmenting and labeling sequence data, International Conference on Machine Learning (ICML), 2001.

W. Wang, R. Besançon, O. Ferret, and B. Grau, Filtering and clustering relations for unsupervised information extraction in open domain, Proceedings of the 20th ACM international conference on Information and knowledge management, CIKM '11, pp.1405-1414, 2011.
DOI : 10.1145/2063576.2063780

W. Wang, R. Besan-c, O. Ferret, and B. Grau, Evaluation of unsupervised information extraction, Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC'12), 2012.

A. R. Ebadat, V. Claveau, and P. Sébillot, Semantic clustering using bag-of-bag-offeatures, Actes de le 9e conférence en recherche d'information et applications, 2012.

Z. Kozareva, Bootstrapping named entity recognition with automatically generated gazetteer lists, Proceedings of the Eleventh Conference of the European Chapter of the Association for Computational Linguistics: Student Research Workshop on, EACL '06, pp.15-21, 2006.
DOI : 10.3115/1609039.1609041

J. Kazama and K. Torisawa, Exploiting wikipedia as external knowledge for named entity recognition, Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pp.698-707, 2007.

W. Liao and S. V. , A simple semi-supervised algorithm for named entity recognition, Proceedings of the NAACL HLT 2009 Workshop on Semi-Supervised Learning for Natural Language Processing, SemiSupLearn '09, pp.58-65, 2009.
DOI : 10.3115/1621829.1621837

B. Merialdo, Tagging english text with a probabilistic model, Computational Linguistics, vol.20, pp.155-171, 1994.

D. Richard and F. Benoit, Semi-supervised part-of-speech tagging in speech applications, Interspeech 2010, Makuhari (Japan), 2010.
URL : https://hal.archives-ouvertes.fr/hal-01433898

S. Ravi and K. Knight, Minimized models for unsupervised part-of-speech tagging, Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1, ACL-IJCNLP '09, pp.504-512, 2009.
DOI : 10.3115/1687878.1687950

N. Smith and J. Eisner, Contrastive estimation, Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics , ACL '05, 2005.
DOI : 10.3115/1219840.1219884

S. Goldwater and T. L. Griffiths, A fully bayesian approach to unsupervised part-ofspeech tagging, Proceedings of the ACL, 2007.

M. Collins and Y. Singer, Unsupervised models for named entity classification, Proceedings of Empirical Methods for Natural Language Processing (EMNLP) conference, 1999.

M. Elsner, E. Charniak, and M. Johnson, Structured generative models for unsupervised named-entity clustering, Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics on, NAACL '09, 2009.
DOI : 10.3115/1620754.1620778

H. Ji and R. Grishman, Knowledge base population: Successful approaches and challenges, Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, pp.1148-1158, 2011.

T. Wang, J. Li, Q. Diao, W. Hu, Y. Z. Dulong et al., Semantic event detection using conditional random fields, IEEE Conference on Computer Vision and Pattern Recognition Workshop (CVPRW '06, 2006.

A. Pranjal, R. Delip, and R. Balaraman, Part of speech tagging and chunking with hmm and crf, Proceedings of NLP Association of India (NLPAI) Machine Learning Contest, 2006.

M. Constant, I. Tellier, D. Duchier, Y. Dupont, A. Sigogne et al., Intégrer des connaissances liguistiques dans un CRF : ApplicationàApplication`Applicationà l'apprentissage d'un segmenteur-´ etiqueteur du français, Traitement Automatique du Langage Naturel (TALN'11), 2011.

C. Raymond and J. Fayolle, Reconnaissance robuste d'entités nommées sur de la parole transcrite automatiquement, Actes de la conférence Traitement Automatique des Langues Naturelles, 2010.

B. Liu, Y. Xia, Y. , and P. , Cltree-clustering through decision tree construction, IBM Research, 2000.

T. Hastie, R. Tibshirani, and J. H. Friedman, The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2001.

T. Shi and S. Horvath, Unsupervised Learning With Random Forest Predictors, Journal of Computational and Graphical Statistics, vol.15, issue.1, pp.118-138, 2005.
DOI : 10.1198/106186006X94072

N. N. Schraudolph, J. Yu, and S. Günter, A stochastic quasi-Newton method for online convex optimization, Proceedings of 11th International Conference on Artificial Intelligence and Statistics. Conference Proceedings, pp.436-443, 2007.

L. Breiman, Bagging predictors, Machine Learning, vol.10, issue.2, pp.123-140, 1996.
DOI : 10.1007/BF00058655

S. Van-dongen, Graph Clustering by Flow Simulation, Thèse de doctorat, 2000.

T. Lavergne, O. Cappé, and F. Yvon, Practical very large scale CRFs, Proceedings the 48th Annual Meeting of the Association for Computational Linguistics (ACL), pp.504-513, 2010.

K. Fort and V. Claveau, Annotating football matches: influence of the source medium on manual annotation, Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC'12), 2012.
URL : https://hal.archives-ouvertes.fr/hal-00709170

C. Manning, P. Raghavan, and H. Schütze, Introduction to information retrieval, 2008.
DOI : 10.1017/CBO9780511809071

W. M. Rand, Objective Criteria for the Evaluation of Clustering Methods, Journal of the American Statistical Association, vol.15, issue.336, pp.846-850, 1971.
DOI : 10.1080/01621459.1963.10500845

J. Nguyen-xuan-vinh and J. B. Epps, Information theoretic measures for clusterings comparison, Journal of Machine Learning Research, 2010.

L. Hubert and P. Arabie, Comparing partitions, Journal of Classification, vol.78, issue.1, pp.193-218, 1985.
DOI : 10.1007/BF01908075

G. Gravier, J. F. Bonastre, E. Geoffrois, S. Galliano, K. M. Tait et al., ESTER, une campagne d'´ evaluation des systèmes d'indexation automatique, 2005.

T. M. Mitchell, The need for biases in learning generalizations. Rutgers Computer Science Department, 1980.