P. Koehn and H. Hoang, Factored translation models, Proc. of EMNLP-CoNLL, pp.868-876, 2007.

D. M. Blei, A. Y. Ng, and M. I. Jordan, Latent Dirichlet allocation, The Journal of Machine Learning Research, vol.3, pp.993-1022, 2003.

P. Koehn, H. Hoang, A. Birch, C. Callison-Burch, M. Federico et al., Moses: Open source toolkit for statistical machine translation, Proc. of ACL, pp.177-180, 2007.

M. Grčar, S. Krek, and K. Dobrovoljc, Obeliks: statistični oblikoskladenjski označevalnik in lematizator za slovenski jezik, Proc. of the 15th International Multiconference (IS), pp.89-94, 2012.

H. Schmid, Improvements in part-of-speech tagging with an application to German, Proc. of the ACL SIGDAT Workshop, pp.47-50, 1995.

A. Stolcke, SRILM - an extensible language modeling toolkit, Proc. of Interspeech, 2002.

Q. Gao and S. Vogel, Parallel implementations of word alignment tool, Proc. of the ACL Workshop: Software Engineering, Testing, and Quality Assurance for Natural Language Processing, pp.49-57, 2008.

F. J. Och, Minimum error rate training in statistical machine translation, Proc. of ACL, vol.1, 2003.

S. Huet, E. Manishina, and F. Lefèvre, Factored machine translation systems for Russian-English, Proc. of WMT, 2013.
URL : https://hal.archives-ouvertes.fr/hal-02021814

S. Deerwester, S. Dumais, G. Furnas, T. Landauer, and R. Harshman, Indexing by latent semantic analysis, Journal of the American Society for Information Science, vol.41, issue.6, pp.391-407, 1990.
DOI : 10.1002/(sici)1097-4571(199009)41:6<391::aid-asi1>3.0.co;2-9
URL : http://www.cs.bham.ac.uk/~pxt/IDA/lsa_ind.pdf

J. R. Bellegarda, A latent semantic analysis framework for large-span language modeling, Proc. of Eurospeech, 1997.

T. Hofmann, Probabilistic latent semantic analysis, Proc. of Uncertainty in Artificial Intelligence, UAI '99, 1999.

G. Salton, Analysis and Retrieval of Information by Computer, 1989.

J. R. Bellegarda, Exploiting latent semantic information in statistical language modeling, Proceedings of the IEEE, vol.88, issue.8, pp.1279-1296, 2000.

Y. Suzuki, F. Fukumoto, and Y. Sekiguchi, Keyword extraction using term-domain interdependence for dictation of radio news, Proc. of COLING, vol.2, pp.1272-1276, 1998.

T. Hofmann, Unsupervised learning by probabilistic latent semantic analysis, Machine Learning, vol.42, issue.1, pp.177-196, 2001.

T. Minka and J. Lafferty, Expectation-propagation for the generative aspect model, Proc. of the Conference on Uncertainty in Artificial Intelligence, pp.352-359, 2002.

T. L. Griffiths and M. Steyvers, Finding scientific topics, Proceedings of the National Academy of Sciences of the United States of America, vol.101, pp.5228-5235, 2004.

S. Geman and D. Geman, Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images, IEEE Transactions on Pattern Analysis and Machine Intelligence, issue.6, pp.721-741, 1984.

G. Heinrich, Parameter estimation for text analysis, Fraunhofer IGD, 2009.

N. Bach, F. Huang, and Y. Al-Onaizan, Goodness: A method for measuring machine translation confidence, Proc. of ACL, 2011.