C. M. Bishop, Pattern Recognition and Machine Learning, 2006.

E. Brill, A simple rule-based part of speech tagger, Proceedings of the 3rd Conference on Applied Natural Language Processing (ANLC), 1992.

E. Brill, M. Pop, ;. S. Armstrong, K. Church, P. Isabelle et al., Unsupervised learning of disambiguation rules for Part-of-Speech tagging, Natural Language Processing Using Very Large Corpora, vol.11, pp.27-42, 1999.

A. Cangelosi, K. R. Coventry, R. Rajapakse, D. Joyce, A. Bacon et al., Grounding language in perception: A connectionist model of spatial terms and vague quantifiers, Modeling Language, Cognition, and Action: Proceedings of the 9th Neural Computation and Psychology Workshop, pp.47-56, 2005.

C. Christodoulopoulos, S. Goldwater, and M. Steedman, Two decades of unsupervised POS induction: How far have we come?, Proceedings of the 15th Conference on Empirical Methods in Natural Language Processing (EMNLP), p.3, 2010.

K. W. Church, A stochastic parts program and noun phrase parser for unrestricted text, Proceedings of the 2nd Conference on Applied Natural Language Processing (ANLC), 1988.

C. Craye, D. Filliat, and J. F. Goudou, Environment exploration for object-based visual saliency learning, Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), 2016.
URL : https://hal.archives-ouvertes.fr/hal-01289159

C. Craye, D. Filliat, and J. F. Goudou, RL-IAC: An exploration policy for online saliency learning on an autonomous mobile robot, Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2016.
URL : https://hal.archives-ouvertes.fr/hal-01392947

D. Cutting, J. Kupiec, J. Pedersen, and P. Sibun, A practical Part-of-Speech tagger, Proceedings of the 3rd Conference on Applied Natural Language Processing (ANLC), 1992.

C. R. Dawson, J. Wright, A. Rebguns, M. V. Escarcega, D. Fried et al., A generative probabilistic framework for learning spatial language, Proceedings of the 3rd Joint IEEE International Conference on Development and Learning and on Epigenetic Robotics (ICDL), 2013.

M. A. Fischler and R. C. Bolles, Random Sample Consensus: A paradigm for model fitting with applications to image analysis and automated cartography, Communications of the ACM, vol.24, issue.6, pp.381-395, 1981.

J. Gao and M. Johnson, A comparison of Bayesian estimators for unsupervised Hidden Markov Model POS taggers, Proceedings of the 13th Conference on Empirical Methods in Natural Language Processing (EMNLP), pp.344-352, 2008.

S. Geman and D. Geman, Stochastic relaxation, Gibbs distributions and the Bayesian restoration of images, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.6, issue.6, pp.721-741, 1984.

S. Guadarrama, L. Riano, D. Golland, D. Gohring, Y. Jia et al., Grounding spatial relations for human-robot interaction, Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2013.

S. Harnad, The symbol grounding problem, Physica D, vol.42, issue.1, pp.335-346, 1990.

W. P. Headden, D. Mcclosky, and E. Charniak, Evaluating unsupervised part-of-speech tagging for grammar induction, Proceedings of the 22nd International Conference on Computational Linguistics (COLING), 2008.

X. Y. Jiang, U. Meier, and H. Bunke, Fast range image segmentation using high-level segmentation primitives, Proceedings of the 3rd IEEE International Workshop on Applications of Computer Vision (WACV), 1996.

K. Koster and M. Spann, MIR: An approach to robust clustering application to range image segmentation, IEEE Transaction on Pattern Analysis and Machine Intelligence, vol.22, issue.5, 2000.

B. Landau and R. Jackendoff, What' and 'where' in spatial language and spatial cognition, Behavioral and Brain Sciences, vol.16, issue.1, pp.217-238, 1993.

C. Liu, J. Walker, and J. Y. Chai, Ambiguities in spatial language understanding in situated human robot dialogue, Proceedings of the AAAI Fall Symposium: Dialog with Robots, 2010.

C. Matuszek, N. Fitzgerald, L. Zettlemoyer, L. Bo, and D. Fox, A joint model of language and perception for grounded attribute learning, Proceedings of the 29th International Conference on Machine Learning (ICML), 2012.

R. Moratz, T. Tenbrink, J. Bateman, and K. Fischer, Spatial knowledge representation for human-robot interaction, Spatial Cognition III, pp.263-286, 2003.

G. Neubig, Simple, correct parallelization for blocked Gibbs sampling, 2014.

A. Nguyen and B. Le, 3D point cloud segmentation: A survey, Proceedings of the 6th IEEE International Conference on Robotics, Automation, and Mechatronics (RAM), 2013.

P. Y. Oudeyer, Self-Organization in the Evolution of Speech, vol.6, 2006.
URL : https://hal.archives-ouvertes.fr/hal-00818204

K. Plunkett, C. Sinha, M. F. Moller, and O. Strandsby, Symbol grounding or the emergence of symbols? vocabulary growth in children and a connectionist net, Connection Science, vol.4, issue.3-4, pp.293-312, 1992.

T. Regier, The Human Semantic Potential: Spatial Language and Constrained Connectionism, 1996.

B. Rosman and S. Ramamoorthy, Learning spatial relationships between objects, International Journal of Robotics Research, vol.30, issue.11, pp.1328-1342, 2011.

D. Roy, Learning visually-grounded words and syntax for a scene description task, Computer Speech and Language, vol.16, issue.3, pp.353-385, 2002.

D. Roy, K. Hsiao, and N. Mavridis, Conversational robots: Building blocks for grounding word meanings, Proceedings of the International Workshop on Learning Word Meaning from Non-Linguistic Data (HLT-NAACL), 2003.

R. B. Rusu, G. Bradski, J. Hsu, and R. Thibaux, Fast 3D recognition and pose using the viewpoint feature histogram, Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), p.3, 2010.

R. Schnabel, R. Wahl, and R. Klein, Efficient RANSAC for point-cloud shape detection, Computer Graphics Forum, vol.26, issue.2, pp.214-226, 2007.

J. M. Siskind, Grounding language in perception, Artificial Intelligence Review, vol.8, pp.371-391

K. Smith, A. D. Smith, and R. A. Blythe, Cross-situational learning: An experimental study of word-learning mechanisms, Computer Graphics Forum, vol.35, issue.3, p.5, 2011.

L. Steels, The symbol grounding problem has been solved, so what's next?, Symbols and Embodiment: Debates on Meaning and Cognition, pp.223-244, 2008.

J. Strom, A. Richardson, and E. Olson, Graph based segmentation of colored 3D laser point clouds, Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2010.

M. K. Tanenhaus, M. J. Spivey-knowlton, K. M. Eberhard, and J. C. Sedivy, Integration of visual and linguistic information in spoken language comprehension, Science, vol.268, issue.5217, pp.1632-1634, 1995.

A. Taniguchi, T. Taniguchi, and A. Cangelosi, Multiple categorization by iCub: Learning relationships between multiple modalities and words, Proceedings of the International Workshop on Machine Learning Methods for High-Level Cognitive Capabilities in Robotics, in conjunction with the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2016.

T. Taniguchi, T. Nagai, T. Nakamura, N. Iwahashi, T. Ogata et al., Symbol emergence in robotics: A survey, Advanced Robotics, vol.30, issue.2, pp.11-12, 2016.

S. Tellex, T. Kollar, S. Dickerson, M. R. Walter, A. G. Banerjee et al., Approaching the symbol grounding problem with probabilistic graphical models, vol.32, pp.64-76, 2011.

K. Toutanova and M. Johnson, A Bayesian LDA-based model for semi-supervised part-of-speech tagging, Proceedings of the 20th International Conference on Neural Information Processing Systems (NIPS), pp.1521-1528, 2007.