C. Alm, D. Roth, and R. Sproat, Emotions from text, Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing , HLT '05, pp.579-586, 2005.
DOI : 10.3115/1220575.1220648

R. Arstein and M. Poesio, Inter-Coder Agreement for Computational Linguistics, Computational Linguistics, vol.27, issue.1, pp.555-596, 2008.
DOI : 10.1037/0033-2909.103.3.374

R. Artstein and M. Poesio, Bias decreases in proportion to the number of annotators, Proceedings FG-MoL'2005, p.150, 2005.

P. Saskia-bayerl and K. I. Paul, What Determines Inter-Coder Agreement in Manual Annotations? A Meta-Analytic Investigation, Computational Linguistics, vol.37, issue.699, p.725, 2011.

P. Brennan and A. Silman, Statistical methods for assessing observer variability in clinical measures., BMJ, vol.304, issue.6840, pp.1491-1494, 1992.
DOI : 10.1136/bmj.304.6840.1491

T. Byrt, J. Bishop, and J. Carlin, Bias, prevalence and kappa, Journal of Clinical Epidemiology, vol.46, issue.5, pp.423-429, 1993.
DOI : 10.1016/0895-4356(93)90018-V

H. Brenner and U. Kliebsch, Dependence of Weighted Kappa Coefficients on the Number of Categories, Epidemiology, vol.7, issue.2, pp.199-202, 1996.
DOI : 10.1097/00001648-199603000-00016

Z. Callejas and R. Lopez-cozar, Influence of contextual information in emotion annotation for spoken dialogue systems, Speech Communication, vol.50, issue.5, pp.416-433, 2008.
DOI : 10.1016/j.specom.2008.01.001
URL : https://hal.archives-ouvertes.fr/hal-00499202

J. Carletta, Assessing agreement on classification tasks: the Kappa statistic, Computational Linguistics, vol.22, issue.2, pp.249-254, 1996.

J. Cohen, A Coefficient of Agreement for Nominal Scales, Educational and Psychological Measurement, vol.20, issue.1, pp.37-46, 1960.
DOI : 10.1177/001316446002000104

J. Cohen, Weighted kappa: Nominal scale agreement provision for scaled disagreement or partial credit., Psychological Bulletin, vol.70, issue.4, pp.213-220, 1968.
DOI : 10.1037/h0026256

R. Cowie and R. Cornelius, Describing the emotional states that are expressed in speech, Speech Communication, vol.40, issue.1-2, pp.5-32, 2003.
DOI : 10.1016/S0167-6393(02)00071-7

J. Lee and . Cronbach, Coefficient alpha and the internal structure of tests, Psychometrica, vol.16, pp.297-334, 1951.

L. Devillers, L. Vidrascu, and L. Lamel, Challenges in real-life emotion annotation and machine learning based detection, Neural Networks, vol.18, issue.4, pp.407-422, 2005.
DOI : 10.1016/j.neunet.2005.03.007

P. Ekman, Patterns of emotions: New Analysis of Anxiety and Emotion, 1999.

B. D. , E. , and M. Glass, The kappa statistic: A second look, Computational Linguistics, vol.30, issue.1, pp.95-101, 2004.

M. Davies and J. Fleiss, Measuring Agreement for Multinomial Data, Biometrics, vol.38, issue.4, pp.1047-1051, 1982.
DOI : 10.2307/2529886

A. Feinstein and D. Cicchetti, High agreement but low Kappa: I. the problems of two paradoxes, Journal of Clinical Epidemiology, vol.43, issue.6, pp.543-549, 1990.
DOI : 10.1016/0895-4356(90)90158-L

L. Joseph and . Fleiss, 1971 Measuring nominal scale agreement among many raters, Psychological Bulletin, vol.76, issue.5, pp.378-382

A. Hayes, Answering the Call for a Standard Reliability Measure for Coding Data, Communication Methods and Measures, vol.12, issue.1, pp.77-89, 2007.
DOI : 10.1037/0033-2909.103.3.374

K. Krippendorff, Content Analysis: an Introduction to its Methodology. Chapter 11, Sage, 2004.

K. Krippendorff, Reliability in Content Analysis., Human Communication Research, vol.103, issue.3, pp.411-433, 2004.
DOI : 10.1086/266577

K. Krippendorff, Testing the reliability of content analysis data: what is involved and why The content analysis reader, 2008.

K. Krippendorff, Testing the reliability of content analysis data: what is involved and why The Content Analysis Reader, 2009.

J. Marc-le-tallec, J. Villaneau, D. Antoine, and . Duhaut, Affective Interaction with a Companion Robot for vulnerable Children: a Linguistically based Model for Emotion Detection, Proc. Language Technology Conference, pp.445-450, 2011.

B. Macwhinney, The CHILDES project : Tools for analyzing talk. 3 rd edition, 2000.

J. Muzerelle, A. Lefeuvre, E. Schang, J. Antoine, A. Pelletier et al., AN- COR_Centre, a large free spoken French coreference corpus: description of the resource and reliability measures, Proc. LREC', p.2014, 2014.

K. Neuendorf, The Content Analysis Guidebook, 2002.

J. Russell, A circumplex model of affect., Journal of Personality and Social Psychology, vol.39, issue.6, pp.1161-1178, 1980.
DOI : 10.1037/h0077714
URL : https://hal.archives-ouvertes.fr/hal-01086372

K. Scherer, What are emotions? And how can they be measured?, Social Science Information, vol.42, issue.1, pp.694-729, 2005.
DOI : 10.1177/0539018405058216

B. Schuller, S. Steidl, and A. Batliner, The Interspeech'2009 emotion challenge, Proceedings Interspeech'2009, p.315, 2009.

W. Scott, Reliability of Content Analysis: The Case of Nominal Scale Coding, Public Opinion Quarterly, vol.19, issue.3, pp.321-325, 1955.
DOI : 10.1086/266577

J. Sim and C. Wright, The Kappa Statistic in Reliability Studies: Use, Interpretation, and Sample Size Requirements, Physical Therapy, vol.85, issue.3, p.257268, 2005.

M. Thelwall, K. Buckley, G. Paltoglou, D. Cai, and A. Kappas, Sentiment strength detection in short informal text, Journal of the American Society for Information Science and Technology, vol.5, issue.2, pp.61-2544, 2010.
DOI : 10.1002/asi.21416

P. Turney, Thumbs up or thumbs down? Semantic orientation applied to unsupervised classification of reviews, Proceedings ACL'02, pp.417-424, 2002.

W. Vach, The dependence of Cohen's kappa on the prevalence does not matter, Journal of Clinical Epidemiology, vol.58, issue.7, pp.655-661, 2005.
DOI : 10.1016/j.jclinepi.2004.02.021

R. Vassallo, Comment le Grand Nord découvrit l'été. Flammarion, 2004.

K. Vanderheyden, Le Noel des animaux de la montagne. Fairy tale available at the URL, 1995.

E. Volkova, B. Mohler, D. Meurers, D. Gerdemann, and H. Bülthoff, Emotional perception of fairy tales: achieving agreement in emotion annotation of text, Proceedings NAACL HLT 2010, 2010.

T. Wilson, J. Wiebe, and P. Hoffmann, Recognizing contextual polarity in phrase-level sentiment analysis, Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing , HLT '05, pp.347-354, 2005.
DOI : 10.3115/1220575.1220619