HAL will be down for maintenance from Friday, June 10 at 4pm through Monday, June 13 at 9am. More information
Skip to Main content Skip to Navigation
Book sections

Automated classification of unexpected uses of this and that in a learner corpus of English

Abstract : This paper deals with the way learners make use of the demonstratives this and that. NLP tools are applied to classify occurrences of native and non-native uses of the two forms. The objective of the two experiments is to automatically identify expected and unexpected uses. The textual environment of all the occurrences is explored at text and PoS level to uncover features which play a role in the selection of a particular form. Results of the first experiment show that the PoS features predeterminer and determiner, which are found in the close context of occurrences, help identify unexpected learner uses among many occurrences also including native uses. The second experiment shows evidence that the PoS features plural noun and coordinating conjunction influence the unexpected uses of the demonstratives by learners. This study shows that NLP tools can be used to explore texts and uncover underlying grammatical categories that play a role in the selection of specific words. 2 Thomas Gaillat
Complete list of metadata

Cited literature [16 references]  Display  Hide  Download

Contributor : Thomas Gaillat Connect in order to contact the contributor
Submitted on : Monday, February 19, 2018 - 12:06:51 PM
Last modification on : Tuesday, March 8, 2022 - 3:07:19 AM


TG-ICAME33 (copy).pdf
Files produced by the author(s)



Thomas Gaillat, Pascale Sébillot, Nicolas Ballier. Automated classification of unexpected uses of this and that in a learner corpus of English. Lieven Vandelanotte; Kristin Davidse; Caroline Gentens. Recent Advances in Corpus Linguistics: Developing and Exploiting Corpora, 78, Brill, pp.309-324, 2014, Rodopi Language and Linguistics Special E-Book, ⟨10.1163/9789401211130_015⟩. ⟨hal-01058760v2⟩



Record views


Files downloads