Towards an n-grammar of English

Bert Cappelle; Natalia Grabar

Chapitre D'ouvrage Année : 2016

Towards an n-grammar of English

(1) , (1)

Bert Cappelle

Fonction : Auteur
PersonId : 6066
IdHAL : bcappelle
ORCID : 0000-0002-4779-6259
IdRef : 110735609

Savoirs, Textes, Langage (STL) - UMR 8163

Natalia Grabar

Fonction : Auteur
PersonId : 6735
IdHAL : natalia-grabar
ORCID : 0000-0002-0237-4554
IdRef : 089015460

Savoirs, Textes, Langage (STL) - UMR 8163

Résumé

In this chapter, it is shown how we can develop a new type of learner’s or student’s grammar based on n-grams (sequences of 2 or 3, 4, etc. items) automatically extracted from a large corpus, such as the Corpus of Contemporary American English (COCA). The notion of n-gram and its primary role in statistical language modelling is first discussed. The part-of-speech (POS) tagging provided for lexical n-grams in COCA is then demonstrated to be useful for the identification of frequent structural strings in the corpus. We propose using the hundred most frequent POS-based 5-grams as the content around which an ‘n-grammar’ of English can be constructed. We counter some obvious objections to this approach (e.g. that these patterns only scratch the surface, or that they display much overlap among them) and describe extra features for this grammar, relating to the patterns’ productivity, corpus dispersion, functional description and practice potential.

Mots clés

ESL/EFL POS n-grams frequency constructicon grammar teaching

Domaines

Linguistique

Fichier principal

De Knop_10_Cappelle.pdf (6.73 Mo)

Origine : Accord explicite pour ce dépôt

Natalia Grabar : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01426700

Soumis le : mercredi 4 janvier 2017-18:16:23

Dernière modification le : mercredi 24 janvier 2024-09:54:19

Archivage à long terme le : mercredi 5 avril 2017-15:13:29

Dates et versions

hal-01426700 , version 1 (04-01-2017)

Identifiants

HAL Id : hal-01426700 , version 1

Citer

Bert Cappelle, Natalia Grabar. Towards an n-grammar of English. Constructionist Approaches to Second Language Acquisition and Foreign Language Teaching, 2016. ⟨hal-01426700⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS STL CAMPUS-AAR AAI UNIV-LILLE

105 Consultations

288 Téléchargements

Towards an n-grammar of English

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager