What About Sequential Data Mining Techniques to Identify Linguistic Patterns for Stylistics? - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2012

What About Sequential Data Mining Techniques to Identify Linguistic Patterns for Stylistics?

Résumé

In this paper, we study the use of data mining techniques for stylistic analysis, from a linguistic point of view, by considering emerging sequential patterns. First, we show that mining sequential patterns of words with gap constraints gives new relevant linguistic patterns with respect to patterns built on n-grams. Then, we investigate how sequential patterns of itemsets can provide more generic linguistic patterns. We validate our approach from a linguistic point of view by conducting experiments on three corpora of various types of French texts (Poetry, Letters, and Fictions). By considering more particularly poetic texts, we show that characteristic linguistic patterns can be identified using data mining techniques. We also discuss how to improve our proposed approach so that it can be used more efficiently for linguistic analyses.
Fichier principal
Vignette du fichier
cicling2012.pdf (260.25 Ko) Télécharger le fichier
Origine : Fichiers éditeurs autorisés sur une archive ouverte

Dates et versions

hal-00675578 , version 1 (01-03-2012)

Identifiants

Citer

Solen Quiniou, Peggy Cellier, Thierry Charnois, Dominique Legallois. What About Sequential Data Mining Techniques to Identify Linguistic Patterns for Stylistics?. CICLing 2012: Computational Linguistics and Intelligent Text Processing, Mar 2012, New Delhi, India. pp.166-177, ⟨10.1007/978-3-642-28604-9_14⟩. ⟨hal-00675578⟩
547 Consultations
1631 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More