Discovering linguistic patterns using sequence mining - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2012

Discovering linguistic patterns using sequence mining

Peggy Cellier
Thierry Charnois

Résumé

In this paper, we present a method based on data mining techniques to automatically discover linguistic patterns matching appositive qualifying phrases. We develop an algorithm mining sequential patterns made of itemsets with gap and linguistic constraints. The itemsets allow several kinds of information to be associated with one term. The advantage is the extraction of linguistic patterns with more expressiveness than the usual sequential patterns. In addition, the constraints enable to automatically prune irrelevant patterns. In order to manage the set of generated patterns, we propose a solution based on a partial ordering. A human user can thus easily validate them as relevant linguistic patterns.We illustrate the efficiency of our approach over two corpora coming from a newspaper
Fichier principal
Vignette du fichier
ACTI-BECHET-2012-1.pdf (725.73 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01023109 , version 1 (15-07-2014)

Identifiants

  • HAL Id : hal-01023109 , version 1

Citer

Nicolas Béchet, Peggy Cellier, Thierry Charnois, Bruno Crémilleux. Discovering linguistic patterns using sequence mining. 13th Int. Conf. on Intelligent Text Processing and Computational Linguistics (CICLing'12), Mar 2012, new delhi, India. pp.154-165. ⟨hal-01023109⟩
277 Consultations
448 Téléchargements

Partager

Gmail Facebook X LinkedIn More