Extraction of Syntactical Patterns from Parsing Trees

Jean-Gabriel Ganascia 1
1 APA - Apprentissage et Acquisition des connaissances
LIP6 - Laboratoire d'Informatique de Paris 6
Abstract : This paper presents experiments with a new algorithm designed to detect recurrent syntactical patterns in natural language texts. It first describes the pattern extraction algorithm, which is based on an edit model generalized to SOT (Stratified Ordered Trees). Then it focuses on experiments with French classical literature of the 18th and 19th centuries. It goes on to evaluate efficiency before providing some examples of recurrent patterns that are typical of an 18th century author, Madame de Lafayette.
Document type :
Conference papers

https://hal.archives-ouvertes.fr/hal-01561417
Contributor : Lip6 Publications
Submitted on : Wednesday, July 12, 2017 - 5:06:41 PM
Last modification on : Thursday, March 21, 2019 - 2:22:15 PM

Identifiers

  • HAL Id : hal-01561417, version 1

Citation

Jean-Gabriel Ganascia. Extraction of Syntactical Patterns from Parsing Trees. JADT 2002 - 6èmes Journées internationales d’Analyse statistique des Données Textuelles, Mar 2002, Saint-Malo, France. pp.277-288. ⟨hal-01561417⟩
