Structure patterns in Information Extraction: a multilingual solution? - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2010

Structure patterns in Information Extraction: a multilingual solution?

Résumé

IE systems nowadays work very well, but they are mostly monolingual and difficult to convert to other languages. We maybe have then to stop thinking only with traditional pattern-based approaches. Our project, PULS, makes epidemic surveillance through analysis of On-Line News in collaboration with MedISys, developed at the European Commission's Joint Research Centre (EC-JRC). PULS had only an English pattern-based system and we worked on a pilot study on French to prepare a multilingual extension. We will present here why we chose to ignore classical approaches and how we can use it with a mainly language-independent based only on discourse properties of press articlestructure. Our results show a precision of 87% and a recall of 93%. And we have good reasons to think that this approach will also be efficient for other languages.
Fichier principal
Vignette du fichier
amict09.pdf (55.33 Ko) Télécharger le fichier
Origine : Fichiers éditeurs autorisés sur une archive ouverte
Loading...

Dates et versions

hal-00605691 , version 1 (04-07-2011)

Identifiants

  • HAL Id : hal-00605691 , version 1

Citer

Gaël Lejeune. Structure patterns in Information Extraction: a multilingual solution?. Advances in Methods of Information and Communication Technology, May 2009, Petrozavodsk, Russia. pp.105-111. ⟨hal-00605691⟩
91 Consultations
62 Téléchargements

Partager

Gmail Facebook X LinkedIn More