| HAL : hal-00498045, version 1 |
| Fiche détaillée | Récupérer au format |
|
|
|
|
| Disambiguation Tools for NooJ |
|
|
| Max Silberztein 1 |
|
|
| NooJ Collaboration(s) |
|
|
| (06/06/2008) |
|
|
| When NooJ performs an automatic lexical analysis of corpora, it recognizes five types of atomic linguistic units (ALUs) and represents them as annotations stored inside each text's annotation structure (TAS). Unfortunately, the massive level of ambiguities generated by each of the five corresponding parsers produces a TAS far too heavy for most corpus linguistics applications. In consequence, most users' queries produce too many incorrect results. In order to provide a working solution for NooJ's lexical parser's behavior, we have implemented a new set of tools specifically designed to deal with unwanted ambiguities in corpora and texts: automatic and semi-automatic tools as well as a manual access to edit the TAS. |
|
|
|
|
|
|
|
|
|
|
| 1 : | Le LAboratoire de SEmio - Linguistique Didactique et Informatique (LASELDI) |
| Université de Franche-Comté | |
|
|
|
|
|
|
|
|
| Domaine | : | Sciences cognitives/Linguistique Informatique/Informatique et langage Sciences de l'Homme et Société/Linguistique |
|
|
| NooJ – Disambiguation – Local Grammars |
|
|
| Liste des fichiers attachés à ce document : | |||||
|
|
|
| hal-00498045, version 1 | |
| http://hal.archives-ouvertes.fr/hal-00498045 | |
| oai:hal.archives-ouvertes.fr:hal-00498045 | |
| Contributeur : Max Silberztein | |
| Soumis le : Mardi 6 Juillet 2010, 15:02:56 | |
| Dernière modification le : Mardi 6 Juillet 2010, 16:54:59 | |