Error Mining with Suspicion Trees: Seeing the Forest for the Trees

Shashi Narayan; Claire Gardent

Communication Dans Un Congrès Année : 2012

Error Mining with Suspicion Trees: Seeing the Forest for the Trees

(1) , (1)

Shashi Narayan

Fonction : Auteur
PersonId : 767539
IdRef : 182508757

Natural Language Processing : representations, inference and semantics

Claire Gardent

Fonction : Auteur
PersonId : 3949
IdHAL : claire-gardent
ORCID : 0000-0002-3805-6662
IdRef : 034104593

Natural Language Processing : representations, inference and semantics

Résumé

In recent years, error mining approaches have been proposed to identify the most likely sources of errors in symbolic parsers and generators. However the techniques used generate a flat list of suspicious forms ranked by decreasing order of suspicion. We introduce a novel algorithm that structures the output of error mining into a tree (called, suspicion tree) highlighting the relationships between suspicious forms. We illustrate the impact of our approach by applying it to detect and analyse the most likely sources of failure in surface realisation; and we show how the suspicion tree built by our algorithm helps presenting the errors identified by error mining in a linguistically meaningful way thus providing better support for error analysis. The right frontier of the tree highlights the relative importance of the main error cases while the subtrees of a node indicate how a given error case divides into smaller more specific cases

Domaines

Traitement du texte et du document

Fichier principal

nargar_coling12_error_mining_final.pdf (146.95 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Claire Gardent : Connectez-vous pour contacter le contributeur

https://hal.science/hal-00768227

Soumis le : vendredi 21 décembre 2012-09:22:39

Dernière modification le : lundi 11 septembre 2023-17:41:18

Archivage à long terme le : vendredi 22 mars 2013-03:45:44

Dates et versions

hal-00768227 , version 1 (21-12-2012)

Identifiants

HAL Id : hal-00768227 , version 1

Citer

Shashi Narayan, Claire Gardent. Error Mining with Suspicion Trees: Seeing the Forest for the Trees. 24th International Conference on Computational Linguistics, Dec 2012, Mumbai, India. pp.60-73. ⟨hal-00768227⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA UNIV-LORRAINE LORIA LORIA-NLPKD

415 Consultations

76 Téléchargements

Error Mining with Suspicion Trees: Seeing the Forest for the Trees

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager