Enhanced discriminative models with tree kernels and unsupervised training for entity detection

Lina Maria Rojas Barahona 1,*, Christophe Cerisara 1,*
* Corresponding author
1 SYNALP - Natural Language Processing : representations, inference and semantics
LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract: This work explores two approaches to improving the discriminative models commonly used for entity detection: tree kernels and unsupervised training. Feature-rich classifiers have been widely adopted by the Natural Language Processing (NLP) community because of their powerful modeling capacity and their support for correlated features, which make it possible to separate the expert task of designing features from the core learning method. The first proposed approach trains fast and efficient linear models without supervision, using a recently proposed approximation of the classifier risk; this appealing method provably converges towards the minimum risk without any labeled corpus. In the second proposed approach, tree kernels are used with support vector machines to exploit dependency structures for entity detection, which relieves designers of the burden of carefully designing rich syntactic features by hand. We study both approaches on the same task and corpus and show that they offer interesting alternatives to supervised learning for entity recognition.
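To make the tree-kernel idea concrete, here is a minimal sketch of how a kernel can compare two trees by counting the fragments they share, without any hand-written syntactic features. The abstract does not say which tree kernel the authors use, so this sketch assumes a Collins-Duffy style subset-tree kernel for illustration. The tuple encoding of trees, the function names and the `decay` parameter are all hypothetical.

```python
# Illustrative subset-tree kernel (an assumption, not necessarily the paper's kernel).
# A tree is a nested tuple: (label, child_1, ..., child_k); a leaf is (label,).

def nodes(tree):
    """Yield every internal node, i.e. every node with at least one child."""
    label, *children = tree
    if children:
        yield tree
        for child in children:
            yield from nodes(child)

def production(node):
    """Return the node label together with the labels of its direct children."""
    label, *children = node
    return (label, tuple(child[0] for child in children))

def common_fragments(n1, n2, decay=1.0):
    """Weighted count of the tree fragments rooted at both n1 and n2."""
    if production(n1) != production(n2):
        return 0.0
    score = decay
    for c1, c2 in zip(n1[1:], n2[1:]):
        # Only pairs of internal children can extend a shared fragment.
        if len(c1) > 1 and len(c2) > 1:
            score *= 1.0 + common_fragments(c1, c2, decay)
    return score

def tree_kernel(t1, t2, decay=1.0):
    """Sum the shared-fragment counts over all pairs of internal nodes."""
    return sum(common_fragments(a, b, decay)
               for a in nodes(t1) for b in nodes(t2))

# Two toy trees that differ only in the structure of the NP subtree.
t1 = ("S", ("NP", ("D",), ("N",)), ("VP", ("V",)))
t2 = ("S", ("NP", ("N",)), ("VP", ("V",)))
print(tree_kernel(t1, t1), tree_kernel(t1, t2))
```

A Gram matrix of such kernel values could then be handed to any SVM implementation that accepts precomputed kernels, for instance scikit-learn's `SVC(kernel="precomputed")`. The recursion above is written for readability; a practical implementation would memoize node pairs.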
Document type:
Conference paper
6th. International Conference on Information Systems & Economic Intelligence (SIIE), Feb 2015, Hammamet, Tunisia. ISBN 978-1-4799-8934-8, 2015, Proc. of the 6th International Conference on Information Systems and Economic Intelligence (SIIE)
Cited literature [24 references]

https://hal.archives-ouvertes.fr/hal-01184847
Contributor: Christophe Cerisara
Submitted on: Tuesday, 18 August 2015 - 15:07:56
Last modified on: Tuesday, 24 April 2018 - 13:29:44
Document(s) archived on: Thursday, 19 November 2015 - 10:14:47

File

weaklysupnumint.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01184847, version 1

Citation

Lina Maria Rojas Barahona, Christophe Cerisara. Enhanced discriminative models with tree kernels and unsupervised training for entity detection. 6th. International Conference on Information Systems & Economic Intelligence (SIIE), Feb 2015, Hammamet, Tunisia. ISBN 978-1-4799-8934-8, 2015, Proc. of the 6th International Conference on Information Systems and Economic Intelligence (SIIE). 〈hal-01184847〉
