Skip to Main content Skip to Navigation
Conference papers

Domain Adaptation for Named Entity Recognition Using CRFs

Abstract : In this paper we explain how we created a labelled corpus in English for a Named Entity Recognition (NER) task from multi-source and multi-domain data, for an industrial partner. We explain the specificities of this corpus with examples and describe some baseline experiments. We present some results of domain adaptation on this corpus using a labelled Twitter corpus (Ritter et al., 2011). We tested a semi-supervised method from (Garcia-Fernandez et al., 2014) combined with a supervised domain adaptation approach proposed in (Raymond and Fayolle, 2010) for machine learning experiments with CRFs (Conditional Random Fields). We use the same technique to improve the NER results on the Twitter corpus (Ritter et al., 2011). Our contributions thus consist in an industrial corpus creation and NER performance improvements.
Document type :
Conference papers
Complete list of metadata

Cited literature [12 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01473397
Contributor : Tian Tian Connect in order to contact the contributor
Submitted on : Tuesday, February 21, 2017 - 5:58:46 PM
Last modification on : Friday, October 15, 2021 - 1:40:08 PM
Long-term archiving on: : Monday, May 22, 2017 - 4:10:16 PM

File

1102_Paper.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01473397, version 1

Collections

Citation

Tian Tian, Marco Dinarelli, Isabelle Tellier, Pedro Dias Cardoso. Domain Adaptation for Named Entity Recognition Using CRFs. LREC 2016, May 2016, Portoroz, Slovenia. ⟨hal-01473397⟩

Share

Metrics

Record views

348

Files downloads

204