From Inter-Annotation to Intra-Publication Inconsistency

Abstract : What are effective ways to help people with chronic illness, e.g. diabetes and heart disease? Computational linguistics relies on human-annotated data to train machine learners. Inconsistency among the human annotators must be carefully managed (otherwise, the annotations are useless in computation). How can this annotation process be made scalable? ABSTRACT Curing chronic illnesses and diseases requires the huge effort of collecting all available information on this matter and piecing it together with the aids of mathematical and computer modeling. Both phases of information collection and piecing together are prone to error. Errors may result from human annotation inconsistency, machine learning and parameterization when using supervised learning. On a different scale, published results that need to be collected may suffer from another kind of disagreement either due to varying experimental methodologies or assumptions. Here, we discuss these inconsistencies and disagreements in scientific literature and we investigate those of the inter-annotation of named entities in bioliterature from empirical perspectives.
Complete list of metadatas

Cited literature [15 references]  Display  Hide  Download

https://hal.sorbonne-universite.fr/hal-01245137
Contributor : Mihnea Tufis <>
Submitted on : Wednesday, March 16, 2016 - 12:31:30 AM
Last modification on : Thursday, March 21, 2019 - 1:07:45 PM
Long-term archiving on : Friday, June 17, 2016 - 10:02:33 AM

File

AbiHaidar_Tufis_IR14.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01245137, version 1

Citation

Alaa Abi Haidar, Mihnea Tufis, Jean Gabriel Ganascia. From Inter-Annotation to Intra-Publication Inconsistency. Inconsistency Robustness, 52, College Publications, pp.614, 2015, Studies in Logic, 978-1-84890-159-9. ⟨hal-01245137⟩

Share

Metrics

Record views

168

Files downloads

151