Skip to Main content Skip to Navigation
Journal articles

Comparison of concept recognizers for building the Open Biomedical Annotator

Abstract : The National Center for Biomedical Ontology (NCBO) is developing a system for automated, ontology-based access to online biomedical resources. The system's indexing workflow processes the text metadata of diverse resources such as datasets from GEO and ArrayExpress to annotate and index them with concepts from appropriate ontologies. This indexing requires the use of a concept-recognition tool to identify ontology concepts in the resource's textual metadata. In this paper, we present a comparison of two concept recognizers – NLM's MetaMap and the University of Michigan's Mgrep. We utilize a number of data sources and dictionaries to evaluate the concept recognizers in terms of precision, recall, speed of execution, scalability and customizability. Our evaluations demonstrate that Mgrep has a clear edge over MetaMap for large-scale service oriented applications. Based on our analysis we also suggest areas of potential improvements for Mgrep. We have subsequently used Mgrep to build the Open Biomedical Annotator service. The Annotator service has access to a large dictionary of biomedical terms derived from the United Medical Language System (UMLS) and NCBO ontologies. The Annotator also leverages the hierarchical structure of the ontologies and their mappings to expand annotations. The Annotator service is available to the community as a REST Web service for creating ontology-based annotations of their data.
Complete list of metadata

Cited literature [22 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-00492026
Contributor : Clement Jonquet Connect in order to contact the contributor
Submitted on : Monday, June 14, 2010 - 8:12:23 PM
Last modification on : Thursday, February 25, 2021 - 9:46:04 AM
Long-term archiving on: : Tuesday, September 14, 2010 - 8:41:12 PM

File

Article-BMCBioInfo09-Mgrep_Sha...
Publisher files allowed on an open archive

Identifiers

Citation

Nigam Shah, Nipun Bhatia, Clement Jonquet, Daniel Rubin, Annie Chiang, et al.. Comparison of concept recognizers for building the Open Biomedical Annotator. BMC Bioinformatics, BioMed Central, 2009, 10 (9:S14), pp.9:S14. ⟨10.1186/1471-2105-10-S9-S14⟩. ⟨hal-00492026⟩

Share

Metrics

Record views

192

Files downloads

401