Skip to Main content Skip to Navigation
Journal articles

The GAG database: A new resource to gather genomic annotation cross-references

Abstract : Several institutions provide genomic annotation data, and therefore these data show a significant segmentation and redundancy. Public databases allow access, through their own methods, to genomic and proteomic sequences and related annotation. Although some cross-reference tables are available, they don't cover the complete datasets provided by these databases. The Genomic Annotation Gathering project intends to unify annotation data provided by GenBank and Ensembl. We introduce an intra-species, cross-bank method. Generated results provide an enriched set of cross- references. This method allows for identifying an average of 30% of new cross-references that can be integrated to other utilities dedicated to analyzing related annotation data. By using only sequence comparison, we are able to unify two datasets that previously didn't share any stable cross-bank accession method. The whole process is hosted by the GenOuest platform to provide public access to newly generated cross-references and to allow for regular updates (
Document type :
Journal articles
Complete list of metadatas
Contributor : Archive Ouverte Prodinra <>
Submitted on : Thursday, October 1, 2015 - 11:49:56 AM
Last modification on : Friday, July 10, 2020 - 4:18:15 PM



Thomas Obadia, Olivier Sallou, Marion Ouedraogo, Grégory Guernec, Frédéric Lecerf. The GAG database: A new resource to gather genomic annotation cross-references. Gene, Elsevier, 2013, 527 (2), pp.503-509. ⟨10.1016/j.gene.2013.06.063⟩. ⟨hal-01207750⟩



Record views