The GAG database: A new resource to gather genomic annotation cross-references

Thomas Obadia 1, 2 Olivier Sallou 3 Marion Ouedraogo 1 Grégory Guernec 1, 4 Frédéric Lecerf 1
3 Plateforme bioinformatique GenOuest [Rennes]
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, UR1 - Université de Rennes 1, Plateforme Génomique Santé Biogenouest®, Inria Rennes – Bretagne Atlantique
Abstract : Several institutions provide genomic annotation data, and therefore these data show a significant segmentation and redundancy. Public databases allow access, through their own methods, to genomic and proteomic sequences and related annotation. Although some cross-reference tables are available, they don't cover the complete datasets provided by these databases. The Genomic Annotation Gathering project intends to unify annotation data provided by GenBank and Ensembl. We introduce an intra-species, cross-bank method. Generated results provide an enriched set of cross- references. This method allows for identifying an average of 30% of new cross-references that can be integrated to other utilities dedicated to analyzing related annotation data. By using only sequence comparison, we are able to unify two datasets that previously didn't share any stable cross-bank accession method. The whole process is hosted by the GenOuest platform to provide public access to newly generated cross-references and to allow for regular updates (
Document type :
Journal articles
Liste complète des métadonnées
Contributor : Archive Ouverte Prodinra <>
Submitted on : Thursday, October 1, 2015 - 11:49:56 AM
Last modification on : Friday, April 12, 2019 - 4:23:21 PM



Thomas Obadia, Olivier Sallou, Marion Ouedraogo, Grégory Guernec, Frédéric Lecerf. The GAG database: A new resource to gather genomic annotation cross-references. Gene, Elsevier, 2013, 527 (2), pp.503-509. ⟨10.1016/j.gene.2013.06.063⟩. ⟨hal-01207750⟩



Record views