Hadoopizer : a cloud environment for bio-informatics data analysis

Anthony Bretaudeau 1 Olivier Sallou 1 Olivier Collin 1
1 SYMBIOSE - Biological systems and models, bioinformatics and sequences
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
Abstract : Biology is becoming a big data science, driven in particular by the sequencing technologies that have emerged in recent years. Cloud computing is one way to cope with the rapidly increasing volume of bioinformatics data. Here we present a private cloud environment deployed on the GenOuest bioinformatics platform. After an overview of the publicly available software for bioinformatics processing in the cloud, we present a new framework, Hadoopizer, a generic tool for parallelising bioinformatics analyses in the cloud using the MapReduce paradigm. These developments are available online at http://genocloud.genouest.org

Contributor : Ccsd Sciencesconf.Org
Submitted on : Monday, December 17, 2012 - 3:08:58 PM
Last modification on : Friday, November 16, 2018 - 1:22:25 AM
Long-term archiving on : Sunday, December 18, 2016 - 3:50:31 AM


  • HAL Id : hal-00766066, version 1


Anthony Bretaudeau, Olivier Sallou, Olivier Collin. Hadoopizer : a cloud environment for bio-informatics data analysis. journées scientifiques mésocentres et France Grilles 2012, Oct 2012, Paris, France. ⟨hal-00766066⟩


