Aevol-4b: Toward a new simulation platform to benchmark phylogenetic tools

Nicolas Comte 1, 2 Vincent Liard 1, 2 Carole Knibbe 1, 2 Guillaume Beslon 1, 2
2 BEAGLE - Artificial Evolution and Computational Biology
LBBE - Laboratoire de Biométrie et Biologie Evolutive - UMR 5558, Inria Grenoble - Rhône-Alpes, LIRIS - Laboratoire d'InfoRmatique en Image et Systèmes d'information
Abstract : Aevol (www.aevol.fr) is a computational platform that allows for the study and manipulation of populations of digital organisms evolving under different conditions. Using Aevol simulations, one can better understand evolutionary forces and mechanisms leading to specific genome and transcriptome structures, as well as indirect selection pressures involved in the evolution of cooperation and genetic information transfer. Recently, we used aevol as a benchmarking tool. Indeed, Molecular evolutionary methods and tools are difficult to validate, as we have almost no direct access to ancient molecules. Inference methods may be tested with simulated data but this requires that the inference methods and the simulation be design independently (Biller et al., Computation in Europe 2016; Biller et al., Jobim 2016). Using aevol we can simulate perfectly characterized phylogenies and obtain a final population that evolved accordingly. Then we can use this final population to try to recover the initial phylogeny using various tools and assess their efficiency in doing so. This approach has recently been applied to test various estimators of inversion distance, revealing their limits and suggesting important improvement directions (Biller et al., Genome Biology and Evolution 2016). However, current aevol structure – more specifically the use of a binary representation for the genomic sequence – strongly limits its usability as a benchmarking tool. That is why we recently started the development of a new version of the software in which the genome sequence will use a four-nucleotides code and the translation from genetic sequence to polypeptide sequences will use the extant genetic code to map the 4-bases alphabet to the 20-amino-acids one. Although the development of this new version is in its infancy a first prototype has been developed and we would like to discuss the main modelling choices with the Alphy community that will be the potential users of the generated benchmarks. In particular, in this prototype the genotype-to-phenotype map is be based a mathematical description of traits under selection and on A.D. Solis (Proteins, 2015) classification of amino-acids, two crucial modelling choices that deserve discussion before we start final software implementation.
Document type :
Conference papers
Complete list of metadatas

https://hal.archives-ouvertes.fr/hal-01569078
Contributor : Guillaume Beslon <>
Submitted on : Wednesday, July 26, 2017 - 11:45:45 AM
Last modification on : Tuesday, February 11, 2020 - 1:27:26 AM

File

abstract.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01569078, version 1

Citation

Nicolas Comte, Vincent Liard, Carole Knibbe, Guillaume Beslon. Aevol-4b: Toward a new simulation platform to benchmark phylogenetic tools. ALPHY (ALignments and PHYlogeny), Feb 2017, Villeurbanne, France. ⟨hal-01569078⟩

Share

Metrics

Record views

569

Files downloads

85