Skip to Main content Skip to Navigation
Journal articles

DGINN, an automated and highly-flexible pipeline for the detection of genetic innovations on protein-coding genes

Abstract : Adaptive evolution has shaped major biological processes. Finding the protein-coding genes and the sites that have been subjected to adaptation during evolutionary time is a major endeavor. However, very few methods fully automate the identification of positively selected genes, and widespread sources of genetic innovations as gene duplication and recombination are absent from most pipelines. Here, we developed DGINN, a highly-flexible and public pipeline to Detect Genetic INNovations and adaptive evolution in protein-coding genes. DGINN automates, from a gene’s sequence, all steps of the evolutionary analyses necessary to detect the aforementioned innovations, including the search for homologues in databases, assignation of orthology groups, identification of duplication and recombination events, as well as detection of positive selection using five different methods to increase precision and ranking of genes when a large panel is analyzed. DGINN was validated on nineteen genes with previously-characterized evolutionary histories in primates, including some engaged in host-pathogen arms-races. The results obtained with DGINN confirm and also expand results from the literature, establishing DGINN as an efficient tool to automatically detect genetic innovations and adaptive evolution in diverse datasets, from the user’s gene of interest to a large gene list in any species range.
Document type :
Journal articles
Complete list of metadata

https://hal.archives-ouvertes.fr/hal-03002803
Contributor : Lucie Etienne <>
Submitted on : Thursday, November 12, 2020 - 11:12:51 PM
Last modification on : Tuesday, September 21, 2021 - 11:15:58 AM

Links full text

Identifiers

Citation

Lea Picard, Quentin Ganivet, Omran Allatif, Andrea Cimarelli, Laurent Guéguen, et al.. DGINN, an automated and highly-flexible pipeline for the detection of genetic innovations on protein-coding genes. Nucleic Acids Research, Oxford University Press, 2020, 48 (18), pp.e103-e103. ⟨10.1093/nar/gkaa680⟩. ⟨hal-03002803v2⟩

Share

Metrics

Record views

83