Conditional Approximate Bayesian Computation: A New Approach for Across-Site Dependency in High-Dimensional Mutation–Selection Models

Abstract : A key question in molecular evolutionary biology concerns the relative roles of mutation and selection in shaping genomic data. Moreover, features of mutation and selection are heterogeneous along the genome and over time. Mechanistic codon substitution models based on the mutation-selection framework are promising approaches to separating these effects. In practice, however, several complications arise, since accounting for such heterogeneities often implies handling models of high dimensionality (e.g., amino acid preferences), or leads to across-site dependence (e.g., CpG hypermutability), making the likelihood function intractable. Approximate Bayesian Computation (ABC) could address this latter issue. Here, we propose a new approach, named Conditional ABC (CABC), which combines the sampling efficiency of MCMC and the flexibility of ABC. To illustrate the potential of the CABC approach, we apply it to the study of mammalian CpG hypermutability based on a new mutation-level parameter implying dependence across adjacent sites, combined with site-specific purifying selection on amino-acids captured by a Dirichlet process. Our proof-of-concept of the CABC methodology opens new modeling perspectives. Our application of the method reveals a high level of heterogeneity of CpG hypermutability across loci and mild heterogeneity across taxonomic groups; and finally, we show that CpG hypermutability is an important evolutionary factor in rendering relative synonymous codon usage. All source code is available as a GitHub repository (https://github.com/Simonll/LikelihoodFreePhylogenetics.git).
Complete list of metadatas

Cited literature [119 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-02414599
Contributor : Hervé Philippe <>
Submitted on : Monday, December 16, 2019 - 4:43:05 PM
Last modification on : Thursday, December 19, 2019 - 2:22:03 AM

File

2018-MolBiolEvol-LaurinLemay-3...
Publication funded by an institution

Identifiers

Collections

Citation

Simon Laurin-Lemay, Nicolas Rodrigue, Nicolas Lartillot, Hervé Philippe. Conditional Approximate Bayesian Computation: A New Approach for Across-Site Dependency in High-Dimensional Mutation–Selection Models. Molecular Biology and Evolution, Oxford University Press (OUP), 2018, 35 (11), pp.2819-2834. ⟨10.1093/molbev/msy173⟩. ⟨hal-02414599⟩

Share

Metrics

Record views

19

Files downloads

12