MethyLasso: a segmentation approach to analyze DNA methylation patterns and identify differentially methylation regions from whole-genome datasets
Résumé
DNA methylation is an epigenetic mark involved in the regulation of gene expression and patterns of DNA methylation anticorrelates with chromatin accessibility and transcription factor binding. DNA methylation can be profiled at the single cytosine resolution in the whole genome and has been performed in many cell types and conditions. Computational approaches are then essential to study DNA methylation patterns in a single condition or capture dynamic changes of DNA methylation levels across conditions. Towards this goal, we developed MethyLasso, a new approach based on the segmentation of DNA methylation data, that enables the identification of low-methylated regions (LMRs), unmethylated regions (UMRs), DNA methylation valleys (DMVs) and partially methylated domains (PMDs) in a single condition as well as differentially methylated regions (DMRs) between two conditions. We performed a rigorous benchmarking comparing existing approaches by evaluating the number, size, level of DNA methylation, boundaries, CpG content and coverage of the regions using several real datasets as well as the sensitivity and precision of the approaches using simulated data and show that MethyLasso performs best overall. MethyLasso is freely available at https://github.com/abardet/methylasso .
Domaines
Sciences du Vivant [q-bio]
Origine : Fichiers produits par l'(les) auteur(s)
Licence : CC BY - Paternité
Licence : CC BY - Paternité