Skip to Main content Skip to Navigation
Journal articles

High-quality genome (re)assembly using chromosomal contact data

Abstract : Closing gaps in draft genome assemblies can be costly and time-consuming, and published genomes are therefore often left 'unfinished.' Here we show that genome-wide chromosome conformation capture (3C) data can be used to overcome these limitations, and present a computational approach rooted in polymer physics that determines the most likely genome structure using chromosomal contact data. This algorithm—named GRAAL—generates high-quality assemblies of genomes in which repeated and duplicated regions are accurately represented and offers a direct probabilistic interpretation of the computed structures. We first validated GRAAL on the reference genome of Saccharomyces cerevisiae, as well as other yeast isolates, where GRAAL recovered both known and unknown complex chromosomal structural variations. We then applied GRAAL to the finishing of the assembly of Trichoderma reesei and obtained a number of contigs congruent with the know karyotype of this species. Finally, we showed that GRAAL can accurately reconstruct human chromosomes from either fragments generated in silico or contigs obtained from de novo assembly. In all these applications, GRAAL compared favourably to recently published programmes implementing related approaches.
Complete list of metadata

Cited literature [46 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01138788
Contributor : Françoise Bertrand Connect in order to contact the contributor
Submitted on : Wednesday, April 8, 2015 - 11:22:50 AM
Last modification on : Tuesday, January 4, 2022 - 2:02:03 PM
Long-term archiving on: : Tuesday, April 18, 2017 - 9:38:06 AM

File

0024535-03.pdf
Publisher files allowed on an open archive

Licence


Distributed under a Creative Commons Attribution 4.0 International License

Identifiers

Citation

Hervé Marie-Nelly, Martial Marbouty, Axel Cournac, Jean-François Flot, Gianni Liti, et al.. High-quality genome (re)assembly using chromosomal contact data. Nature Communications, Nature Publishing Group, 2014, 5, pp.5695. ⟨10.1038/ncomms6695⟩. ⟨hal-01138788⟩

Share

Metrics

Les métriques sont temporairement indisponibles