Skip to Main content Skip to Navigation
Journal articles

Complete Assembly of Circular and Chloroplast Genomes Based on Global Optimization

Rumen Andonov 1, * Hristo Djidjev 2 Sébastien François 3 Dominique Lavenier 1
* Corresponding author
1 GenScale - Scalable, Optimized and Parallel Algorithms for Genomics
Inria Rennes – Bretagne Atlantique , IRISA-D7 - GESTION DES DONNÉES ET DE LA CONNAISSANCE
Abstract : This paper focuses on the last two stages of genome assembly, namely, scaffolding and gap-filling, and shows that they can be solved as part of a single optimization problem. Our approach is based on modeling genome assembly as a problem of finding a simple path in a specific graph that satisfies as many distance constraints as possible encoding the insert-size information. We formulate it as a mixed-integer linear programming (MILP) problem and apply an optimization solver to find the exact solutions on a benchmark of chloroplasts. We show that the presence of repetitions in the set of unitigs is the main reason for the existence of multiple equivalent solutions that are associated to alternative subpaths. We also describe two sufficient conditions and we design efficient algorithms for identifying these subpaths. Comparisons of the results achieved by our tool with the ones obtained with recent assemblers are presented.
Complete list of metadata

Cited literature [27 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-02151798
Contributor : Rumen Andonov <>
Submitted on : Monday, June 10, 2019 - 11:16:50 AM
Last modification on : Thursday, February 18, 2021 - 10:53:34 AM

File

HAL_JBCB-main.pdf
Files produced by the author(s)

Identifiers

Citation

Rumen Andonov, Hristo Djidjev, Sébastien François, Dominique Lavenier. Complete Assembly of Circular and Chloroplast Genomes Based on Global Optimization. Journal of Bioinformatics and Computational Biology, World Scientific Publishing, In press, pp.1-28. ⟨10.1142/S0219720019500148⟩. ⟨hal-02151798⟩

Share

Metrics

Record views

98

Files downloads

564