Skip to Main content Skip to Navigation
New interface
Journal articles

Swarm v3: towards tera-scale amplicon clustering

Abstract : Motivation: Previously we presented swarm, an open-source amplicon clustering program that produces fine-scale molecular operational taxonomic units (OTUs) that are free of arbitrary global clustering thresholds. Here we present swarm v3 to address issues of contemporary datasets that are growing towards tera-byte sizes.Results: When compared to previous swarm versions, swarm v3 has modernized C ++ source code, reduced memory footprint by up to 50%, optimized CPU-usage and multithreading (more than 7 times faster with default parameters), and it has been extensively tested for its robustness and logic.Availability: Source code and binaries are available at information: Supplementary data are available at Bioinformatics online.
Document type :
Journal articles
Complete list of metadata
Contributor : Gestionnaire HAL-SU Connect in order to contact the contributor
Submitted on : Monday, July 12, 2021 - 1:07:03 PM
Last modification on : Wednesday, October 19, 2022 - 5:05:55 AM
Long-term archiving on: : Wednesday, October 13, 2021 - 6:53:12 PM


Publication funded by an institution


Distributed under a Creative Commons Attribution 4.0 International License



Frédéric Mahé, Lucas Czech, Alexandros Stamatakis, Christopher Quince, Colomban de Vargas, et al.. Swarm v3: towards tera-scale amplicon clustering. Bioinformatics, 2022, 38 (1), pp.267-269. ⟨10.1093/bioinformatics/btab493⟩. ⟨hal-03284105⟩



Record views


Files downloads