A de novo approach to disentangle partner identity and function in holobiont systems

Abstract: This work addresses the problem of grouping long reads by gene of origin in a whole-transcriptome sequencing data set. Long-read sequencing produces sequences several thousand base pairs long, although with a high error rate compared with short reads. Long reads can cover full-length RNA transcripts and are therefore of great interest for completing references. However, the literature lacks tools to cluster such data de novo, in particular for Oxford Nanopore Technologies reads. We therefore propose a novel algorithm based on community detection, together with its implementation. Since the solution is reference-free (de novo), it is especially well suited to non-model species. We demonstrate that it performs well on a real mouse data set. When a reference is available, we show that it stands as an alternative to mapping. In addition, we show that quick assessment of gene expression is a straightforward use case of our solution.
Document type:
Journal article
Microbiome, BioMed Central, 2018, pp.1-35. 〈10.1101/221424〉

Contributor: Camille Marchet
Submitted on: Thursday, September 13, 2018 - 10:09:09
Last modified on: Wednesday, October 17, 2018 - 16:13:56





Arnaud Meng, Camille Marchet, Erwan Corre, Pierre Peterlongo, Adriana Alberti, et al. A de novo approach to disentangle partner identity and function in holobiont systems. Microbiome, BioMed Central, 2018, pp.1-35. 〈10.1101/221424〉. 〈hal-01643153v2〉


