Towards a piRNA prediction using multiple kernel fusion and support vector machine

Abstract : MOTIVATION: Piwi-interacting RNAs (piRNAs) are the most recently discovered and least investigated class of Argonaute/Piwi protein-interacting small non-coding RNAs. They are best known for protecting the genome from invasive transposable elements, but recent discoveries suggest that they are also involved in the pathophysiology of diseases such as cancer. Identifying them is therefore an important task that calls for computational methods. However, the lack of conserved piRNA sequences and structural elements makes this identification difficult.

RESULTS: In the present study, we propose a new modular and extensible machine learning method for piRNA identification, based on multiple kernels and a support vector machine (SVM) classifier. Very few piRNA features are known to date. The multiple-kernel approach allows heterogeneous piRNA features to be edited, added or removed in a modular manner according to their relevance in a given species. Our algorithm combines previously identified features [sequence features (k-mer motifs and a uridine at the first position) and a piRNA cluster feature] with a new telomere/centromere vicinity feature. These features are heterogeneous, and the kernels provide a unified representation for them. The proposed algorithm, named piRPred, gives promising results on Drosophila and human data and outperforms previously published piRNA identification algorithms.
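The kernel-fusion idea described in the abstract can be illustrated with a small sketch. This is not the piRPred implementation: the function names, the cosine-normalised k-mer kernel, the equal weights and the toy sequences are all assumptions made for illustration, and only two of the four features (k-mer motifs and a uridine at the first position) are covered. It shows how heterogeneous features, one count-based and one binary, can each be given their own kernel and then fused by a non-negative weighted sum, which is again a valid kernel.

```python
from collections import Counter
from math import sqrt


def kmer_counts(seq, k=3):
    """Count overlapping k-mers in an RNA sequence."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def spectrum_kernel(a, b, k=3):
    """Cosine-normalised k-mer spectrum kernel (illustrative choice)."""
    ca, cb = kmer_counts(a, k), kmer_counts(b, k)
    dot = sum(ca[m] * cb[m] for m in ca if m in cb)
    norm = sqrt(sum(v * v for v in ca.values()) * sum(v * v for v in cb.values()))
    return dot / norm if norm else 0.0


def first_u_kernel(a, b):
    """1.0 when both sequences agree on starting (or not) with a uridine."""
    return 1.0 if (a[:1] == "U") == (b[:1] == "U") else 0.0


def fused_kernel(a, b, weights=(0.5, 0.5)):
    """Weighted sum of the per-feature kernels; the weights are assumed."""
    w_kmer, w_u = weights
    return w_kmer * spectrum_kernel(a, b) + w_u * first_u_kernel(a, b)


def gram_matrix(seqs, kernel):
    """Pairwise kernel values for a list of sequences."""
    return [[kernel(a, b) for b in seqs] for a in seqs]


# Made-up toy sequences, not real piRNAs.
seqs = [
    "UGACGUAGCUAGCUAGGAUCGAUCG",
    "UAGCUAGGAUCCGAUUGACGUAGCA",
    "GCCAUUAGGCAUCGGAUUCAGCAUG",
]
K = gram_matrix(seqs, fused_kernel)
```

A Gram matrix built this way could be handed to any SVM implementation that accepts a precomputed kernel, for example scikit-learn's `SVC(kernel="precomputed")`. Adding or dropping a feature then amounts to adding or dropping one term in `fused_kernel`, which is the modularity the abstract refers to.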
Document type :
Journal articles

https://hal.archives-ouvertes.fr/hal-01653802
Contributor : Frédéric Davesne
Submitted on : Friday, December 1, 2017 - 8:35:50 PM
Last modification on : Monday, October 28, 2019 - 10:50:22 AM

Citation

Jocelyn Brayet, Farida Zehraoui, Laurence Jeanson-Leh, David Israeli, Fariza Tahi. Towards a piRNA prediction using multiple kernel fusion and support vector machine. Bioinformatics, Oxford University Press (OUP), 2014, 30 (17), pp.i364--i370. ⟨10.1093/bioinformatics/btu441⟩. ⟨hal-01653802⟩
