Journal article, Bioinformatics, Year: 2020

Tally-2.0: upgraded validator of tandem repeat detection in protein sequences

Abstract

Proteins containing tandem repeats (TRs) are abundant, frequently fold into elongated nonglobular structures and perform vital functions. A number of computational tools have been developed to detect TRs in protein sequences. The blurred boundary between imperfect TR motifs and non-repetitive sequences has created a need to validate the detected TRs. Tally-2.0 is a scoring tool, based on a machine learning approach, that validates the results of TR detection. It was upgraded by using improved training datasets and additional machine learning features. Tally-2.0 achieves 93% sensitivity, 83% specificity and an Area Under the Receiver Operating Characteristic Curve of 95%.
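For readers unfamiliar with the three metrics quoted in the abstract, the following is a minimal sketch of how sensitivity, specificity and the area under the ROC curve are conventionally computed from classifier scores. It is not Tally-2.0's code: the function names, the toy scores and the 0.5 threshold are all hypothetical, chosen only to illustrate the definitions, and the values are not data from the paper.

```python
def sensitivity(tp: int, fn: int) -> float:
    """Fraction of true TRs that the validator accepts: TP / (TP + FN)."""
    return tp / (tp + fn)


def specificity(tn: int, fp: int) -> float:
    """Fraction of false TRs that the validator rejects: TN / (TN + FP)."""
    return tn / (tn + fp)


def roc_auc(pos_scores, neg_scores) -> float:
    """AUC as the probability that a random positive outscores a random
    negative (ties count as half), i.e. the Mann-Whitney formulation."""
    wins = 0.0
    for p in pos_scores:
        for n in neg_scores:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos_scores) * len(neg_scores))


# Hypothetical toy scores, NOT taken from the paper.
true_tr_scores = [0.9, 0.8, 0.4]   # scores assigned to genuine TRs
false_tr_scores = [0.7, 0.3, 0.2]  # scores assigned to non-repetitive regions
threshold = 0.5                    # assumed decision threshold

tp = sum(s >= threshold for s in true_tr_scores)
fn = len(true_tr_scores) - tp
tn = sum(s < threshold for s in false_tr_scores)
fp = len(false_tr_scores) - tn

print(f"sensitivity = {sensitivity(tp, fn):.3f}")
print(f"specificity = {specificity(tn, fp):.3f}")
print(f"AUC         = {roc_auc(true_tr_scores, false_tr_scores):.3f}")
```

Sensitivity and specificity depend on the chosen threshold, whereas the AUC summarizes ranking quality over all thresholds, which is why a paper may report all three together.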
Main file
Perovic-Bioinformatics-2020HAL.pdf (1.05 MB)
Origin: Files produced by the author(s)

Dates and versions

hal-03089282, version 1 (30-12-2020)

Cite

Vladimir Perovic, Jeremy Leclercq, Neven Sumonja, Francois Richard, Nevena Veljkovic, et al. Tally-2.0: upgraded validator of tandem repeat detection in protein sequences. Bioinformatics, 2020, 36 (10), pp.3260-3262. ⟨10.1093/bioinformatics/btaa121⟩. ⟨hal-03089282⟩