Journal article, Genomics, Year: 2008

Improved detection and annotation of transposable elements in sequenced genomes using multiple reference sequence sets.

Nicolas Buisine
Hadi Quesneville
Vincent Colot

Abstract

Transposable elements (TEs) are ubiquitous components of eukaryotic genomes that impact many aspects of genome function. TE detection in genomic sequences is typically performed using similarity searches against a set of reference sequences built from previously identified TEs. Here, we demonstrate that this process can be improved by designing reference sets that incorporate key aspects of the structure and evolution of TEs and by combining these sets with Repbase Update (RU), which is composed mainly of consensus sequences. Using the Arabidopsis genome as a test case, our approach leads to the detection of an extra 12.4% of TE sequences. These correspond to novel TE fragments as well as to the extension of TE fragments already detected by RU. Significantly, we find that TE detection could be readily optimized using only two reference sets, one containing true consensus sequences and the other mosaic sequences that capture the structural diversity of TE copies within a family.
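The gain reported above comes from two sources: novel TE fragments and extensions of fragments already detected with RU. A minimal sketch of how such a gain could be computed is given below. It assumes that annotations are available as half-open `(start, end)` coordinate pairs on a single sequence and that the gain is expressed relative to the bases covered by the baseline set alone. The function names and the toy coordinates are hypothetical and are not taken from the article, whose exact measure may differ.

```python
from typing import Iterable, List, Tuple

# Hypothetical representation: half-open [start, end) coordinates on one sequence.
Interval = Tuple[int, int]


def merge_intervals(intervals: Iterable[Interval]) -> List[Interval]:
    """Merge overlapping or touching intervals into a sorted, disjoint list."""
    merged: List[Interval] = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps (or touches) the previous interval: extend it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def covered_bases(intervals: Iterable[Interval]) -> int:
    """Count bases covered by at least one interval."""
    return sum(end - start for start, end in merge_intervals(intervals))


def extra_coverage_percent(
    baseline: Iterable[Interval], additional: Iterable[Interval]
) -> float:
    """Percentage of extra bases gained by adding `additional` to `baseline`.

    Assumption: the gain is measured relative to the baseline coverage.
    """
    baseline = list(baseline)
    base = covered_bases(baseline)
    if base == 0:
        raise ValueError("baseline covers no bases")
    combined = covered_bases(baseline + list(additional))
    return 100.0 * (combined - base) / base


# Toy data (invented for illustration only).
baseline_hits = [(100, 200), (500, 600)]    # e.g. hits from a consensus-based set
additional_hits = [(180, 250), (800, 850)]  # one extension, one novel fragment
gain = extra_coverage_percent(baseline_hits, additional_hits)
```

In the toy data, the first additional hit extends an existing fragment and the second is a novel fragment, which mirrors the two kinds of gain described in the abstract. Merging before counting ensures that bases detected by both sets are counted only once.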

Dates and versions

hal-00280496, version 1 (19-05-2008)

Identifiers

HAL Id: hal-00280496
DOI: 10.1016/j.ygeno.2008.01.005

Cite

Nicolas Buisine, Hadi Quesneville, Vincent Colot. Improved detection and annotation of transposable elements in sequenced genomes using multiple reference sequence sets. Genomics, 2008, 91 (5), pp.467-475. ⟨10.1016/j.ygeno.2008.01.005⟩. ⟨hal-00280496⟩