Skip to Main content Skip to Navigation
Conference papers

A Sub-quadratic Sequence Alignment Algorithm for Unrestricted Cost Matrices

Abstract : The classical algorithm for computing the similarity between two sequences [36, 39] uses a dynamic programming matrix, and compares two strings of size n in O(n²) time. We address the challenge of computing the similarity of two strings in sub-quadratic time, for metrics which use a scoring matrix of unrestricted weights. Our algorithm applies to both local and global alignment computations.The speed-up is achieved by dividing the dynamic programming matrix into variable sized blocks, as induced by Lempel-Ziv parsing of both strings, and utilizing the inherent periodic nature of both strings. This leads to an O(n²/log n) algorithm for an input of constant alphabet size. For most texts, the time complexity is actually O(hn²/log n) where h ≤ 1 is the entropy of the text.
Document type :
Conference papers
Complete list of metadata
Contributor : Maxime Crochemore Connect in order to contact the contributor
Submitted on : Wednesday, March 20, 2013 - 6:43:25 PM
Last modification on : Thursday, September 29, 2022 - 2:21:15 PM
Long-term archiving on: : Friday, June 21, 2013 - 3:05:09 AM


Files produced by the author(s)


  • HAL Id : hal-00619996, version 1


Maxime Crochemore, Gad M. Landau, Michal Ziv-Ukelson. A Sub-quadratic Sequence Alignment Algorithm for Unrestricted Cost Matrices. Proceedings of the Thirteen Annual ACM-SIAM Symposium on Discrete Algorithms, 2002, United States. pp.679-688. ⟨hal-00619996⟩



Record views


Files downloads