Faster subsequence recognition in compressed strings

doi:10.48550/arXiv.0707.3407

Faster subsequence recognition in compressed strings

Tiskin, Alexander

Computation on compressed strings is one of the key approaches to processing massive data sets. We consider local subsequence recognition problems on strings compressed by straight-line programs (SLP), which is closely related to Lempel--Ziv compression. For an SLP-compressed text of length $\bar m$, and an uncompressed pattern of length $n$, C{é}gielski et al. gave an algorithm for local subsequence recognition running in time $O(\bar mn^2 \log n)$. We improve the running time to $O(\bar mn^{1.5})$. Our algorithm can also be used to compute the longest common subsequence between a compressed text and an uncompressed pattern in time $O(\bar mn^{1.5})$; the same problem with a compressed pattern is known to be NP-hard.

Publication:

arXiv e-prints

Pub Date:

July 2007

DOI:

10.48550/arXiv.0707.3407

arXiv:

arXiv:0707.3407

Bibcode:

2007arXiv0707.3407T

Keywords:

Computer Science - Data Structures and Algorithms;
Computer Science - Computational Complexity;
Computer Science - Discrete Mathematics

NASA/ADS

Faster subsequence recognition in compressed strings

Abstract