Computing Lempel-Ziv Factorization Online
Abstract
We present an algorithm which computes the Lempel-Ziv factorization of a word $W$ of length $n$ on an alphabet $\Sigma$ of size $\sigma$ online in the following sense: it reads $W$ starting from the left, and, after reading each $r = O(\log_{\sigma} n)$ characters of $W$, updates the Lempel-Ziv factorization. The algorithm requires $O(n \log \sigma)$ bits of space and O(n \log^2 n) time. The basis of the algorithm is a sparse suffix tree combined with wavelet trees.
- Publication:
-
arXiv e-prints
- Pub Date:
- February 2012
- DOI:
- 10.48550/arXiv.1202.5233
- arXiv:
- arXiv:1202.5233
- Bibcode:
- 2012arXiv1202.5233S
- Keywords:
-
- Computer Science - Data Structures and Algorithms
- E-Print:
- doi:10.1007/978-3-642-32589-2