Fixed Block Compression Boosting in FM-Indexes
Abstract
A compressed full-text self-index occupies space close to that of the compressed text and simultaneously allows fast pattern matching and random access to the underlying text. Among the best compressed self-indexes, in theory and in practice, are several members of the FM-index family. In this paper, we describe new FM-index variants that combine nice theoretical properties, simple implementation and improved practical performance. Our main result is a new technique called fixed block compression boosting, which is a simpler and faster alternative to optimal compression boosting and implicit compression boosting used in previous FM-indexes.
- Publication:
-
arXiv e-prints
- Pub Date:
- April 2011
- DOI:
- 10.48550/arXiv.1104.3810
- arXiv:
- arXiv:1104.3810
- Bibcode:
- 2011arXiv1104.3810K
- Keywords:
-
- Computer Science - Data Structures and Algorithms;
- Computer Science - Information Retrieval