Enhanced Byte Codes with Restricted Prefix Properties
J. Shane Culpepper
NICTA Victoria Laboratory,
Department of Computer Science and Software Engineering,
The University of Melbourne,
Victoria 3010, Australia.
Alistair Moffat
Department of Computer Science and Software Engineering,
The University of Melbourne,
Victoria 3010, Australia.
Status
Proc. 13th Int. Symp. String Processing and Information Retrieval,
Glasgow, Scotland, October 2006, pages 337-345.
LNCS volume 4209.
Abstract
Byte codes are a practical alternative to the traditional
bit-oriented compression approaches when large alphabets are being
used, and trade away a small amount of compression effectiveness for
a relatively large gain in decoding efficiency.
Byte codes also have the advantage of being searchable using standard
string matching techniques.
Here we describe methods for searching in byte-coded compressed text
and investigate the impact of large alphabets on traditional string
matching techniques.
We also describe techniques for phrase-based searching in a
restricted type of byte code, and present experimental results that
compare our adapted methods with previous approaches.
Full text
http://dx.doi.org/10.1007/11880561_28