Improved Word-Aligned Binary Compression for Text Indexing


Vo Ngoc Anh
Department of Computer Science and Software Engineering, The University of Melbourne, Victoria 3010, Australia.

Alistair Moffat
Department of Computer Science and Software Engineering, The University of Melbourne, Victoria 3010, Australia.


Status

IEEE Trans. Knowledge and Data Engineering, June 2006, 18(6):857-861.

Abstract

We present an improved compression mechanism for handling the compressed inverted indexes used in text retrieval systems, extending the word-aligned binary coding CARRY method. Experiments using two typical document collections show that the new method obtains superior compression to previous static codes, without penalty in terms of execution speed.

Full text

http://dx.doi.org/10.1109/TKDE.2006.99