Key points are not available for this paper at this time.
High-performance data-intensive query processing tasks like OLAP, data mining or scientific data analysis can be severely I/O bound, even when high-end RAID storage systems are used. Compression can alleviate this bottleneck only if encoding and decoding speeds significantly exceed RAID I/O bandwidth. For this purpose, we propose three new versatile compression schemes (PDICT, PFOR, and PFOR-DELTA) that are specifically designed to extract maximum IPC from modern CPUs. We compare these algorithms with compression techniques used in (commercial) database and information retrieval systems. Our experiments on the MonetDB/X100 database system, using both DSM and PAX disk storage, show that these techniques strongly accelerate TPC-H performance to the point that the I/O bottleneck is eliminated.
Building similarity graph...
Analyzing shared references across papers
Loading...
Marcin Żukowski
Bialystok University of Technology
Sándor Héman
Centrum Wiskunde & Informatica
Niels Nes
Centrum Wiskunde & Informatica
Centrum Wiskunde & Informatica
Building similarity graph...
Analyzing shared references across papers
Loading...
Żukowski et al. (Sun,) studied this question.
synapsesocial.com/papers/6a1eecbf64962c9f010504bf — DOI: https://doi.org/10.1109/icde.2006.150