Weighted Burrows–Wheeler Compression
2023, SN computer science
https://doi.org/10.1007/S42979-022-01629-5Abstract
A weight based dynamic compression method has recently been proposed, which is especially suitable for the encoding of files with locally skewed distributions. Its main idea is to assign larger weights to closer to be encoded symbols by means of an increasing weight function, rather than considering each position in the text evenly. A well known transformation that tends to convert input files into files with a more skewed distribution is the Burrows-Wheeler Transform. This paper employs the weighted approach on Burrows-Wheeler transformed files and provides empirical evidence of the efficiency of this combination.
References (10)
- Michael Burrows and David J. Wheeler. A block-sorting lossless data compression algorithm. Technical Report 124, Digital Equipment Corporation, 1994.
- Peter Elias. Universal codeword sets and representations of the integers. IEEE Trans. Information Theory, 21(2):194-203, 1975.
- Aharon Fruchtman, Yoav Gross, Shmuel T. Klein, and Dana Shapira. Weighted adaptive coding. CoRR, abs/2005.08232, 2020.
- Aharon Fruchtman, Yoav Gross, Shmuel T. Klein, and Dana Shapira. Backward weighted coding. In To appear in Data Compression Conference, DCC 2021. IEEE, 2021.
- Aharon Fruchtman, Shmuel T. Klein, and Dana Shapira. Bidirectional adaptive compression. In Proceedings of the Prague Stringology Conference 2019, pages 92-101, 2019.
- Shmuel T. Klein, Shoham Saadia, and Dana Shapira. Forward looking Huffman coding. Theory of Computing Systems, pages 1-20, 2020.
- Alistair Moffat. Huffman coding. ACM Comput. Surv., 52(4):85:1-85:35, 2019.
- Mark Nelson and Jean-Loup Gailly. The Data Compression Book. M & T Books, 1996.
- Jeffrey S. Vitter. Design and analysis of dynamic Huffman codes. JACM, 34(4):825-845, 1987.
- Jacob Ziv and Abraham Lempel. A universal algorithm for sequential data com- pression. IEEE Trans. Inf. Theory, 23(3):337-343, 1977.