Text Compression Benchmark
Ranks lossless data compression programs by compressed size (including the size of the decompression program) on the first 10^9 bytes of the XML text dump of the English version of Wikipedia from Mar. 3, 2006.
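As a rough illustration of the ranking rule (compressed size plus decompressor size, smaller is better), here is a minimal sketch using only Python's standard-library compressors as stand-ins for real entrants. The names `CODECS` and `rank_codecs`, the sample data, and the optional `decompressor_sizes` values are all assumptions made for this example; they are not part of the benchmark itself.

```python
import bz2
import lzma
import zlib
from typing import Callable, Dict, List, Optional, Tuple

# Standard-library compressors used as stand-ins for benchmark entrants.
CODECS: Dict[str, Callable[[bytes], bytes]] = {
    "zlib": lambda d: zlib.compress(d, 9),
    "bz2": lambda d: bz2.compress(d, 9),
    "lzma": lambda d: lzma.compress(d, preset=9),
}


def rank_codecs(
    data: bytes,
    decompressor_sizes: Optional[Dict[str, int]] = None,
) -> List[Tuple[str, int]]:
    """Return (name, total_bytes) pairs sorted from smallest to largest.

    total_bytes = compressed size + decompressor size. The decompressor
    sizes are hypothetical inputs; codecs without an entry count as 0.
    """
    sizes = decompressor_sizes or {}
    totals = [
        (name, len(compress(data)) + sizes.get(name, 0))
        for name, compress in CODECS.items()
    ]
    return sorted(totals, key=lambda pair: pair[1])


if __name__ == "__main__":
    # Tiny made-up XML-like sample; the real benchmark input is far larger.
    sample = b"<page><title>Example</title><text>lossless</text></page>\n" * 2000
    for name, total in rank_codecs(sample):
        print(f"{name:5s} {total:8d} bytes")
```

Counting the decompressor in the total keeps an entrant from hiding knowledge of the input inside the program itself, which is why the sketch accepts per-codec sizes instead of ranking on compressed output alone.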
- Text classification by data compression / HN
- facebook/zstd - a fast lossless compression algorithm, targeting real-time compression scenarios at zlib-level and better compression ratios.
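
One common way to classify text with a compressor is nearest-neighbour search under normalized compression distance (NCD). The sketch below assumes that approach; it may differ from what the linked discussion describes. It uses `zlib` from the standard library in place of zstd, and the function names and the tiny training set are made up for illustration.

```python
import zlib
from typing import List, Tuple


def compressed_size(data: bytes) -> int:
    """Length of the zlib-compressed form of data."""
    return len(zlib.compress(data, 9))


def ncd(x: bytes, y: bytes) -> float:
    """Normalized compression distance: smaller means more shared structure."""
    cx, cy = compressed_size(x), compressed_size(y)
    cxy = compressed_size(x + b" " + y)
    return (cxy - min(cx, cy)) / max(cx, cy)


def classify(text: str, labeled: List[Tuple[str, str]]) -> str:
    """Return the label of the training text closest to `text` under NCD."""
    sample = text.encode("utf-8")
    nearest = min(labeled, key=lambda pair: ncd(sample, pair[0].encode("utf-8")))
    return nearest[1]


if __name__ == "__main__":
    # Hypothetical two-example training set: (text, label).
    training = [
        ("the compressor shrinks the file with a dictionary and entropy coding",
         "compression"),
        ("the striker scored a goal in the final minute of the football match",
         "sports"),
    ]
    print(classify("entropy coding shrinks the file after dictionary matching",
                   training))
```

The idea is that concatenating two related texts compresses better than concatenating two unrelated ones, so no model training is needed. Results on short strings are noisy, so treat this as a toy.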
see also
- Hutter Prize - 500’000€ Prize for Compressing Human Knowledge
- SMAZ - compression for very small strings
Written on September 19, 2020, Last update on March 17, 2025
zip
text
benchmarking