Large Text Compression Benchmark
ranks lossless data compression programs by the compressed size (including the size of the decompression program) of the first 109 bytes of the XML text dump of the English version of Wikipedia on Mar. 3, 2006. Benchmark
- Text classification by data compression / HN
- facebook/zstd - a fast lossless compression algorithm, targeting real-time compression scenarios at zlib-level and better compression ratios.
see also
- Hutter Prize - 500’000€ Prize for Compressing Human Knowledge
Written on September 19, 2020, Last update on September 1, 2023
zip
text
benchmarking