The Art of Lossless Data Compression vol. 22b

Here are the results of tests performed in May 2001 to compare lossless compression of "binary" files by all known good enough programs developed for such purpose, including RK, DC, YBS, Bzip2, IMP, RAR and 7-zip. See Archive Comparison Test by J.Gilchrist for more details: http://act.by.net If anybody wants to start or continue such tests, or can suggest some other sets of files, or other compression programs, (not sources or algorithm descriptions, executable programs only) or knows we have missed something important, (some new fantastic technology, an algorithm or even a program capable of lossless compression of up to 1000:1 etc.) please let us know immediately: artest@inbox.ru Thank you!

[[1]] COMPRESSION QUALITY

(see also [[2]] Speed [[3]] Details [[4]] Comments) Last eleventh line shows results for the sum of all 5945 files in ten sets. Original UHArc RK 777 Ufa1 ACE32 PPMonstr SBC BIX IMP 245.01% 100% 101.89 104.15 104.80 107.19 104.11 109.18 109.76 112.87 259.38% 101.42 102.53 100% 103.91 108.24 109.26 111.63 108.90 110.13 184.78% 100% 101.85 101.46 102.25 104.83 104.93 106.78 105.64 106.08 337.65% 100% 114.37 118.13 117.45 123.98 119.99 113.69 125.62 120.00 174.10% 100% 108.20 111.56 111.66 103.82 113.01 104.87 119.42 116.08 387.10% 101.73 100% 107.47 111.20 118.22 113.05 117.94 119.51 124.67 191.07% 100% 100.81 101.96 102.57 106.02 104.93 105.47 105.21 107.02 271.77% 100.60 100% 103.60 103.18 107.75 110.60 111.39 113.93 119.89 229.21% 100% 100.78 101.18 101.94 105.29 106.00 107.22 105.55 106.46 352.87% 101.12 100% 105.72 104.92 113.56 109.65 116.86 115.59 119.97 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 239.49% 100% 102.17 103.71 104.65 107.64 107.67 108.88 110.21 111.48 DC BOA ZZip RAR YBS ERI32 ArHanGel SZip BZip2 7-zip 108.55 109.97 112.08 111.73 111.09 111.54 111.63 112.17 115.21 117.17 117.70 113.89 114.51 113.37 114.52 120.44 123.86 118.71 121.72 127.49 109.91 107.35 108.41 107.93 108.86 111.01 108.54 110.24 112.11 110.17 119.40 127.77 127.19 121.43 124.43 125.12 125.40 130.87 134.96 148.61 107.00 115.55 114.14 105.93 115.67 101.99 109.26 116.00 116.69 122.67 120.00 120.13 123.30 129.39 124.59 124.68 125.00 123.14 129.62 128.19 108.30 108.29 106.46 109.70 106.47 111.18 109.99 110.50 112.94 113.78 112.37 114.85 114.95 118.44 118.22 113.36 119.84 117.93 121.24 121.25 110.43 109.07 111.72 109.64 108.51 114.84 112.32 112.11 114.39 114.26 121.00 117.72 118.64 128.74 124.25 126.68 128.80 124.20 128.98 129.09 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 111.79 111.90 112.67 112.68 112.95 114.01 114.66 114.91 117.68 119.29

[[2]] Speed

7th set, all from ftp://ftp.simtel.net/pub/simtelnet/win95/proj/tm2k.zip (17Mb) was used for this test, and an AMD-K6-400 machine with 192Mb RAM and Windows98. Programs,options Overall Average Compress Extract Compressed score, Users' time, time, size, score, seconds seconds bytes seconds % seconds % NO COMPRESSION 8010 176% 8010 184% 0 0 28838607 777 a -mg 10240 226% 7137 164% 3448 2518 15388866 7z a -tufa1 4526 100% 4341 100% 205 21 15479766 7zip a 4872 107% 4798 110% 83 10 17208734 7zip a -mx 4929 108% 4794 110% 150 10 17171854 ace32 a 4559 101% 4426 101% 147 12 15838949 ace32 a -d4096 4539 100% 4389 101% 166 13 15698733 ace32 a -d4096 -s- 4638 102% 4475 103% 181 13 16000773 ace32 a -d4096 -m1 4543 100% 4407 101% 151 13 15765625 ace32 a -d4096 -m5 4544 100% 4389 101% 173 12 15694577 arh a -1 -mm 4869 107% 4751 109% 131 20 16988963 ba -k 4974 109% 4821 111% 171 103 16925715 ba -k -1 5022 110% 4884 112% 154 96 17182514 ba -k -10 4995 110% 4840 111% 172 103 16993554 ba -k -50 5111 112% 4942 113% 188 106 17342417 bix a 4773 105% 4585 105% 209 8 16402627 bix a -mdg 4780 105% 4578 105% 225 8 16371106 bix a -mdg -m9 4642 102% 4441 102% 224 9 15878044 bix a -mdg -s 4785 105% 4511 103% 305 10 16097343 boa -m1 7047 155% 5783 133% 1405 982 16776429 boa -m15 9361 206% 6685 153% 2974 1859 16303786 boa -m15 -s 9406 207% 6678 153% 3032 1848 16296644 bzip2 -k 4937 109% 4812 110% 138 34 17152840 bzip2 -k -1 4989 110% 4882 112% 119 30 17425456 bzip2 -k -5 4932 108% 4823 111% 121 34 17200539 dc e 4982 110% 4875 112% 119 60 17290210 dc e -fx -fb 4936 109% 4810 110% 141 59 17054070 dc e -b16300 4794 105% 4687 107% 120 58 16622110 dc e -b16300 -mb5 4794 105% 4687 107% 119 58 16622110 eri a -m2 5863 129% 4879 112% 1094 83 16871942 eri a -m3 5892 130% 4865 112% 1141 89 16784470 eri a 5947 131% 4871 112% 1196 91 16780779 imp a -2 -s4 4760 105% 4618 106% 158 18 16503511 imp98 a 4584 101% 4516 104% 75 6 16213107 imp98 a -mm 4561 100% 4489 103% 80 7 16107243 imp98 a -mm -m3 4575 101% 4465 102% 122 7 16006105 imp98 a -mm -2 4696 103% 4569 105% 141 14 16348416 imp98 a -mm -s4 4569 100% 4496 103% 81 6 16137447 imp98 a -2 -s4 4757 105% 4616 106% 157 17 16503121 pkzip -es 5193 114% 5175 119% 20 7 18602185 pkzip -a 4887 107% 4850 111% 41 6 17427441 pkzip -exx 4898 108% 4847 111% 57 6 17408609 ppmd e -o5 5026 111% 4829 111% 218 247 16420589 ppmd e -o7 5014 110% 4815 110% 221 248 16365468 ppmd e -o9 5025 111% 4824 111% 223 250 16389915 ppmd e -o5 -m176 5201 114% 4923 113% 309 323 16451745 ppmd e -o6 -m176 5150 113% 4879 112% 301 328 16279459 ppmd e -o7 -m176 5110 112% 4833 111% 308 322 16131941 ppmd e -o8 -m176 5125 113% 4845 111% 311 332 16137203 ppmd e -o9 -m176 5119 113% 4840 111% 310 323 16150714 ppmonstr e -o5 5455 120% 5024 115% 478 479 16192041 ppmonstr e -o7 5448 120% 5011 115% 485 486 16116589 ppmonstr e -o9 5464 120% 5021 115% 491 492 16130605 ppmonstr e -o5 -m176 5656 124% 5119 117% 596 586 16106344 ppmonstr e -o7 -m176 5649 124% 5088 117% 623 610 15897279 ppmonstr e -o9 -m176 5612 123% 5063 116% 610 599 15853490 rar a 4810 106% 4651 107% 177 10 16645832 rar a -mm 4789 105% 4629 106% 178 10 16564347 rar a -mm -m1 4808 106% 4668 107% 156 10 16711398 rar a -mm -m5 4806 106% 4629 106% 198 10 16556916 rar a -mm -mdc 4783 105% 4658 107% 138 11 16684304 rar a -mm -mda 4816 106% 4709 108% 118 10 16876101 rar a -mm -s 4771 105% 4578 105% 215 10 16370991 rar32 a -mm -s 4770 105% 4579 105% 213 11 16370991 rk -mf1 4888 107% 4762 109% 140 117 16670096 rk -mf2 5066 111% 4793 110% 303 285 16122616 rk -mf3 5176 114% 4788 110% 430 293 16028412 rk -mx1 8077 178% 6530 150% 1719 2122 15251852 rk -mx2 8773 193% 6932 159% 2046 2527 15121448 rk -mx3 9671 213% 7449 171% 2469 3013 15083536 sbc c -m1 4766 105% 4626 106% 156 108 16211837 sbc c -m2 4884 107% 4676 107% 232 166 16153516 sbc c -m3 4946 109% 4699 108% 275 200 16097093 sbc c -m3 -b59 5058 111% 4733 109% 362 253 15998323 sbc c -a -b59 4792 105% 4587 105% 228 150 15892050 sbc c -on -b59 4834 106% 4634 106% 222 168 16000604 sbc c -oa -b59 4991 110% 4700 108% 323 218 16023257 sbc c -os -b59 5499 121% 5008 115% 545 344 16596716 szip -v0 4923 108% 4777 110% 162 71 16885005 szip -o4 4863 107% 4786 110% 86 66 16963281 szip -o8 4967 109% 4779 110% 209 76 16856647 szip -o8 -b41 5031 111% 4834 111% 220 78 17042908 uharc a -m1 -md8192 4819 106% 4444 102% 417 68 15605510 uharc a -m2 -md8192 4968 109% 4429 102% 598 68 15485799 uharc a -m3 -md8192 5212 115% 4424 101% 875 70 15362234 uharc a -mz -md8192 4805 106% 4693 108% 125 133 16372698 uharc a -mx -md8192 5577 123% 4875 112% 780 653 14919826 ybs -m1m 4851 107% 4711 108% 155 40 16764287 ybs -m2m 4895 108% 4737 109% 175 44 16835360 ybs -m4m 5041 111% 4803 110% 264 59 16984609 ybs_d -y 4924 108% 4758 109% 184 63 16835360 ybs_d -m2mu 4923 108% 4758 109% 184 63 16835360 ybs_d -m15mu 5072 112% 4809 110% 292 91 16878402 ybs_d -m15mu -r 5087 112% 4806 110% 312 88 16874162 zzip a 5699 125% 4820 111% 977 56 16800859 zzip a -mm 5654 124% 4817 110% 930 56 16807656 zzip a -mm -mx 5562 122% 4710 108% 947 56 16418643 zzip a -mm -30m 5663 125% 4821 111% 935 73 16758895 Overall score is calculated by adding compression time, extraction time, and time it would take to transfer the compressed file over a 28,800bps network: (compressed_size)/3600 , because 28800 bits_per_second is 3600 bytes_per_second Average Users' score is calculated by adding (compress_time/10)+ extract_time + time it would take to transfer the compressed file over a 28,800bps network. Compression time is divided by 10 here, because more than 90% of people would never compress anything during their life (with compression programs), but they use compressed data almost _every_ time they use computers and/or Internet. That's why compression time is not so actual for them.

[[3]] Details

are no longer put to this main text (6208 lines reporting 214416 results on 5945 files in 10 sets), but can be found in FULL version with BINARIES.DAT and *.BAT at http://geocities.com/SiliconValley/Bay/1995/artest22.zip or http://artest1.tripod.com/artest22.zip

[[4]] Comments

Links to download programs:

7-Zip 2.24 :W http://www.7-zip.com/dl/7zip224.exe 463K ACE32 2.02 :W ftp://ftp.forlangs.net/pub/windows/winace/ace202.exe 587K ERI32 4.16fre :e http://geocities.com/eri32/eri416fr.zip 94K PkzipC 4.00 :W ftp://ftp.pkware.com/pkzc400s.exe 3470K RK-dos 1.04.1 :e http://rksoft.virtualave.net/downloads/rk104a1d.exe 461K RK 1.04.1 :W http://rksoft.virtualave.net/downloads/rk104a1w.exe 380K RAR32 2.80 :e ftp://ftp.netlab.sk/public/rarsoft/rar/rarx280.exe 269K WinRAR 2.80 :W ftp://ftp.netlab.sk/public/rarsoft/rar/wrar280.exe 621K BA 1.01b5 :e http://hem.spray.se/mikael.lundqvist/ba101br5.zip 61K SBC 0.860b :e http://geocities.com/sbcarchiver/sbc0860b.zip 208K ZZip 0.36c :W http://www.via.ecp.fr/~damien/downloads/zzip-win32.zip 35K PPMD var.H, PPmonstr v.H :W ftp://ftp.cdrom.com/.2/sac/pack/ppmdh.rar 57K BIX 1.00b7 :W http://www.7-zip.com/dl/ufa/bix100b7.zip 89K 777 0.04b1 :W http://www.7-zip.com/dl/ufa/777004b1.zip 72K UFA 0.04b1 :W http://www.7-zip.com/dl/ufa/ufa004b1.zip 64K ArHanGeL 1.40 :a http://geocities.com/SiliconValley/Lab/6606/arh140.zip 50K Imp 1.1 :e http://www.winimp.com/imp110d.zip 266K Imp-win 1.12 :W http://www.winimp.com/imp112.exe 122K PkZip 2.50 :a ftp://ftp.simtel.net/pub/simtelnet/msdos/arcers/pk250dos.exe 202K ACB 2.00c :e ftp://ftp.simtel.net/pub/simtelnet/msdos/compress/acb_200c.zip 42K BOA 0.58b :e ftp://ftp.cdrom.com/.2/sac/pack/boa058.zip 74K DC 0.98b :W ftp://ftp.cdrom.com/.2/sac/pack/dc124.zip 55K Bzip2 1.0.1 :W ftp://sourceware.cygnus.com/pub/bzip2/v100/bzip2-100-x86-win32.exe 68K SZip 1.12a :W http://www.compressconsult.com/szip/szip_112a_win32.zip 71K UHArc 0.2b :e ftp://ftp.cdrom.com/.2/sac/pack/uharc02.zip 101K YBS 0.03e :e http://members.nbci.com/vycct/ybs003ed.zip 55K YBS 0.03e :W http://members.nbci.com/vycct/ybs003ew.zip 43K BEE 0.4.8 :W Andrew.Filinsky@p11.f4.n452.z2.fidonet.org :a - any DOS - DOS programs, will run under pure DOS or in a DOS box :e - extender - DOS programs using DOS extenders like DOS/4GW or CWSDPMI :W - windows - Windows95/98/NT/etc programs If direct link doesn't work-most probably newer version of the program appeared at the same site: visit web page, or read the whole directory from ftp server (i.e. try the same URL, but without filename).

Homepages:

Arhangel : http://geocities.com/SiliconValley/Lab/6606 BA : http://hem.spray.se/mikael.lundqvist Eri32 : http://geocities.com/eri32 mirror : http://artest1.tripod.com RK : http://rksoft.virtualave.net Imp,WinImp : http://www.technelysium.com.au mirror : http://www.winimp.com ACE,WinACE : http://www.winace.com PkZip : http://www.pkware.com RAR,WinRAR : http://www.rarsoft.com BZip2 : http://sources.redhat.com/bzip2 SZip : http://www.compressconsult.com/szip ZZip : http://www.zzip.f2s.com YBS : http://members.nbci.com/vycct SBC : http://geocities.com/sbcarchiver Ufa,777, BIX,7-Zip: http://www.7-zip.com PPMD, PPMonstr, ACB, Bee, BOA, DC, UHArc - no homepage.

What's new:

13 new programs tested: RK, SBC, ZZip, ACE, 7-zip, RAR32, WinRAR, ERI32, BA, PPMD, PPMonstr, UHARC, Ufa1. Test data was updated. Latest beta versions of BEE, DC, UFA, UHArc are available from authors by e-mail request: BEE: Andrew.Filinsky@p11.f4.n452.z2.fidonet.org DC: EdgarBinder@t-online.de UFA: support@7-zip.com UHARC: Uwe.Herklotz@gmx.de ACB and BEE are not tested on all binary files any more, both are too slow and not efficient on binary data, especially with uncompressed multimedia, their results can be found in previous versions of ARTest. Results of BA and Pkzip are in full version only, BINARIES.DAT file. FULL version contains all *.BAT and *.DAT files you'll need to build 10 directories with 5945 files (artest22.zip\MAKE_BIN\*.*) and to repeat all our tests (BINS.BAT, BINARIES.DAT).

WARNINGS:

ACB refuses to take files shorter than 257 bytes. About 400 such files were processed with "rar a -m0 name.acb name.bin" (see do_acb_u.bat and do_acb_r.bat in artest19.zip). BA 1.00beta5 can't losslessly decompress many files: CUN5IF~1.bin, CUN6IF~1 etc It says nothing like "CRC fails". BEE can't decompress some files compressed in "solid" mode. DC 0.99.158b fails to decompress HLPCOPY..bin, but only if you compress with "-mb5" switch and [] filename like "any.dc5". Says nothing like "CRC fails". Bugs in tested version of ZZip were found, but they are removed from latest version ZZip 0.36c . Problems in all other compressors were not found. The LATEST RELEASE, and all previous versions of these tests can be found at http://geocities.com/SiliconValley/Bay/1995/ and http://artest1.tripod.com/ Send your suggestions, comments to artest@hotmail.ru With best kind regards, A.Ratushnyak, RAO Inc. Back to main ARTest page


1