Compress human knowledge, win a 50,000 euro prize
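This 100 MB file is actually made up of Wikipedia articles.
Task:
Compress the 100 MB file enwik8 to less than 18 MB:
- Write a Linux or Windows program, archive8.exe, whose file size must be smaller than 18,324,887 bytes (the current record).
- When run, the program must produce a 100,000,000-byte file whose contents are identical to enwik8.
Website: http://prize.hutter1.net
enwik8.zip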
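The two conditions in the task can be sketched as a small check. This is only an illustration in Python: the function names are made up for this example, and the two constants are simply the figures quoted in this post, not an official verifier from the prize site.

```python
import hashlib

RECORD_BYTES = 18_324_887    # record quoted in this post (assumed still current)
ENWIK8_BYTES = 100_000_000   # size the reconstructed file must have


def sha256_of(data: bytes) -> str:
    """Hex digest used to compare two byte strings."""
    return hashlib.sha256(data).hexdigest()


def beats_record(archive_size: int, record: int = RECORD_BYTES) -> bool:
    """The self-extracting archive must be strictly smaller than the record."""
    return 0 < archive_size < record


def is_exact_copy(output: bytes, reference: bytes) -> bool:
    """Lossless means same length and same content, byte for byte."""
    return len(output) == len(reference) and sha256_of(output) == sha256_of(reference)
```

With real files, `archive_size` would come from `os.path.getsize("archive8.exe")`, and `output` and `reference` would be the bytes of the regenerated file and of enwik8 itself.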
[ Last edited by fxam on 20-8-2006 11:45 PM ]
Posted on 18-8-2006 10:42 AM
Posted on 19-8-2006 10:33 AM
I don't know WinZip or PKzip, but I do know the concepts behind file compression, DeSpace() and Compression().
Posted on 19-8-2006 08:08 PM
Just zip it with WinZip!!
But then a lot of people would probably be able to claim the prize.
Posted on 20-8-2006 10:48 PM
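Reply to post #4 by quantum^_^
Heh, if it were that easy... the prize wouldn't be this high...
Granted, I haven't tried compressing the file with WinZip myself... but I'm sure WinZip can't do it...
Also, the program must be freeware (a free trial version is fine too)...
Here are some of the rules: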
Rules for Benchmark Listing
Last update: June 17, 2006.
All results must be subject to public verification. Eligible compression programs must be available on the Internet for free download and testing. Commercial programs with a free trial period of 7 days or more are allowed. Programs that require personal information such as name or email address before they can be downloaded or used are not considered free. Extensions to existing programs such as GUI wrappers that do not change the compressed format are not eligible. Programs or versions withdrawn by the author are not eligible. Programs violating licenses of other programs are not eligible. Patented algorithms are allowed. At my discretion I may list ineligible results anyway with appropriate caveats.
Compression programs will be ranked by the compressed size of enwik9 plus the size of a zip archive (readable by unzip) containing the decompressor and any other files needed by the decompressor at run time (dictionaries, configuration files, .dll files not normally part of Windows, etc). The archive may contain either an executable program or source code in any general purpose programming language, whichever is smaller.
Only the version and combination of options achieving the best known compression for each program will appear in the ranked results. Other results may appear in the individual program descriptions. Two differently-named programs are considered different versions of the same program if they are by the same author and use the same underlying algorithm (LZ77, BWT, PPM, CM, etc).
The decompressor must be able to run without a network connection. The decompressor must run without selecting options that affect the contents of the uncompressed file, whether these options are passed on the command line, selected using a GUI, or from environment variables, configuration files, the Windows registry, or any other source that must be configured by the user or is set during compression. Changing the name or attributes of the compressed file (other than its contents) must not affect the contents after decompression. Most programs meet these requirements. If not, the length of a string containing any required settings will be added to the compressed size (e.g. epm).
Compressors and decompressors do not have to be general purpose. They may be tuned specifically to this benchmark and are allowed to reject or fail on any input other than enwik9. However, the test hardware, operating system, compiler, and programming language implementing the decompressor must be general purpose, available to the public, and not specifically designed to improve the ranking on this benchmark. (A Win32 or Linux executable or C/C++ program meets this requirement).
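The ranking rule quoted above (compressed size plus the size of a zip archive holding the decompressor) can be sketched roughly as follows. This is a Python illustration only: `zipped_size` and `benchmark_score` are hypothetical names, and the zip is built in memory with the standard `zipfile` module, which may not match the exact sizes the benchmark author's own zip tool would report.

```python
import io
import zipfile


def zipped_size(files: dict[str, bytes]) -> int:
    """Size in bytes of a zip archive holding the given files (built in memory)."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w", compression=zipfile.ZIP_DEFLATED) as archive:
        for name, data in files.items():
            archive.writestr(name, data)
    return len(buffer.getvalue())


def benchmark_score(compressed_size: int, decompressor_files: dict[str, bytes]) -> int:
    """Ranking figure: compressed data size plus zipped decompressor size."""
    return compressed_size + zipped_size(decompressor_files)
```

The point of counting the decompressor is that a program cannot simply hide the data inside itself: whatever it carries along (dictionaries, configuration files, extra .dll files) is added to the total.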
[ Last edited by meemee on 20-8-2006 10:49 PM ]
Posted on 21-8-2006 09:24 AM
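Originally posted by quantum^_^ on 19-8-2006 08:08 PM
Just zip it with WinZip!!
But then a lot of people would probably be able to claim the prize.
Anyone who uses WinZip is bound to lose; you'd be disqualified.
The WinZip file enwik8.zip that I downloaded from the site in the first post is already 35,012,219 bytes; uncompressed, enwik8.txt is 100,000,000 bytes.
The current record, held by challenger Matt Mahoney, is 18,324,887 bytes, a much better compression ratio than WinZip's. He wrote it using NASM, the C++ source code has been published, and the program is named paq8a.exe.
The 18,000,000-byte challenger entry I saw on 16 August 2006 has since been removed from the list.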
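To put the two figures in this post side by side, here is a small Python sketch of the arithmetic. The helper names are made up for this example, and the sizes are the ones quoted in this post, not re-measured.

```python
ORIGINAL_BYTES = 100_000_000   # enwik8, uncompressed
WINZIP_BYTES = 35_012_219      # enwik8.zip size as reported in this post
RECORD_BYTES = 18_324_887      # record size as reported in this post


def compression_ratio(compressed: int, original: int = ORIGINAL_BYTES) -> float:
    """Compressed size as a fraction of the original size (smaller is better)."""
    return compressed / original


def bits_per_char(compressed: int, original: int = ORIGINAL_BYTES) -> float:
    """Average number of output bits spent on each input byte."""
    return 8 * compressed / original


if __name__ == "__main__":
    for label, size in (("WinZip", WINZIP_BYTES), ("record", RECORD_BYTES)):
        print(f"{label}: {compression_ratio(size):.2%}, {bits_per_char(size):.3f} bits/char")
```

So the zip file is about 35.0% of the original (roughly 2.80 bits per character), while the record is about 18.3% (roughly 1.47 bits per character), a little over half the size of the zip.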