3 and 1/2 minutes to sort a Terabyte, and a look at Hadoop's code structure
Apache Hadoop Wins Terabyte Sort Benchmark: "One of Yahoo's Hadoop clusters sorted 1 terabyte of data in 209 seconds, which beat the previous record of 297 seconds in the annual general purpose (daytona) terabyte sort benchmark. The sort benchmark, which was created in 1998 by Jim Gray, specifies the input data (10 billion 100 byte records), which must be completely sorted and written to disk. This is the first tim阅读全文>
发表于 @ 2008年07月14日 21:52:00|评论(loading...)|收藏