http://lookqlp.iteye.com/blog/1742345
http://tech.qq.com/a/20121027/000056_1.htm
http://blog.linezing.com/2012/03/hbase-performance-optimization
http://www.csdn.net/article/2013-03-25/2814634-data-de-duplication-tactics-with-hdfs
http://www.alidata.org/archives/1509
http://highscalability.com/blog/2011/3/22/facebooks-new-realtime-analytics-system-hbase-to-process-20.html
http://stackoverflow.com/questions/13974736/hbase-select-distinct-query-against-the-rowkey