大数据相关论文
谷歌三剑客
Bigtable: A Distributed Storage System for Structured Data
MapReduce: Simplified Data Processing on Large Clusters
Hadoop
The Hadoop Distributed File System
Spark
Resilient distributed datasets: a fault-tolerant abstraction for in-memory cluster computing
pig
Building a High-Level Dataflow System on top of Map-Reduce: The Pig Experience
未完待续