SparkCore-1-概览
1.RDD(ResilientDistributedDataset)• 五大特性:– A list of partitions– A function for computing each partition– A list of dependencies on other RDDs– Optionally, a Partitioner for key-value RDDs • shuffle的时候– Optionally, a list of preferred locations to co
复制链接