并行化,在driver中
SparkConf conf = new SparkConf().setAppName("AppName").setMaster("masterIP");
JavaSparkContext sc = new JavaSparkContext(conf);
//SparkContext sc = new SparkContext(conf);
sc.parallelize(本地集合);//本地集合并行化
//sc.parallelizePairs(本地集合);
JavaPairRDD与JavaRDD区别
JavaRDD面向实体,而JavaPairRDD针对键值对
sc.close();