Sample English document
My father was a self-taught mandolin player. He was one of the best string instrument players in our town. He could not read music, but if he heard a tune a few times, he could play it. When he was younger, he was a member of a small country music b
A A A A A A A A A A A A A A A A A
B B B B B B B B BB B B BB B B B B
C C C C C C C C C C C
D D D D D D D D D D D
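For orientation, here is a minimal plain-Java sketch (no Spark) of the result a word count should produce for text like the sample above. It is an illustration only, not the program from this post: the class name WordCountSketch and the method countWords are hypothetical, and it assumes that words are separated by whitespace, so "BB" is counted as a different word from "B".

```java
import java.util.Map;
import java.util.TreeMap;

public class WordCountSketch {

    // Splits every line on runs of whitespace and counts how often each token occurs.
    // A TreeMap keeps the words sorted so the printed output is stable.
    public static Map<String, Integer> countWords(String... lines) {
        Map<String, Integer> counts = new TreeMap<>();
        for (String line : lines) {
            for (String word : line.trim().split("\\s+")) {
                // An empty line yields a single empty token; skip it.
                if (!word.isEmpty()) {
                    counts.merge(word, 1, Integer::sum);
                }
            }
        }
        return counts;
    }

    public static void main(String[] args) {
        // Two lines taken from the sample document above.
        Map<String, Integer> counts = countWords(
                "C C C C C C C C C C C",
                "D D D D D D D D D D D");
        counts.forEach((word, n) -> System.out.println(word + ": " + n));
    }
}
```

A Spark version computes the same per-word totals, but the lines are spread across the cluster and the counts for each word are combined across partitions rather than in a single in-memory map.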
Word-count program: counts how many times each word appears in the document
/**
* Created by hbin on 2016/12/9.
*/
import java.util.Arrays;
import org.apache.spark.SparkConf;
import org.apache.spark.api.java.JavaPairRDD;
import org.apache.spark.api.java.JavaRDD;
import org.apache.spark.api.java.JavaSparkContext;
import org.apache.spark.api.java.function.*;
import scala.Boolean;
import scala.Tuple2;
/**
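* Spark's core abstraction for data is the RDD (Resilient Distributed Dataset).
* An RDD is a distributed collection of elements. In Spark, every operation on data comes down to creating an RDD,
* transforming an existing RDD, or calling an RDD action to compute a result. Spark automatically distributes the data in an RDD across the cluster,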