先读取fileName,然后再将加载的结果收集一下collect,转换成List,再打印。
Configuration configuration = new Configuration(); configuration.set("io.serializations", "org.apache.hadoop.io.serializer.WritableSerialization,org.apache.hadoop.hbase.mapreduce.ResultSerialization"); JavaPairRDD input = sc.newAPIHadoopFile(fileName, SequenceFileInputFormat.class, ImmutableBytesWritable.