Original: HBase filters
import java.io.IOException;import java.util.ArrayList;import java.util.List;import org.apache.hadoop.conf.Configuration;import org.apache.hadoop.hbase.HBaseConfiguration;import org.apache.
2017-10-23 19:09:24 294
Original: Spark join
import org.apache.spark.HashPartitioner;import org.apache.spark.SparkConf;import org.apache.spark.api.java.JavaPairRDD;import org.apache.spark.api.java.JavaSparkContext;import org.apache.spark.sto
2017-10-23 19:05:32 380
Original: Spark partitioning
import org.apache.spark.Partitioner;import org.apache.spark.api.java.JavaPairRDD;import org.apache.spark.api.java.JavaRDD;import org.apache.spark.sql.SparkSession;import scala.Tuple2;import java
2017-10-23 19:04:46 405
Original: Spark combineByKey explained in detail (Java)
import org.apache.spark.SparkConf;import org.apache.spark.api.java.JavaPairRDD;import org.apache.spark.api.java.JavaRDD;import org.apache.spark.api.java.JavaSparkContext;import org.apache.spark.ap
2017-10-23 18:59:14 1186
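Original: KNN (Nearest Neighbor) algorithm, explained in plain language
This post mainly covers the core ideas of the algorithm and its general areas of application. It does not go into detail, so read it alongside other material available online.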
2017-10-22 14:08:03 610