- 博客(4)
- 收藏
- 关注
原创 spark graphx 教程 04 (join 算子)
spark graphx 04 (join 算子)为了演示graph的join算子,首先我们定义一个graphval users: RDD[(VertexId, (String, String))] = sc.parallelize(Array( (1L, ("a", "student")), (2L, ("b", "salesman")), (3L, ("c", "pro...
2019-05-27 19:59:56 641 1
原创 spark graphx 教程 03 (结构化算子)
spark graphx 03 (结构化算子)spark有如下4种结构化算子:reversesubgraphmaskgroupEdges为了演示以上4种结构化算子,我们首先初始化一个graph,代码如下val users: RDD[(VertexId, (String, String))] = sc.parallelize(Array((1L, ("a", "student"...
2019-05-27 19:48:33 464
原创 spark graphx 教程 02(基本代码)
现在我们从代码着手来快速体验一下spark graphx,首先graph类在spark中的路径是org.apache.spark.graphx,我们进去看看源代码graph基本属性我们在spark的graph类,看到了很多方法,但是属性只看到了如下3个基本属性:/** * An RDD containing the vertices and their associated att...
2019-05-14 20:15:58 681
原创 spark graphx 教程01(graphx基本概念)
spark graphx是什么GraphX is a new component in Spark for graphs and graph-parallel computation. At a high level, GraphX extends the Spark RDD by introducing a new Graph abstraction: a directed multigra...
2019-05-13 20:04:23 586
空空如也
空空如也
TA创建的收藏夹 TA关注的收藏夹
TA关注的人