Spark GraphX

GraphX is a new component in Spark for graphs and graph-parallel computation. At a high level, GraphX extends the Spark RDD by introducing a new Graph abstraction: a directed multigraph with properties attached to each vertex and edge.
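
To make the "directed multigraph with properties" idea concrete, here is a minimal plain-Python sketch. It is not the GraphX API: the names `Edge`, `PropertyGraph`, `out_degrees`, and `triplets` are hypothetical and chosen only for illustration. Edges are kept in a list so that parallel edges between the same pair of vertices are allowed, which is what makes it a multigraph.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Any, Dict, List, Tuple


@dataclass(frozen=True)
class Edge:
    """A directed edge carrying an arbitrary property (hypothetical type)."""
    src: int
    dst: int
    attr: Any


class PropertyGraph:
    """Directed multigraph with a property on every vertex and edge."""

    def __init__(self, vertices: Dict[int, Any], edges: List[Edge]):
        self.vertices = dict(vertices)  # vertex id -> vertex property
        self.edges = list(edges)        # a list, so parallel edges are kept

    def out_degrees(self) -> Dict[int, int]:
        # Count outgoing edges per source vertex; parallel edges count twice.
        return dict(Counter(e.src for e in self.edges))

    def triplets(self) -> List[Tuple[Any, Any, Any]]:
        # Join each edge with the properties of its two endpoints.
        return [(self.vertices[e.src], e.attr, self.vertices[e.dst])
                for e in self.edges]


graph = PropertyGraph(
    vertices={1: "alice", 2: "bob", 3: "carol"},
    edges=[
        Edge(1, 2, "follows"),
        Edge(1, 2, "messages"),  # second edge between the same pair
        Edge(2, 3, "follows"),
    ],
)
```

Calling `graph.triplets()` gives one (source property, edge property, destination property) tuple per edge, which is roughly the view a graph-parallel computation works over.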

Algorithms

PageRank
Connected components
Label propagation
SVD++
Strongly connected components
Triangle count
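
As an illustration of the first algorithm in the list, the following is a small single-machine sketch of PageRank by power iteration. It is a textbook formulation written for this note, not GraphX's implementation, which may differ in normalization and in how it treats vertices without outgoing edges. The function name `pagerank` and its parameters are assumptions.

```python
def pagerank(edges, num_iters=20, damping=0.85):
    """Power-iteration PageRank over a list of (src, dst) pairs."""
    vertices = sorted({v for edge in edges for v in edge})
    n = len(vertices)

    out_deg = {v: 0 for v in vertices}
    for src, _ in edges:
        out_deg[src] += 1

    # Start from a uniform distribution that sums to 1.
    rank = {v: 1.0 / n for v in vertices}

    for _ in range(num_iters):
        # Rank held by vertices with no outgoing edges is spread uniformly.
        dangling = sum(rank[v] for v in vertices if out_deg[v] == 0)
        nxt = {v: (1.0 - damping) / n + damping * dangling / n
               for v in vertices}
        # Each vertex splits its rank evenly across its outgoing edges.
        for src, dst in edges:
            nxt[dst] += damping * rank[src] / out_deg[src]
        rank = nxt

    return rank


cycle_ranks = pagerank([(1, 2), (2, 3), (3, 1)])
star_ranks = pagerank([(1, 3), (2, 3)])
```

On the three-vertex cycle every vertex ends up with the same rank, while in the second graph vertex 3, which receives both edges, ranks highest.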

Giraph and GraphLab

Apache Giraph is an iterative graph processing framework, built on top of Apache Hadoop.
It is currently used at Facebook to analyze the social graph formed by users and their connections. Giraph originated as the open-source counterpart to Pregel, the graph processing architecture developed at Google and described in a 2010 paper.
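
The Pregel model is vertex-centric: computation proceeds in supersteps, and in each superstep a vertex reads the messages sent to it, updates its own value, and sends messages to its neighbors. The sketch below simulates that loop in a single Python process to compute connected components by propagating the smallest vertex id. It is a simplified illustration under those assumptions, not Giraph's or Pregel's API, and the function name is hypothetical.

```python
def pregel_connected_components(vertices, edges):
    """Label each vertex with the smallest vertex id in its component."""
    # Treat edges as undirected for the purpose of connectivity.
    neighbors = {v: set() for v in vertices}
    for a, b in edges:
        neighbors[a].add(b)
        neighbors[b].add(a)

    # Every vertex starts with its own id as its label.
    label = {v: v for v in vertices}

    # Superstep 0: every vertex sends its label to all neighbors.
    inbox = {v: [] for v in vertices}
    for v in vertices:
        for nb in neighbors[v]:
            inbox[nb].append(label[v])

    # Later supersteps: run until no messages are in flight.
    while any(inbox.values()):
        outbox = {v: [] for v in vertices}
        for v, messages in inbox.items():
            if not messages:
                continue  # no incoming messages: the vertex stays halted
            smallest = min(messages)
            if smallest < label[v]:
                # The label improved, so tell the neighbors about it.
                label[v] = smallest
                for nb in neighbors[v]:
                    outbox[nb].append(smallest)
        inbox = outbox

    return label


components = pregel_connected_components(
    vertices=[1, 2, 3, 4, 5],
    edges=[(1, 2), (2, 3), (4, 5)],
)
```

A real framework partitions the vertices across machines and delivers the messages over the network between supersteps; the control flow per vertex is the part this sketch tries to show.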
GraphLab is a high-performance distributed computation framework written in C++.
It was started by Prof. Carlos Guestrin of Carnegie Mellon University in 2009 and uses MPI for communication. Like Giraph, it keeps the graph in memory; unlike Giraph, it does not depend on Hadoop.

Performance Comparisons

[Figures: performance comparisons (images not available)]

A Small Pipeline in GraphX

[Figure: a small pipeline in GraphX (image not available)]

GraphX, Giraph, and GraphLab end-to-end performance

[Figure: GraphX, Giraph, and GraphLab end-to-end performance (image not available)]

Plato vs GraphX

[Figure: Plato vs GraphX (image not available)]

Intel Article on October 19, 2020

https://newsroom.intel.com/articles/intel-katana-graph-team-large-scale-graph-analytics
[Figure (image not available)]

Katana vs GraphX

[Figure: Katana vs GraphX (image not available)]

oneDAL Graph Analytics

[Figure: oneDAL Graph Analytics (image not available)]
