1.下载GIRAPH-798.patch
源文件:https://issues.apache.org/jira/browse/GIRAPH-818
(修改GIRAPH-798.patch的第一个文件pom.xml的diff,改为创建新文件,其他酌情修改)
修改好的源文件:https://pan.baidu.com/s/1wFJFqg1OEU75V20s12TQlA
2.maven 3.2.3
mkdir giraph_plus
cd giraph_plus
patch -p0 < GIRAPH-798.patch
mvn compile
3.下载hadoop-0.20.203.0
wget http://archive.apache.org/dist/hadoop/core/hadoop-0.20.203.0/
hadoop-0.20.203.0的分布式配置
http://blog.csdn.net/matraxa/article/details/7179366
(配置文件中端口号选择需要慎重,fs.default.name 9000 and mapred.job.tracker 9001,其他请自行尝试)
4.测试Giraph++(查看jar文件 jar vtf fileName.jar)
hadoop jar /home/username/giraph_plus/target/giraph-0.2-SNAPSHOT-jar-with-dependencies.jar com.ibm.giraph.graph.example.pagerank.DeltaPRGraph enron output 3 true 3
(worker为3,参数详情请查看源代码)