Compiling Spark from Source
Prerequisites
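- 64-bit JDK 8
- IDEA with the Scala and ANTLR v4 plugins installed
- The Aliyun mirror added to Maven's settings.xml
- The Spark source cloned from GitHub
- The project imported into IDEA, with IDEA pointed at the matching Maven repository
- The 3.2 branch (branch-3.2) checked out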
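The clone and branch-switch steps can be sketched as shell commands. This is only a sketch: it assumes a bash-compatible shell (for example Git Bash on Windows) and the official apache/spark repository on GitHub. The names SPARK_REPO, SPARK_BRANCH, DRY_RUN, and run are illustrative and not part of Spark. By default the script only prints the commands; set DRY_RUN=0 to run them.

```shell
# Sketch: fetch the Spark source and switch to the 3.2 maintenance branch.
# SPARK_REPO, SPARK_BRANCH, DRY_RUN and run() are illustrative names.
SPARK_REPO="https://github.com/apache/spark.git"
SPARK_BRANCH="branch-3.2"
DRY_RUN="${DRY_RUN:-1}"   # 1 = only print the commands, 0 = execute them

run() {
  if [ "$DRY_RUN" = "1" ]; then
    echo "+ $*"
  else
    "$@"
  fi
}

run git clone "$SPARK_REPO" spark
run git -C spark checkout "$SPARK_BRANCH"
```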
Building the project
mvn -Pyarn -Phive -Phive-thriftserver -DskipTests clean package
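Troubleshooting
- An Ant BuildException has occured: Execute failed: java.io.IOException: Cannot run program "bash": CreateProcess error=2, The system cannot find the file specified
The build calls a bash script to generate the Spark build information, and bash cannot be found on Windows. In spark-core's pom.xml, comment out the following block so that the build information is not generated: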
<!-- <plugin>-->
<!--   <groupId>org.apache.maven.plugins</groupId>-->
<!--   <artifactId>maven-antrun-plugin</artifactId>-->
<!--   <executions>-->
<!--     <execution>-->
<!--       <phase>generate-resources</phase>-->
<!--       <configuration>-->
<!--         Execute the shell script to generate the spark build information. -->
<!--         <target>-->
<!--           <exec executable="bash">-->
<!--             <arg value="${project.basedir}/../build/spark-build-info"/>-->
<!--             <arg value="${project.build.directory}/extra-resources"/>-->
<!--             <arg value="${project.version}"/>-->
<!--           </exec>-->
<!--         </target>-->
<!--       </configuration>-->
<!--       <goals>-->
<!--         <goal>run</goal>-->
<!--       </goals>-->
<!--     </execution>-->
<!--   </executions>-->
<!-- </plugin>-->
- [ERROR] Java heap space -> [Help 1]
Maven ran out of memory. Set the Maven options as recommended in the official documentation:
set MAVEN_OPTS=-Xss64m -Xmx2g -XX:ReservedCodeCacheSize=1g
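The `set` syntax above is for the Windows command prompt. If the build is launched from a Unix-like shell instead (Git Bash is an assumption here; these notes do not mention it), the same options can be exported as in the sketch below. The values are unchanged from the line above.

```shell
# Sketch: bash equivalent of the Windows `set MAVEN_OPTS=...` line.
# Quoting keeps the three JVM options together in a single variable.
export MAVEN_OPTS="-Xss64m -Xmx2g -XX:ReservedCodeCacheSize=1g"

# Print the value so it can be checked before starting the build.
echo "MAVEN_OPTS=$MAVEN_OPTS"
```

The variable only lasts for the current shell session, so it has to be exported in the same terminal that later runs mvn.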
Reference
https://spark.apache.org/docs/3.2.1/building-spark.html#buildmvn