Using xgboost4j-spark on Windows runs into a few problems, so I am recording them here.
Environment: Windows 7 + Spark 2.3.1 + XGBoost 0.81 + IDEA + Maven
1. Dependencies and code
Dataset download link
UCI Machine Learning Repository: Iris Data Set (archive.ics.uci.edu)
pom dependencies
<!-- https://mvnrepository.com/artifact/ml.dmlc/xgboost4j -->
<dependency>
    <groupId>ml.dmlc</groupId>
    <artifactId>xgboost4j</artifactId>
    <version>0.81</version>
</dependency>
<!-- https://mvnrepository.com/artifact/ml.dmlc/xgboost4j-spark -->
<dependency>
    <groupId>ml.dmlc</groupId>
    <artifactId>xgboost4j-spark</artifactId>
    <version>0.81</version>
</dependency>
Test code
import org.apache.spark.ml.feature.{StringIndexer}
import org.apache.spark.sql.types.{DoubleType, StringType, StructField, StructType}
import org.apache.spark.ml.feature.VectorAssembler
import org.apache.spark.sql.SparkSession
import ml.dmlc.xgboost4j.scala.spark.{XGBoostClassificationModel, XGBoostClassifier}
/**
 * author: wy
 * todo: XGBoost iris classification
 * Created by pc-admin on 2020-03-12 11:21
 **/
object xgboostIrisDataTest {
  def main(args: Array[String]): Unit = {
    val ss = SparkSession.builder().master("local[4]").appName("xgboostRisiDataTest").getOrCreate()
    val dataPa