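For logistic regression and GBDT, see the PySpark API documentation: http://spark.apache.org/docs/latest/api/python/pyspark.ml.html#pyspark.ml.classification.LogisticRegression .
For xgboost, see: https://xgboost.ai .
Only the parameters used for classification are described below (for regression, consult the documentation yourself).
Logistic regression: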
featuresCol = 'features'
labelCol = 'label'
predictionCol = 'prediction'
# Maximum number of iterations (>= 0).
maxIter = 100
# Regularization parameter (>= 0).
regParam = 0.0
# ElasticNet mixing parameter alpha, in range [0, 1]. For alpha = 0, the penalty is an L2 penalty. For alpha = 1, it is an L1 penalty. Selects between L1 and L2 regularization.
elasticNetParam = 0.0
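
To make the interaction between regParam and elasticNetParam concrete, here is a minimal sketch in plain Python. It assumes the elastic-net form described in Spark's linear methods guide, regParam * (alpha * ||w||_1 + (1 - alpha) / 2 * ||w||_2^2), and ignores feature standardization. The function name elastic_net_penalty and its arguments are hypothetical and are not part of the PySpark API.

```python
def elastic_net_penalty(weights, reg_param=0.0, elastic_net_param=0.0):
    """Illustrative elastic-net penalty for a list of coefficients.

    reg_param plays the role of regParam (overall strength, >= 0);
    elastic_net_param plays the role of elasticNetParam (alpha in [0, 1]).
    """
    if reg_param < 0:
        raise ValueError("reg_param must be >= 0")
    if not 0.0 <= elastic_net_param <= 1.0:
        raise ValueError("elastic_net_param must be in [0, 1]")

    l1_norm = sum(abs(w) for w in weights)      # ||w||_1
    l2_norm_sq = sum(w * w for w in weights)    # ||w||_2^2

    # alpha = 0 leaves only the L2 term, alpha = 1 leaves only the L1 term.
    mixed = (elastic_net_param * l1_norm
             + (1.0 - elastic_net_param) / 2.0 * l2_norm_sq)
    return reg_param * mixed
```

With the defaults listed here (regParam = 0.0), the penalty is zero regardless of elasticNetParam, so the mixing parameter only matters once regParam is set above zero.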
# the convergence tolerance for iterative algorithms (>= 0).