Computing approximate quantiles:
Use the DataFrame.approxQuantile() method.
Adding an index to a DataFrame:
First define a window, then apply row_number() from pyspark.sql.functions over it.
Example:
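from pyspark.sql import Window
from pyspark.sql import functions as F
# Order rows by the "aggressive" column; with no partitionBy, all rows are processed in a single partition.
w = Window.orderBy("aggressive")
# row_number() assigns consecutive integers starting at 1, following the window order.
withIndexDF = tmpDF.withColumn("index", F.row_number().over(w))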