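import pandas as pd
from pyspark.sql import SparkSession, Window, functions as F

spark = SparkSession.builder.getOrCreate()
df = spark.createDataFrame(pd.read_csv('sc.csv'))
# row_number() over a window ordered by question_id numbers the rows 1, 2, 3, ...
window = Window.orderBy('question_id')
df = df.withColumn('topn', F.row_number().over(window))
df.sort('question_id').show()
```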
PySpark: adding a column of monotonically increasing numbers starting from 1