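The DataFrame has a nested schema: a window struct column holding start and end timestamps, followed by the string columns lng_lat and mac and the integer column count.
The corresponding schema definition in PySpark is as follows: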
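from pyspark.sql.types import IntegerType, StringType, StructField, StructType, TimestampType

# "window" is a nested struct with the start and end timestamps; every field is nullable (True).
schema = StructType([
    StructField("window", StructType([
        StructField("start", TimestampType(), True),
        StructField("end", TimestampType(), True)]), True),
    StructField("lng_lat", StringType(), True),
    StructField("mac", StringType(), True),
    StructField("count", IntegerType(), True)])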