±---------±---------±-----+
|1159822043|1975964455| yes|
|1159822043| 252302513| yes|
|1159822043|4226086795| yes|
±---------±---------±-----+
only showing top 3 rows
### 三、同理将其余隔行依次映射
scala> val no = df.filter(col(“no”).isNotNull).select(col(“event”),col(“no”)).withColumn(“userid”,explode(split(col(“no”)," "))).drop($“no”).withColumn(“status”,lit(“no”))
±---------±---------±-----+
| event| userid|status|
±---------±---------±-----+
|1159822043|3575574655| no|
|1159822043|1077296663| no|
|1186208412|1728988561| no|
±---------±---------±-----+
only showing top 3 rows
no: Unit = ()
scala> val invited = df.filter(col(“invited”).isNotNull).select(col(“event”),col(“invited”)).withColumn(“userid”,explode(split(col(“invited”)," "))).drop($“invited”).withColumn(“status”,lit(“invited”)).show(3)
±-------