java面试题网站:www.javaoffers.com
collectAsMap 的使用对象必须是Tuple 元组类型,在spark中将元组类型转换为Map类型,
应用示例:
val a = sc.parallelize(List(2,3,4,5))
val b = sc.parallelize(List("a","b","c","d"))
a.zip(b).collectAsMap 结果为:scala.collection.Map[Int,String] = Map(2 -> a, 5 -> d, 4 -> c, 3 -> b)
b.zip(a).collectAsMap 结果为:scala.collection.Map[String,Int] = Map(b -> 3, d -> 5, a -> 2, c -> 4)