如果是 二元组:
val a = List(("a",2),("a",3),("d",333),("c",3),("c",4))
val b = a.groupBy(_._1)
println(b)
结果:
a: List[(String, Int)] = List((a,2), (a,3), (d,333), (c,3), (c,4))
b: scala.collection.immutable.Map[String,List[(String, Int)]] = Map(d -> List((d,333)), a -> List((a,2), (a,3)), c -> List((c,3), (c,4)))
Map(d -> List((d,333)), a -> List((a,2), (a,3)), c -> List((c,3), (c,4)))
只要同key的不同value组成的list
val c = b.mapValues(r => {r.map(r => {r._2})})
println(c)
结果:
c: scala.collection.immutable.Map[String,List[Int]] = Map(d -> List(333), a -> List(2, 3), c -> List(3, 4))
Map(d -> List(333), a -> List(2, 3), c -> List(3, 4))
如果是多元组,先将其变成为二元组
val e = List(("a",1000,2),("a",2222,3),("d",6888,333),("c",13,3),("c",14,4))
val f = e.map(r => {(r._1, (r._2, r._3))})
结果:
f: List[(String, (Int, Int))] = List((a,(1000,2)), (a,(2222,3)), (d,(6888,333)), (c,(13,3)), (c,(14,4)))
暂时没有查询到以多行数据作为key进行聚合的函数方法。