WordCountLow in Scala

午饭有鱼有虾9

Tags: Scala, WordCount, file operations, word splitting, grouping and counting

Article link: https://blog.csdn.net/weixin_56765170/article/details/117231279


import scala.io.Source

object $17_WordCountLow {
  def main(args: Array[String]): Unit = {
    // 1. Read the file into a list of lines, closing the source afterwards
    val source = Source.fromFile("datas/wc2.txt", "utf-8")
    val datas = try source.getLines().toList finally source.close()
    // List(hello hadoop flume kafka, kafka spark scala hadoop, hello java python hadoop, kafka flume spark spark, hello flume scala java)

    // 2. Split each line into words and flatten the result
    val words = datas.flatMap(line => line.split(" "))
    // List(hello, hadoop, flume, kafka, kafka, spark, ...)

    // 3. Group by word
    val groupedMap = words.groupBy(x => x)
    // Map(
    //   hello -> List(hello, hello, hello, ...),
    //   ...
    // )

    // 4. Count the occurrences of each word
    val result = groupedMap.map(x => {
      // x = hello -> List(hello, hello, hello, ...)
      (x._1, x._2.size)
    })
    // Map(word -> count, word -> count, ...)
    result.foreach(x => println(x))
    println("-" * 100)

    // The same steps chained into a single expression
    Source.fromFile("datas/wc.txt", "utf-8").getLines().toList.flatMap(_.split(" ")).groupBy(x => x).map(x => (x._1, x._2.size)).foreach(println(_))
  }
}
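
The four steps above (read, split and flatten, group, count) can also be separated into a pure counting function and a small file wrapper, which makes the logic easy to try without a data file. The following is only a sketch under some assumptions: it needs Scala 2.13 or later for `scala.util.Using`, the names `WordCountSketch`, `countWords`, and `countWordsInFile` are made up for illustration, and it splits on any run of whitespace rather than a single space, which is a deliberate change from the original code.

```scala
import scala.io.Source
import scala.util.Using

// Hypothetical object name, used only for this illustration
object WordCountSketch {

  // Pure function: counts words in lines that are already in memory
  def countWords(lines: Seq[String]): Map[String, Int] =
    lines
      .flatMap(_.split("\\s+"))   // split on any run of whitespace (assumption: not just single spaces)
      .filter(_.nonEmpty)          // drop empty tokens produced by blank lines or leading spaces
      .groupBy(identity)           // word -> all occurrences of that word
      .map { case (word, occurrences) => (word, occurrences.size) }

  // Reads the file at `path`; Using.resource closes the source even if counting throws
  def countWordsInFile(path: String): Map[String, Int] =
    Using.resource(Source.fromFile(path, "utf-8")) { source =>
      countWords(source.getLines().toList)
    }

  def main(args: Array[String]): Unit = {
    // In-memory sample, so the sketch runs without datas/wc2.txt
    val sample = List("hello hadoop flume kafka", "kafka spark scala hadoop")
    countWords(sample)
      .toSeq
      .sortBy { case (word, count) => (-count, word) } // most frequent first, ties alphabetical
      .foreach(println)
  }
}
```

Keeping `countWords` free of I/O means the grouping and counting can be checked against a small list, while `countWordsInFile("datas/wc2.txt")` would apply the same logic to the file used in the post.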
