mapreduce循环笔记

偷吃熊猫的竹子

于 2021-10-18 11:04:33 发布

阅读量159

点赞数

文章标签： java

本文链接：https://blog.csdn.net/m0_52672980/article/details/120822605

版权

mapreduce

1_bean

首先要实现（implements）接口WritableComparable +<泛型>，后定义所需要的变量，构造方法，编写get，set方法

其次实现比较器，重写compareTo（类+自定义对象）方法，进行升序和降序

  最后进行序列化    
  public void write(DataOutput out) throws IOException {
  out.writeUTF(word);
  out.writeInt(num);
  
  public void readFields(DataInput in) throws IOException {
  this.word = in.readUTF();
  this.num = in.readInt();
  }

最后最后进行 toString（）方法 return 参数+“/t”+参数形式

2_Mapper

首先要实现继承（extends）Mapper+<LongWritable,Text,SortBean(这是自定义bena类的名字),NullWritable/Text>（k—v，死记吧）

之后是重写 map方法(LongWritable key, Text value, Context context ) throws IOException, InterruptedException

之后是固定套路

  String[] split = value.toString().split("\t");

  SortBean sortBean = new SortBean();
  //将数据收集到SortBean对象中
  sortBean.setWord(split[0]);
  sortBean.setNum(Integer.parseInt(split[1]));

  //将K2和V2写入上下文中
  context.write(sortBean, NullWritable.get())

3_Reduce

首先继承 Reduce +<SortBean,NullWritable,SortBean,NullWritable>

之后重写reduce 方法

protected void reduce(SortBean key, Iterable values, Context context)

throws IOException, InterruptedException {
//收集数据
context.write(key, NullWritable.get());
}

4_Runner

mian 方法 thorws Exception 之后

new Configuration 对象

new job .getInstance（Configuration 对象的名字，“ 自定义名字”）

//指定job所在的jar包

job.setJarbyclass（名字.class）

偷吃熊猫的竹子

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
mapreduce循环笔记

mapreduce1_bean 首先要实现（implements）接口WritableComparable +<泛型>，后定义所需要的变量，构造方法，编写get，set方法其次实现比较器，重写compareTo（类+自定义对象）方法，进行升序和降序最后进行序列化 public void write(DataOutput out) throws IOException { out.writeUTF(word); out.wri
复制链接

扫一扫