基于JAVA语言编写MapReduce进行流量统计_使用mapreduce编程统计用户对网站内各网址的访问次数。-CSDN博客

本文链接：https://blog.csdn.net/weixin_46043015/article/details/106885651

首先来看下我们的数据
在这里插入图片描述注意:每个数据之间是以\t分割的。
每一行数据分别对应：手机号，IP地址，访问网址，上传流量，下载流量，状态码
任务：总计每个手机号的总上传和下载流量
java编写MapReduce程序，主要分为两大类

Mapper类

package com.zlj.mrtest.flowcount;

import java.io.IOException;

import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Mapper;

public class FlowCountMapper extends Mapper<LongWritable, Text, Text, Text>
{
   
	/**
	 * 这个context可以存储一些job conf的信息
	 * 同时context作为了map和reduce执行中各个函数的一个桥梁，这个设计和java
	 */
	@Override
	protected void map(LongWritable key, Text value,Context context)
			throws IOException, InterruptedException
	{
   
		//对数据进行切分
		String line = value.toString();
		String[] files = line.split("\t");
		//获取手机号码
		String phone = files[1];
		//获取下载流量
		String upflow = files[files.length - 3