Java用mapreduce输出数据保存到本地

最新推荐文章于 2024-05-11 14:39:48 发布

Xi Zi

最新推荐文章于 2024-05-11 14:39:48 发布

阅读量244

点赞数

文章标签： java mapreduce hadoop 开发语言大数据

本文链接：https://blog.csdn.net/weixin_35755188/article/details/129072164

版权

使用 MapReduce 进行数据处理时，通常是将数据处理后的结果输出到 Hadoop 的文件系统(例如 HDFS)中。但是，如果你想将处理后的数据保存到本地文件系统，也是可以的。

下面是一个示例代码，展示了如何使用 MapReduce 输出数据到本地文件系统：

``` import java.io.IOException; import java.io.OutputStream; import org.apache.hadoop.conf.Configuration; import org.apache.hadoop.fs.FileSystem; import org.apache.hadoop.fs.Path; import org.apache.hadoop.io.IntWritable; import org.apache.hadoop.io.Text; import org.apache.hadoop.mapreduce.Job; import org.apache.hadoop.mapreduce.Mapper; import org.apache.hadoop.mapreduce.Reducer; import org.apache.hadoop.mapreduce.lib.input.FileInputFormat; import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

public class WordCount {

public static class TokenizerMapper extends Mapper<Object, Text, Text, IntWritable>{

private final static IntWritable one = new IntWritable(1);
private Text word = new Text();

public void map(Object key, Text value, Context context
                ) throws IOException, InterruptedException {
  String[] words = value.toString().split(" ");
  for (String w : words) {
    word.set(w);
    context.write(word, one);
  }
}

}

public static class IntSumReducer extends Reducer<Text,IntWritable,Text,IntWritable> { private IntWritable result = new IntWritable();

public void reduce(Textkey, Iterable<IntWritable> values,
                   Context context
                   ) throws IOException, InterruptedException {
  int sum = 0;
  for (IntWritable val : values) {
    sum += val.get();
  }
  result.set(sum);
  context.write(key, result);
}

}

public static void main(String[] args) throws Exception { Configuration conf = new Configuration(); Job job = Job.getInstance(conf, "word count"); job.setJarByClass(WordCount.class); job.setMapperClass(TokenizerMapper.class); job.setCombinerClass(IntSumReducer.class); job.setReducerClass(IntSumRed

Xi Zi

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
Java用mapreduce输出数据保存到本地

使用 MapReduce 进行数据处理时，通常是将数据处理后的结果输出到 Hadoop 的文件系统(例如 HDFS)中。但是，如果你想将处理后的数据保存到本地文件系统，也是可以的。下面是一个示例代码，展示了如何使用 MapReduce 输出数据到本地文件系统：```import java.io.IOException;import java.io.OutputStream;import ...
复制链接

扫一扫