MapReduce: Merging Files

吃顿烧烤又胖三斤

Article link: https://blog.csdn.net/Werkple/article/details/104877373

Merge a number of small files into one large file.

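```java
import java.io.IOException;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.input.FileSplit;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

/**
 * Merges a large number of small files.
 *
 * @author DOIT_HANG_GE
 * @version 2019-03-01
 */
public class FileMerger {

    public static class FileMapper extends Mapper<LongWritable, Text, Text, Text> {
        // Name of the file this map task's split belongs to.
        private String fileName = null;
        // Buffers every line of the split; acceptable because the inputs are small files.
        private final StringBuilder sb = new StringBuilder();

        @Override
        protected void setup(Context context) throws IOException, InterruptedException {
            FileSplit fs = (FileSplit) context.getInputSplit();
            fileName = fs.getPath().getName();
        }

        @Override
        protected void map(LongWritable key, Text value, Context context)
                throws IOException, InterruptedException {
            // Collect the line; nothing is emitted until cleanup().
            sb.append(value.toString()).append('\t');
        }

        @Override
        protected void cleanup(Context context) throws IOException, InterruptedException {
            // Emit one record per split: (file name, all lines joined by tabs).
            context.write(new Text(fileName), new Text(sb.toString()));
        }
    }

    public static class FileReducer extends Reducer<Text, Text, Text, Text> {
        @Override
        protected void reduce(Text key, Iterable<Text> values, Context context)
                throws IOException, InterruptedException {
            // A small file normally yields a single value; iterate anyway so that
            // nothing is dropped if a file was read as more than one split.
            for (Text value : values) {
                context.write(key, value);
            }
        }
    }

    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        /* Uncomment to force local mode on the local file system:
        conf.set("mapreduce.framework.name", "local");
        conf.set("fs.defaultFS", "file:///");
        */
        Job job = Job.getInstance(conf);
        job.setJarByClass(FileMerger.class);

        job.setMapperClass(FileMapper.class);
        job.setReducerClass(FileReducer.class);

        job.setMapOutputKeyClass(Text.class);
        job.setMapOutputValueClass(Text.class);

        job.setOutputKeyClass(Text.class);
        job.setOutputValueClass(Text.class);

        FileInputFormat.setInputPaths(job, new Path("D:\\data\\merger\\input"));
        FileOutputFormat.setOutputPath(job, new Path("D:\\data\\merger\\output"));

        // A single reducer, so all records end up in one output file.
        job.setNumReduceTasks(1);
        boolean b = job.waitForCompletion(true);
        System.exit(b ? 0 : -1);
    }
}
```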
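To make the output layout concrete, here is a minimal plain-Java sketch that reproduces the same record format without Hadoop. It is only an illustration for checking small inputs locally, not part of the job above: the class `LocalMergeSketch` and its methods `toRecord` and `mergeDirectory` are hypothetical names. It assumes the default output format, which separates key and value with a tab, and it only approximates the key ordering of the shuffle by sorting file names as strings.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.Stream;

// Hypothetical helper (not from the original post): mimics the merge locally.
class LocalMergeSketch {

    // Builds "fileName<TAB>line1<TAB>line2<TAB>...", i.e. the key, the default
    // key/value separator, then every line followed by a tab as in FileMapper.
    static String toRecord(String fileName, List<String> lines) {
        StringBuilder sb = new StringBuilder(fileName).append('\t');
        for (String line : lines) {
            sb.append(line).append('\t');
        }
        return sb.toString();
    }

    // Produces one record per regular file in dir, ordered by file name.
    static List<String> mergeDirectory(Path dir) throws IOException {
        List<Path> files;
        try (Stream<Path> stream = Files.list(dir)) {
            files = stream
                    .filter(Files::isRegularFile)
                    .sorted(Comparator.comparing((Path p) -> p.getFileName().toString()))
                    .collect(Collectors.toList());
        }
        List<String> records = new ArrayList<>();
        for (Path file : files) {
            List<String> lines = Files.readAllLines(file, StandardCharsets.UTF_8);
            records.add(toRecord(file.getFileName().toString(), lines));
        }
        return records;
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: java LocalMergeSketch <input-dir>");
            return;
        }
        for (String record : mergeDirectory(Paths.get(args[0]))) {
            System.out.println(record);
        }
    }
}
```

Each input file becomes exactly one output line, so the merged file can later be split back by taking the first tab-separated field as the original file name. Note that the original line breaks are replaced by tabs, so lines that themselves contain tabs cannot be recovered unambiguously.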
