How to Output Debug Information in Hadoop

poson

License: CC 4.0 BY-SA

Article link: https://blog.csdn.net/poson/article/details/3538356

This article describes a way to debug Hadoop jobs: emit the debug information through the reduce stage, tag it with a dedicated key, and collect all of it in a single file.

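Debugging a Hadoop job is fairly troublesome. Since a job can only emit data through the reduce output, we can write the debug information to the reduce output as well and then route it to a dedicated file.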

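All debug records use the same key, "debug", and carry the debug message as the value. Because the key is identical for every debug record, they all arrive at the same reducer.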

(1) In map, emit the debug record directly:

output.collect(new Text("debug"), new Text("debug message"));

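(2) In reduce, check for the debug key and pass the values through unchanged:

// Text.equals() only matches another Text, so compare via toString()
if (key.toString().equals("debug"))
{
    while (values.hasNext())
    {
        String line = values.next().toString();
        output.collect(new Text("debug"), new Text(line));
    }
}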
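The check in step (2) depends on how the key is compared. Hadoop's `Text.equals` returns true only for another `Text`, so comparing a `Text` key directly against a `String` literal never matches; comparing `key.toString()` does. The sketch below reproduces that pitfall in plain Java so it can be run without a cluster. `KeyWrapper` is a hypothetical stand-in for a Hadoop key type, not a Hadoop class, and the method names are made up for illustration.

```java
import java.util.Objects;

public class KeyCompareDemo {
    /** Hypothetical stand-in for a Hadoop-style key type that wraps a string. */
    static final class KeyWrapper {
        private final String value;

        KeyWrapper(String value) {
            this.value = value;
        }

        @Override
        public boolean equals(Object o) {
            // Like Text, only another instance of the same type can be equal.
            return o instanceof KeyWrapper && value.equals(((KeyWrapper) o).value);
        }

        @Override
        public int hashCode() {
            return Objects.hash(value);
        }

        @Override
        public String toString() {
            return value;
        }
    }

    /** Pitfall: compares the wrapper object itself with a String. */
    static boolean isDebugNaive(KeyWrapper key) {
        return key.equals("debug");
    }

    /** Compares the string content of the key instead. */
    static boolean isDebug(KeyWrapper key) {
        return "debug".equals(key.toString());
    }

    public static void main(String[] args) {
        KeyWrapper key = new KeyWrapper("debug");
        System.out.println("naive check:    " + isDebugNaive(key));
        System.out.println("toString check: " + isDebug(key));
    }
}
```

An alternative in a real job would be to compare against a `Text` constant (for example a `static final Text` holding "debug"), which avoids creating a `String` for every key.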

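(3) Add the class ReportOutFormat:

// Imports required at the top of the enclosing file
import java.io.IOException;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.io.Writable;
import org.apache.hadoop.io.WritableComparable;
import org.apache.hadoop.mapred.JobConf;
import org.apache.hadoop.mapred.RecordWriter;
import org.apache.hadoop.mapred.TextOutputFormat;
import org.apache.hadoop.mapred.lib.MultipleOutputFormat;
import org.apache.hadoop.util.Progressable;

public static class ReportOutFormat<K extends WritableComparable<?>, V extends Writable>
        extends MultipleOutputFormat<K, V> {
    private TextOutputFormat<K, V> theTextOutputFormat = null;

    @Override
    protected RecordWriter<K, V> getBaseRecordWriter(FileSystem fs,
            JobConf job, String name, Progressable arg3) throws IOException {
        // Create the underlying text writer lazily and reuse it
        if (theTextOutputFormat == null) {
            theTextOutputFormat = new TextOutputFormat<K, V>();
        }
        return theTextOutputFormat.getRecordWriter(fs, job, name, arg3);
    }

    @Override
    protected String generateFileNameForKeyValue(K key, V value, String name) {
        // Note this check: compare the key's string form, not the key object itself
        if (key.toString().equals("debug"))
            return "debug" + name;
        return name;
    }
}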
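To make the effect of the file-name rule concrete, here is a minimal plain-Java sketch of the routing it performs. It does not use Hadoop. `fileNameFor` and `route` are hypothetical helpers that only mirror the rule above, and the default file name `part-00000` is an assumption about what the framework passes in as `name`.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

public class DebugRoutingDemo {
    /** Mirrors the naming rule: debug records get a "debug" prefix on the default name. */
    static String fileNameFor(String key, String name) {
        return "debug".equals(key) ? "debug" + name : name;
    }

    /** Groups key/value records by the output file they would be written to. */
    static Map<String, List<String>> route(List<String[]> records, String name) {
        Map<String, List<String>> files = new TreeMap<>();
        for (String[] kv : records) {
            String file = fileNameFor(kv[0], name);
            // One tab-separated line per record, as a text output would write it
            files.computeIfAbsent(file, f -> new ArrayList<>()).add(kv[0] + "\t" + kv[1]);
        }
        return files;
    }

    public static void main(String[] args) {
        List<String[]> records = Arrays.asList(
            new String[] {"debug", "parsed 3 fields"},
            new String[] {"hadoop", "2"},
            new String[] {"debug", "skipped empty line"});

        // Print each simulated output file with the lines routed to it
        route(records, "part-00000").forEach((file, lines) ->
            System.out.println(file + " -> " + lines));
    }
}
```

Under these assumptions the two debug records end up together under the prefixed name, while the ordinary record stays under the default name, which is what keeps the debug output separate from the job's real results.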

(4) Add the following code to configJob:

protected void configJob(JobConf conf)
{
    conf.setMapOutputKeyClass(Text.class);
    conf.setMapOutputValueClass(Text.class);
    conf.setOutputKeyClass(Text.class);
    conf.setOutputValueClass(Text.class);
    conf.setOutputFormat(ReportOutFormat.class); // add this line
}