一个Hadoop小程序:将Hello world写入文件中,简要代码:
public static void main(String[] args) throws Exception {
Configuration conf = new Configuration();
Job job = new Job(conf,"demo");
job.setJarByClass(HelloWorld.class);
job.setMapperClass(MyMapper.class);
job.setCombinerClass(MyReducer.class);
job.setReducerClass(IntSumReducer.class);
job.setOutputKeyClass(Text.class);
job.setOutputValueClass(IntWritable.class);
FileInputFormat.addInputPath(job,new Path("/anders/passwd"));
FileOutputFormat.setOutputPath(job, new Path("/anders/out1"));
System.exit(job.waitForCompletion(true) ? 0 : 1);
}
mapper reducer 不上了……
用eclipse 运行:run on hadoop出现
Exception in thread "main" org.apache.hadoop.mapreduce.lib.input.InvalidInputException: Input path does not exist: file:/anders/passwd
at org.apache.hadoop.mapreduce.lib.input.FileInputFormat.listStatus(FileInputFormat.java:235)
at org.apache.hadoop.mapreduce.lib.input.FileInputFormat.getSplits(FileInputFormat.java:252)
at org.apache.hadoop.mapred.JobClient.writeNewSplits(JobClient.java:962)
at org.apache.hadoop.mapred.JobClient.writeSplits(JobClient.java:979)
at org.apache.hadoop.mapred.JobClient.access$600(JobClient.java:174)
at org.apache.hadoop.mapred.JobClient$2.run(JobClient.java:897)
at org.apache.hadoop.mapred.JobClient$2.run(JobClient.java:850)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:415)
at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1121)
at org.apache.hadoop.mapred.JobClient.submitJobInternal(JobClient.java:850)
at org.apache.hadoop.mapreduce.Job.submit(Job.java:500)
at org.apache.hadoop.mapreduce.Job.waitForCompletion(Job.java:530)
at cn.sdu.hadoop.HelloWorld.main(HelloWorld.java:58)
就是文件路径不对,google一下
修改代码:
FileInputFormat.addInputPath(job,new Path("hdfs://localhost:9000/anders/passwd"));
FileOutputFormat.setOutputPath(job, new Path("hdfs://localhost:9000/anders/out1"));
这样才能正确,这个路径是根据core-site.xml里面的fs.default.name
这回就可以用eclipse运行了。。。。。