java MapReduce步骤

1、Eclipse简历Maven工程,由Maven管理jar,打开pom.xml——>添加

<build>  
        <plugins>  
            <plugin>  
            <artifactId>maven-assembly-plugin</artifactId>  <span style="white-space:pre">			</span><!--maven一个插件,具体查看另一博客-->
            <version>2.4.1</version>
                <configuration>  
                    <archive>  
                        <manifest>  
                            <mainClass>hadooptest.TestStart</mainClass>         <!--执行主函数-->
                        </manifest>  
                    </archive>  
                    <descriptorRefs>  
                        <descriptorRef>jar-with-dependencies</descriptorRef>  <span style="white-space:pre">	</span><!--打包时把依赖jar包打入包中-->
                    </descriptorRefs>  
                </configuration>  
            </plugin>  
        </plugins>  
    </build>  
	<dependencies>
		<dependency>
			<groupId>org.apache.hadoop</groupId><span style="white-space:pre">			</span><!--maven管理依赖-->
			<artifactId>hadoop-core</artifactId>
			<version>1.2.1</version>
		</dependency>
	</dependencies>


2、新建mapper处理,代码如下

import java.io.IOException;
import java.util.StringTokenizer;

import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapred.MapReduceBase;
import org.apache.hadoop.mapred.Mapper;
import org.apache.hadoop.mapred.OutputCollector;
import org.apache.hadoop.mapred.Reporter;

public class TestMapper extends MapReduceBase implements Mapper<LongWritable, Text, Text, IntWritable> {
	private final static IntWritable one = new IntWritable(1);
	private final Text word = new Text();
	public void map(LongWritable key, Text value,
			OutputCollector<Text, IntWritable> output, Reporter reporter)
			throws IOException {
		String line = value.toString();
		StringTokenizer tokenizer = new StringTokenizer(line);
		while (tokenizer.hasMoreTokens()) {
			word.set(tokenizer.nextToken());
			output.collect(word, one);
		}
	}

}

3、新建Reduce处理,代码如下:

import java.io.IOException;
import java.util.Iterator;

import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapred.MapReduceBase;
import org.apache.hadoop.mapred.OutputCollector;
import org.apache.hadoop.mapred.Reducer;
import org.apache.hadoop.mapred.Reporter;

public class TestReduce extends MapReduceBase implements
		Reducer<Text, IntWritable, Text, IntWritable> {

	public void reduce(Text key, Iterator<IntWritable> value,
			OutputCollector<Text, IntWritable> output, Reporter reporter)
			throws IOException {
		int sum = 0;
		while (value.hasNext()) {
			sum += value.next().get();
		}
		output.collect(key, new IntWritable(sum));
	}

}
4、新建main方法,代码如下

import java.io.IOException;
import java.util.Iterator;

import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapred.MapReduceBase;
import org.apache.hadoop.mapred.OutputCollector;
import org.apache.hadoop.mapred.Reducer;
import org.apache.hadoop.mapred.Reporter;

public class TestReduce extends MapReduceBase implements
		Reducer<Text, IntWritable, Text, IntWritable> {

	public void reduce(Text key, Iterator<IntWritable> value,
			OutputCollector<Text, IntWritable> output, Reporter reporter)
			throws IOException {
		int sum = 0;
		while (value.hasNext()) {
			sum += value.next().get();
		}
		output.collect(key, new IntWritable(sum));
	}

}

5、所有ok后,在当前工程下,执行命令mvn assembly:assembly。在target下面找到jar包


6、拷贝生成的jar到,运行的hadoop服务器下。启动hadoop

执行java -jar XXX.jar  本地输入文件   本地输出文件


  • 0
    点赞
  • 1
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值