java文本文件保存在hbase中,java – 从System读取文本文件到Hbase MapReduce

最新推荐文章于 2022-10-24 10:56:54 发布

uptbio

最新推荐文章于 2022-10-24 10:56:54 发布

阅读量193

点赞数

文章标签： java文本文件保存在hbase中

对于从文本文件中读取,首先文本文件应该在hdfs中.

您需要为作业指定输入格式和输出格式

Job job = new Job(conf, "example");

FileInputFormat.addInputPath(job, new Path("PATH to text file"));

job.setInputFormatClass(TextInputFormat.class);

job.setMapperClass(YourMapper.class);

job.setMapOutputKeyClass(Text.class);

job.setMapOutputValueClass(Text.class);

TableMapReduceUtil.initTableReducerJob("hbase_table_name", YourReducer.class, job);

job.waitForCompletion(true);

YourReducer应扩展org.apache.hadoop.hbase.mapreduce.TableReducer< Text,Text,Text>

样本减速器代码

public class YourReducer extends TableReducer {

private byte[] rawUpdateColumnFamily = Bytes.toBytes("colName");

/**

* Called once at the beginning of the task.

*/

@Override

protected void setup(Context context) throws IOException, InterruptedException {

// something that need to be done at start of reducer

}

@Override

public void reduce(Text keyin, Iterable values, Context context) throws IOException, InterruptedException {

// aggregate counts

int valuesCount = 0;

for (Text val : values) {

valuesCount += 1;

// put date in table

Put put = new Put(keyin.toString().getBytes());

long explicitTimeInMs = new Date().getTime();

put.add(rawUpdateColumnFamily, Bytes.toBytes("colName"), explicitTimeInMs,val.toString().getBytes());

context.write(keyin, put);

}

}

}

示例映射器类

public static class YourMapper extends Mapper {

private final static IntWritable one = new IntWritable(1);

private Text word = new Text();

public void map(LongWritable key, Text value, Context context) throws IOException, InterruptedException {

String line = value.toString();

StringTokenizer tokenizer = new StringTokenizer(line);

while (tokenizer.hasMoreTokens()) {

word.set(tokenizer.nextToken());

context.write(word, one);

}

}

}

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
复制链接

分享到 QQ

分享到新浪微博

扫一扫

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。