HBase coprocessor endpoint in practice: querying HBase data after salting

finad01

Category: Big Data - HBase. Tags: endpoint, hbase, HBase coprocessor, HBase data salting

Article link: https://blog.csdn.net/finad01/article/details/46841933

1 Introduction

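The previous article described how to salt HBase data and briefly outlined how to query the salted data, but gave no concrete implementation. This article shows one, built on an HBase coprocessor endpoint.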

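Coprocessors can be loaded in two ways. A system coprocessor is loaded globally and applies to every table on a region server; a table coprocessor is attached by the user to one specific table. For flexibility, the coprocessor framework offers two kinds of plug-in. One is the observer, which resembles a trigger in a relational database. The other is the endpoint, a dynamic extension that behaves somewhat like a stored procedure. The implementation in this article is an endpoint.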

2 Implementation

2.1 Example

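First, look at the example that ships with HBase, RowCountEndpoint.java, which counts the rows of a table. Its source is in the hbase-examples module, in the package org.apache.hadoop.hbase.coprocessor.example.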

    // Excerpt from RowCountEndpoint; env is the class's RegionCoprocessorEnvironment field.
    public void getRowCount(RpcController controller, ExampleProtos.CountRequest request,
            RpcCallback<ExampleProtos.CountResponse> done) {
        Scan scan = new Scan();
        // Only the first cell of each row is needed to count rows.
        scan.setFilter(new FirstKeyOnlyFilter());
        ExampleProtos.CountResponse response = null;
        InternalScanner scanner = null;
        try {
            // Scan only the region this endpoint instance runs on.
            scanner = env.getRegion().getScanner(scan);
            List<Cell> results = new ArrayList<Cell>();
            boolean hasMore = false;
            byte[] lastRow = null;
            long count = 0;
            do {
                hasMore = scanner.next(results);
                for (Cell kv : results) {
                    byte[] currentRow = CellUtil.cloneRow(kv);
                    // Count a row only when the rowkey changes.
                    if (lastRow == null || !Bytes.equals(lastRow, currentRow)) {
                        lastRow = currentRow;
                        count++;
                    }
                }
                results.clear();
            } while (hasMore);
            response = ExampleProtos.CountResponse.newBuilder()
                    .setCount(count).build();
        } catch (IOException ioe) {
            // Report the failure to the caller through the RPC controller.
            ResponseConverter.setControllerException(controller, ioe);
        } finally {
            if (scanner != null) {
                try {
                    scanner.close();
                } catch (IOException ignored) {}
            }
        }
        // Hand the response (null on failure) back to the RPC framework.
        done.run(response);
    }

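The implementation is straightforward: each region scans all of its rows and returns its own row count, and the client adds up the counts from all regions to get the row count of the whole table.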
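The client-side summation described above can be sketched in plain Java. This is a minimal illustration, not the HBase client code: the class name RegionCountAggregator, the region names, and the counts are all made up, and the map stands in for the per-region results that a real client would get back from its coprocessor call.

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class RegionCountAggregator {

    // Adds up the partial row counts reported by the individual regions.
    public static long totalRows(Map<String, Long> countsByRegion) {
        long total = 0;
        for (long regionCount : countsByRegion.values()) {
            total += regionCount;
        }
        return total;
    }

    public static void main(String[] args) {
        // Hypothetical results: one entry per region, keyed by an invented region name.
        Map<String, Long> countsByRegion = new LinkedHashMap<>();
        countsByRegion.put("region-a", 120L);
        countsByRegion.put("region-b", 75L);
        countsByRegion.put("region-c", 0L);

        System.out.println(totalRows(countsByRegion)); // prints 195
    }
}
```

The same pattern carries over to the salted query later on: every region answers for its own slice of the data, and the client merges the partial answers.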

2.2 Server implementation

Next is a query method for salted HBase data, modeled on RowCountEndpoint.

1) Interface protocol definition

HBase uses protobuf for its internal communication, so we first have to generate a protocol class, like ExampleProtos above. We define our own protocol class, DataProtos:

    package generated;

    option java_package = "com.bigdata.coprocessor.endpoint.generated";
    option java_outer_classname = "DataProtos";
    option java_generic_services = true;
    option java_generate_equals_and_hash = true;
    option optimize_for = SPEED;

    // Request shared by both query methods.
    message DataQueryRequest {
        optional string tableName = 1;
        optional string startRow = 2;
        optional string endRow = 3;
        optional string rowKey = 4;
        optional bool incluedEnd = 5;
        optional bool isSalting = 6;
    }

    // Response: a list of rows, each with its cells.
    message DataQueryResponse {
        message Cell {
            required bytes value = 1;
            required bytes family = 2;
            required bytes qualifier = 3;
            required bytes row = 4;
        }
        message Row {
            optional bytes rowKey = 1;
            repeated Cell cellList = 2;
        }
        repeated Row rowList = 1;
    }

    service QueryDataService {
        // Query by start and end rowkey.
        rpc queryByStartRowAndEndRow(DataQueryRequest)
            returns (DataQueryResponse);
        // Query by a single rowkey.
        rpc queryByRowKey(DataQueryRequest)
            returns (DataQueryResponse);
    }

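The file defines the request message DataQueryRequest and the response message DataQueryResponse, plus a service, QueryDataService, with two methods: one queries by start and end rowkey, the other by a single rowkey. The corresponding Java class then has to be generated with protoc.exe.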

Running protoc.exe DataProtos.proto --java_out=e:\hbase\protoc-2.4.1 generates DataProtos.java. I have also uploaded the protoc.exe tool, which you can download and use.
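To make the request fields more concrete, here is a rough sketch of how a region-side handler might turn startRow, endRow, incluedEnd and isSalting into a physical scan range. It is not the article's implementation. The class SaltedScanRange, the method forRegion, and the salt format "03_" are hypothetical, and the sketch assumes each region knows the salt prefix of the keys it holds. The inclusive end row is handled by appending a zero character, which yields the smallest key that sorts after endRow.

```java
public class SaltedScanRange {
    public final String startRow; // inclusive
    public final String stopRow;  // exclusive

    private SaltedScanRange(String startRow, String stopRow) {
        this.startRow = startRow;
        this.stopRow = stopRow;
    }

    // Builds the range one region would scan for a logical [startRow, endRow] query.
    // saltPrefix is the (assumed) prefix shared by all rowkeys in that region.
    public static SaltedScanRange forRegion(String saltPrefix, String startRow,
            String endRow, boolean includeEnd, boolean isSalting) {
        // Unsalted tables are scanned with the logical keys unchanged.
        String prefix = isSalting ? saltPrefix : "";
        String start = prefix + startRow;
        String stop = prefix + endRow;
        if (includeEnd) {
            // Appending NUL gives the smallest key greater than endRow,
            // so an exclusive stop row still covers endRow itself.
            stop = stop + "\u0000";
        }
        return new SaltedScanRange(start, stop);
    }

    public static void main(String[] args) {
        // Hypothetical region holding keys salted with "03_".
        SaltedScanRange range =
                SaltedScanRange.forRegion("03_", "user100", "user200", false, true);
        System.out.println(range.startRow + " .. " + range.stopRow);
        // prints 03_user100 .. 03_user200
    }
}
```

Strings are used here because the request carries startRow and endRow as strings; for ASCII keys their ordering matches the byte ordering HBase uses for rowkeys.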

2) Implementing the coprocessor

server
