Hive Impala Count 效率比对

本文对比了Hive和Impala在处理十万数量级数据时的count查询效率,探讨了两种技术在批处理和准实时查询场景的应用差异。
摘要由CSDN通过智能技术生成

Hive 主要是跑批处理的,Impala主要是做准实时的内存查询,我拿他俩做比较~~感觉我好没意思阿~~

一、count 比较

十万数量级

Imapal

[cdh-node2:21000] > select count(1) from userinfo;
Query: select count(1) from userinfo
+----------+
| count(1) |
+----------+
| 124850   |
+----------+
Fetched 1 row(s) in 2.39s
[cdh-node2:21000] > select count(1) from userinfo;
Query: select count(1) from userinfo
+----------+
| count(1) |
+----------+
| 124850   |
+----------+
Fetched 1 row(s) in 0.57s
[cdh-node2:21000] > 

Hive

    > select count(1) from userinfo;
Total jobs = 1
Launching Job 1 out of 1
Number of reduce tasks determined at compile time: 1
In order to change the average load for a reducer (in bytes):
  set hive.exec.reducers.bytes.per.reducer=<number>
In order to limit the maximum number of reducers:
  set hive.exec.reducers.max=<number>
In order to set a constant number of reducers:
  set mapreduce.job.reduces=<number>
Starting Job = job_1422624309453_0060, Tracking URL = http://cdh-node1:8088/proxy/application_1422624309453_0060/
Kill Command = /opt/cloudera/parcels/CDH-5.3.0-1.cdh5.3.0.p0.30/lib/hadoop/bin/hadoop job  -kill job_1422624309453_0060
Hadoop job information for Stage-1: number of mappers: 1; number of reducers: 1
2015-01-31 18:06:57,974 Stage-1 map = 0%,  reduce = 0%
2015-01-31 18:07:06,297 Stage-1 map &#
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值