hive分区对应hadoop_Hadoop之hive中sql常用函数汇总

最新推荐文章于 2022-06-27 10:20:34 发布

weixin_39733948

最新推荐文章于 2022-06-27 10:20:34 发布

阅读量423

点赞数

文章标签： hive分区对应hadoop

版权声明：本文为博主原创文章，遵循 CC 4.0 BY-SA 版权协议，转载请附上原文出处链接和本声明。

本文链接：https://blog.csdn.net/weixin_39733948/article/details/111907380

版权

1、hive执行引擎 mr/tez/spark

set hive.execution.engine = mr;

2、开启动态分区

set hive.exec.dynamic.partition = true;

set hive.exec.dynamic.partition.mode = nonstrict;

## 删除分区：

ALTER TABLE dm.user_action_self_help_w_wi DROP IF EXISTS PARTITION (dt='2019-08-15',pd=2);

3、with 连接词

with TABLE_NAME AS (

SELECT ... FROM ... WHERE ...

)

-- 首个连接需要with，后续不要with：

TABLE_NAME AS (

SELECT ... FROM ... WHERE ...

)

4、为字段重命名

old_name as new_name

-- 或(不加as)：

old_name new_name

5、row_number() over(partition by A order by B asc/desc)

row_number() over(partition by A,B,C order by D asc/desc)

-- 将查询结果按照A,B,C字段分组(partition)，

-- 然后组内按照D字段排序，至于asc还是desc，可自行选择，

-- 然后为每行记录返回一个row_number用于标记顺序(编号)

特色功能1：给已有hive表(dm.official_accounts_funscount_w) 添加一列序号(sample_key)，例：

select

row_number() over(

partition by case when t.source is not null then 1 end

order by t.source asc,t.funCounts desc

) as sample_key,

t.source,

t.cityName,

t.weight,

t.strArea,

t.end_date,

t.funCounts

from dm.official_accounts_funscount_w t;

特色功能2：给表(多个字段)中某个字段去重，例：

-- 临时表2：去重数据

drop table if exists dm.table_info__02;

create table dm.table_info_02 stored as parquet as

select

*

from

(

select

*,

row_number() over(partition by id order by ti

最低0.47元/天解锁文章

weixin_39733948

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
hive分区对应hadoop_Hadoop之hive中sql常用函数汇总

1、hive执行引擎 mr/tez/sparkset hive.execution.engine = mr;2、开启动态分区set hive.exec.dynamic.partition = true;set hive.exec.dynamic.partition.mode = nonstrict;## 删除分区：ALTER TABLE dm.user_action_self_help_w_wi ...
复制链接

扫一扫

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。