Hive / Presto 行转列 列转行

行转列

1、Hive:

collect_set转为数组并去重,concat_ws将数组用逗号间隔连接成字符串

select 
    fuid,
    concat_ws(',', collect_set(cast(fdeal_id as string) )) as order_ids
from tmp.tmp_test
where dt = '2022-03-31'
    and event_type = 1
group by fuid

2、Presto:

array_agg转为数组,array_distinct去重,array_join将数组用逗号间隔连接成字符串

select 
	fuid,
    array_join(array_distinct(array_agg( cast(fdeal_id as varchar) )), ',') as order_ids
from tmp.tmp_test
where dt = '2022-03-31'
	and event_type = 1
group by fuid

列转行

Hive

1、split将order_ids拆分成数组,lateral view explode将数组炸裂开

select a.fuid
    , b.fdeal_id
from tmp.tmp_test a
lateral view explode(split(order_ids, ',')) b as fdeal_id

###炸裂 + map

select  model_code,
        item_code,
        refer_enum,
        busi_cnt
from
(select model_code
       item_code,
       count(distinct if(item_value >= 2 and item_value <= 5,business_id,null)) as cnt2,
       count(distinct if(item_value >= 6 and item_value <= 9,business_id,null)) as cnt3,
       count(distinct if(item_value >= 10 and item_value <= 12,business_id,null)) as cnt4
from tmp.tmp_test
where dt = '2021-05-24'
group by model_code,
         item_code) a
lateral view explode(map('2-5', cnt2,
                         '6-9', cnt3,
                         '10-12', cnt4)) b as refer_enum, busi_cnt

Presto

1、split将order_ids拆分成数组,cross join unnest将数组炸裂开

select a.fuid
    , b.fdeal_id
from tmp.tmp_test a
cross join unnest(split(order_ids, ',')) as b(fdeal_id) 

2、炸裂 + map

select 
    t1.fuid,
    t2.lable_name, 
    t2.label_value
from (
        select t1.fuid,                                   
               cast(t1.bus_type as varchar) bus_type,        
               t1.dept_code,                    
               t1.dept_name,                    
               cast(t1.black_gold as varchar) black_gold,                         
               cast(t1.chat_tag as varchar) chat_tag                       
          from tmp.tmp_test t1
         where t1.dt = '2021-06-30'
       ) t1
 cross join unnest (
  array['bus_type', 'dept_code', 'dept_name', 'black_gold', 'chat_tag'],
  array[bus_type, dept_code, dept_name, black_gold,chat_tag]
 ) t2 (lable_name, label_value)
  • 12
    点赞
  • 9
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
Hive中的列转行行转列是通过使用一些特定的内置函数和关键字来实现的。列转行可以使用EXPLODE函数,该函数可以将一个包含复杂结构的数组或者映射拆分成多行。行转列可以使用collect_set函数,该函数将某一列的所有数据转化为一个集合,并且可以使用concat_ws函数将集合中的所有元素以逗号分割连接成一个字符串。此外,为了使用EXPLODE和LATERAL VIEW函数,你可以使用LATERAL VIEW关键字,语法为LATERAL VIEW udtf(expression) tableAlias AS columnAlias。<span class="em">1</span><span class="em">2</span><span class="em">3</span> #### 引用[.reference_title] - *1* [hive列转行案例](https://download.csdn.net/download/weixin_38581777/14037437)[target="_blank" data-report-click={"spm":"1018.2226.3001.9630","extra":{"utm_source":"vip_chatgpt_common_search_pc_result","utm_medium":"distribute.pc_search_result.none-task-cask-2~all~insert_cask~default-1-null.142^v93^chatsearchT3_2"}}] [.reference_item style="max-width: 33.333333333333336%"] - *2* [hive行转列列转行](https://blog.csdn.net/qq_24790473/article/details/109710145)[target="_blank" data-report-click={"spm":"1018.2226.3001.9630","extra":{"utm_source":"vip_chatgpt_common_search_pc_result","utm_medium":"distribute.pc_search_result.none-task-cask-2~all~insert_cask~default-1-null.142^v93^chatsearchT3_2"}}] [.reference_item style="max-width: 33.333333333333336%"] - *3* [hive操作(行转列列转行)](https://blog.csdn.net/aiduo3346/article/details/102085019)[target="_blank" data-report-click={"spm":"1018.2226.3001.9630","extra":{"utm_source":"vip_chatgpt_common_search_pc_result","utm_medium":"distribute.pc_search_result.none-task-cask-2~all~insert_cask~default-1-null.142^v93^chatsearchT3_2"}}] [.reference_item style="max-width: 33.333333333333336%"] [ .reference_list ]
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值