Hive: computing the average month-over-month ratio over the last three months

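The formula is already given in the summary.
Assume a table t1 with a user id (id), an amount (amount), and a transaction date (bill_date).
First, compute the total amount per user per month.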

with t2 as (
    -- one row per user per month, with that month's total amount
    select
        id,
        sum(amount) sum_amount,
        date_format(bill_date,'yyyy-MM') date_month
    from t1
    group by id,
        date_format(bill_date,'yyyy-MM')
)
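To make the grouping step concrete, here is a minimal Python sketch of the same aggregation on a few made-up rows. It assumes bill_date arrives as a 'yyyy-MM-dd' string; monthly_totals and sample_rows are hypothetical names used only for illustration and are not part of the original query.

```python
from collections import defaultdict


def monthly_totals(rows):
    """Sum amount per (id, 'yyyy-MM') pair, like GROUP BY id, date_format(bill_date, 'yyyy-MM')."""
    totals = defaultdict(float)
    for user_id, amount, bill_date in rows:
        month = bill_date[:7]  # assumption: bill_date is an ISO 'yyyy-MM-dd' string
        totals[(user_id, month)] += amount
    return dict(totals)


# Made-up sample data: (id, amount, bill_date)
sample_rows = [
    ("u1", 100.0, "2023-01-05"),
    ("u1", 50.0, "2023-01-20"),
    ("u1", 300.0, "2023-02-11"),
    ("u2", 80.0, "2023-02-02"),
]
totals = monthly_totals(sample_rows)
```

Each key of totals corresponds to one row of t2: the two January rows of u1 collapse into a single monthly total.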

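Next, use the window functions row_number() and lead(), partitioned by id and ordered by month descending. The three lead() calls pull the totals of the previous month, the month before that, and the one before that onto the current row. Because lead() steps over rows rather than calendar months, "previous month" here means the previous month in which the user has transactions. Finally, the windowed count() records how many months have transactions for each id.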

with t3 as (
    -- if t2 and t3 go into one statement, chain them: with t2 as (...), t3 as (...)
    select
        t2.id,
        t2.date_month,
        t2.sum_amount,
        row_number() over(partition by t2.id order by t2.date_month desc) rk,
        lead(t2.sum_amount,1,null) over(partition by t2.id order by t2.date_month desc) last_one,
        lead(t2.sum_amount,2,null) over(partition by t2.id order by t2.date_month desc) last_two,
        lead(t2.sum_amount,3,null) over(partition by t2.id order by t2.date_month desc) last_three,
        count(t2.date_month) over(partition by t2.id) count_month
    from t2
)
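The behaviour of these window functions can be checked on a tiny dataset. The sketch below runs the same window expressions through Python's built-in sqlite3 module, used here only as a stand-in for Hive (it assumes an SQLite build of 3.25 or later, which added window functions). The table contents are made up, and u1 deliberately has no row for 2023-03.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("create table t2 (id text, date_month text, sum_amount real)")
conn.executemany(
    "insert into t2 values (?, ?, ?)",
    [
        ("u1", "2023-01", 100.0),
        ("u1", "2023-02", 200.0),
        ("u1", "2023-04", 300.0),  # made-up data: no transactions in 2023-03
    ],
)

query = """
select
    id,
    date_month,
    sum_amount,
    row_number() over (partition by id order by date_month desc) as rk,
    lead(sum_amount, 1) over (partition by id order by date_month desc) as last_one,
    lead(sum_amount, 2) over (partition by id order by date_month desc) as last_two,
    lead(sum_amount, 3) over (partition by id order by date_month desc) as last_three,
    count(date_month) over (partition by id) as count_month
from t2
order by id, rk
"""
rows = conn.execute(query).fetchall()
latest = rows[0]  # the rk = 1 row for u1
print(latest)
```

On the rk = 1 row, last_one holds the 2023-02 total rather than a value for March, last_three is null, and count_month is 3, which is why the final query branches on count_month.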

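Finally, use the coalesce function, which takes its arguments from left to right and returns the first one that is not null.
The filter rk = 1 drops the redundant rows: the earlier months' totals have already been pulled onto the latest month's row, so only that row is needed per id.
An id with a single month has no ratio to compute and gets the placeholder 999999. Any count outside 1 to 4 falls into the ELSE branch (-999999), which includes ids with more than four months in t2, so t1 presumably needs to be limited to the most recent four months beforehand.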

select
    t3.id,
    (case
        when t3.count_month = 4 then (t3.sum_amount / t3.last_one + t3.last_one / t3.last_two + t3.last_two / t3.last_three) / (t3.count_month - 1)
        when t3.count_month = 3 then (coalesce(t3.sum_amount,t3.last_one) / coalesce(t3.last_one,t3.last_two) + coalesce(t3.last_one,t3.last_two) / coalesce(t3.last_two,t3.last_three)) / (t3.count_month - 1)
        when t3.count_month = 2 then coalesce(t3.sum_amount,t3.last_one,t3.last_two) / coalesce(t3.last_one,t3.last_two,t3.last_three)
        when t3.count_month = 1 then 999999
        else -999999
    end) as flag
from t3
where t3.rk = 1
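As a cross-check of the CASE expression, the following Python sketch computes the same value from a list of monthly totals ordered newest first. It is an illustration under assumptions, not the author's code: average_mom_ratio is a hypothetical name, all totals are assumed to be non-null and non-zero (Python raises an error on division by zero, whereas Hive returns null), and the ratio is taken as newer month divided by older month, as in the query.

```python
def average_mom_ratio(amounts_newest_first):
    """Average of consecutive newer/older ratios for 2 to 4 monthly totals.

    Mirrors the CASE branches: 999999 when there is exactly one month,
    -999999 for any other count outside 2..4.
    """
    n = len(amounts_newest_first)
    if n == 1:
        return 999999
    if n not in (2, 3, 4):
        return -999999
    # Pair each month with the one before it: (newest, previous), (previous, older), ...
    ratios = [
        newer / older
        for newer, older in zip(amounts_newest_first, amounts_newest_first[1:])
    ]
    return sum(ratios) / (n - 1)


four_months = average_mom_ratio([400.0, 200.0, 100.0, 50.0])  # (2 + 2 + 2) / 3
three_months = average_mom_ratio([300.0, 200.0, 100.0])       # (1.5 + 2) / 2
```

With four months of data there are three ratios to average, with three months there are two, and with two months the single ratio is returned as is, which matches dividing by count_month - 1.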

That's all. I will keep posting the SQL from my work that I found hard to write, to share with everyone.
