问题描述:1、hive中表数据字段为json格式,如何解析数据及处理数据?
2、hive中如何将etl处理后的数据存为json格式?
1、解析hive中表数据字段为json格式的数据
app_name,container,platform,get_json_object(biz,'$.desc') ===> pv
-- 传统计算,给定参数值:
SELECT
count(distinct distinct_id) UV, -- 2
count(distinct_id) PV -- 61
from ods.t_user_behavior
WHERE get_json_object(biz,'$.desc') = '点击-新运营位'
and platform = "miniProgram"
and container = "WX_MiniApp"
and app_name = "xxxxx小程序"
and get_json_object(biz,'$.action') is not null;
--------------------
-- 自助化计算:
-- 给定计算维度:
-- 1、常规维度:app_name,container,platform,
-- 2、业务维度:get_json_object(biz,'$.desc')
-- 3、指标:pv
-- 步骤一:根据设定的维度计算所有情况(T+1):
SELECT
app_name,
platform,
container,
get_json_object(biz,'$.desc') desc,
count(di