1、有多张大表group by的hql,关闭hive.groupby.skewindata,按需手动处理数据倾斜
2、设置Mapjoin参数:
set hive.auto.convert.join = true;
set hive.mapred.local.mem=2048;
set hive.mapjoin.localtask.max.memory.usage = 0.999;
set hive.mapjoin.smalltable.filesize = 10000000;
set hive.auto.convert.join.noconditionaltask = true;
set hive.auto.convert.join.noconditionaltask.size = 25000000;