自己构建疫情数据实现多重分区表

最新推荐文章于 2024-11-17 00:00:00 发布

风口浪尖上的鼠标

最新推荐文章于 2024-11-17 00:00:00 发布

阅读量608

点赞数

分类专栏： Hadoop大数据技术与应用文章标签： hive 数据库 hadoop

本文链接：https://blog.csdn.net/qq_52770248/article/details/127976471

版权

Hadoop大数据技术与应用专栏收录该内容

10 篇文章 0 订阅

订阅专栏

自己构建疫情数据实现多重分区表

– 多重分区：分区字段之间有一个递进关系
–大于等于2个分区字段的就是多重分区。常用的是2重分区
– 创建一个多重分区表：

create table if not exists hive_Zhangyue_nolocation.sichuan_covid_count(
id int comment 'ID',
country string comment "分区",
cas int comment "确诊",
asy int comment "无症状"
)partitioned by (province string,city string)row format delimited fields terminated by ',';

– 加载数据

load data local inpath "/home/myhadoop/data/chendu.txt"into table hive_Zhangyue_nolocation.sichuan_covid_count
partition (province='sicuan',city='chendu');
load data local inpath "/home/myhadoop/data/deyang.txt"into table hive_Zhangyue_nolocation.sichuan_covid_count
partition (province='sicuan',city='deyang');
load data local inpath "/home/myhadoop/data/bazhong.txt"into table hive_Zhangyue_nolocation.sichuan_covid_count
partition (province='sicuan',city='bazhong');

– 验证分区表

show partitions hive_Zhangyue_nolocation.sichuan_covid_count;

– 验证数据

select *from sichuan_covid_count;

你还感兴趣：
1、疫情数据实现静态和动态加载数据到分区表
2、使用美国疫情数据创建分区表
3、使用有location 和无location创建HIVE数据库
4、在数据集目录中选取1个数据文件内部表创建表，选取1个数据文件创建外部表。

风口浪尖上的鼠标

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
打赏
0
评论
复制链接

分享到 QQ

分享到新浪微博

扫一扫

专栏目录