Hive: Three Ways to Associate Data with a Partitioned Table

Article link: https://blog.csdn.net/houkai18792669930/article/details/105904671


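When data is uploaded directly into a partition directory, Hive does not see it until the partition is registered in the table's metadata. Here are three ways to associate the partitioned table with its data.

Creating the test_partitions table: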

create table test_partitions(name string)
partitioned by (month string, day string)
row format delimited fields terminated by '\t';
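Hive keeps each partition in a subdirectory named after its `column=value` pairs, in the order the columns appear in `partitioned by`. As a small sketch of that layout, the helper below prints the directory expected for a given month and day. The table root `/hive/test_partitions` is an assumption taken from the paths used later in this post (it depends on where the warehouse directory is configured), and `partition_dir` is a hypothetical helper name.

```shell
#!/bin/sh
# Assumed table root, taken from the paths used in this post.
TABLE_ROOT=/hive/test_partitions

# Hypothetical helper: print the HDFS directory for one partition.
# $1 = month value, $2 = day value
partition_dir() {
    printf '%s/month=%s/day=%s\n' "$TABLE_ROOT" "$1" "$2"
}

partition_dir 202005 3
```

The directory name has to match this pattern for the methods below to work, because Hive derives the partition values from the path.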

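Method 1: Upload the data, then repair the table

# Create the partition directory if it does not exist yet (shell)
hadoop fs -mkdir -p /hive/test_partitions/month=202005/day=3;
# Upload the data (shell)
hadoop fs -put test1.txt /hive/test_partitions/month=202005/day=3;
-- Run the repair command so the metastore picks up the new partition (Hive)
msck repair table test_partitions;
-- The query now returns the uploaded data (Hive)
select * from test_partitions where month='202005' and day='3';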
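Because the repair command scans the whole table directory, one run can register several newly uploaded partitions at once. The sketch below stages a few days and then repairs the table a single time. It is only an illustration: the local file names `test_<day>.txt`, the helper names `run` and `stage_days`, and the `DRY_RUN` switch are all assumptions, and by default the script only prints the commands instead of executing them.

```shell
#!/bin/sh
# Assumed table root, taken from the paths used in this post.
TABLE_ROOT=/hive/test_partitions
# 1 = only print the commands (default), 0 = actually execute them.
DRY_RUN=${DRY_RUN:-1}

# Print the command in dry-run mode, otherwise execute it.
run() {
    if [ "$DRY_RUN" = 1 ]; then
        echo "$*"
    else
        "$@"
    fi
}

# $1 = month, remaining arguments = day values.
# Local files named test_<day>.txt are hypothetical.
stage_days() {
    month=$1
    shift
    for day in "$@"; do
        dir="$TABLE_ROOT/month=$month/day=$day"
        run hadoop fs -mkdir -p "$dir"
        run hadoop fs -put "test_$day.txt" "$dir"
    done
    # One repair registers every partition directory found under the table.
    run hive -e "msck repair table test_partitions;"
}

stage_days 202005 3 4
```

In dry-run mode the printed `hive -e` line loses its quotes, so it is meant for inspection rather than copy and paste; set `DRY_RUN=0` to execute the commands for real.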

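Method 2: Upload the data, then add the partition

# Create the partition directory (shell)
hadoop fs -mkdir -p /hive/test_partitions/month=202005/day=3;
# Upload the data (shell)
hadoop fs -put test.txt /hive/test_partitions/month=202005/day=3;
-- Add the partition to the table's metadata (Hive)
alter table test_partitions add partition(month='202005', day='3');
-- Query the data (Hive)
select * from test_partitions where month='202005' and day='3';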
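Running `add partition` a second time for the same partition fails because the partition already exists. If the step may be repeated, for example in a scheduled job, the `if not exists` form avoids that error. A minimal sketch, where `add_partition_sql` is a hypothetical helper that only builds the statement:

```shell
#!/bin/sh
# Hypothetical helper: build an idempotent "add partition" statement.
# $1 = month value, $2 = day value
add_partition_sql() {
    printf "alter table test_partitions add if not exists partition(month='%s', day='%s');\n" "$1" "$2"
}

add_partition_sql 202005 3
```

The generated statement can be pasted into the Hive CLI or passed to it directly, for example `hive -e "$(add_partition_sql 202005 3)"`.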

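Method 3: Create the directory, then load the data into the partition

# Create the partition directory (shell)
hadoop fs -mkdir -p /hive/test_partitions/month=202005/day=3;
-- Load the local file; this copies it into the partition directory and registers the partition (Hive)
load data local inpath 'test.txt' into table test_partitions partition(month='202005', day='3');
-- Query the data (Hive)
select * from test_partitions where month='202005' and day='3';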
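Whichever method is used, `show partitions` is a quick way to confirm that the partition really is registered; it prints one line per partition in the form `month=202005/day=3`. The sketch below checks that output for a given partition. `has_partition` is a hypothetical helper that reads the `show partitions` output from standard input.

```shell
#!/bin/sh
# Hypothetical helper: succeed if the partition appears in the
# "show partitions" output supplied on stdin.
# $1 = month value, $2 = day value
has_partition() {
    grep -qxF "month=$1/day=$2"
}

# Example with captured output instead of a live Hive session:
printf 'month=202005/day=3\n' | has_partition 202005 3 && echo "registered"
```

Against a live cluster the same check would look like `hive -S -e "show partitions test_partitions;" | has_partition 202005 3`, where `-S` keeps the Hive CLI output quiet.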
