Hive表分区

最新推荐文章于 2023-09-19 10:35:45 发布

哈维先生

最新推荐文章于 2023-09-19 10:35:45 发布

阅读量125

点赞数

分类专栏： Hive数据仓库文章标签： hive 分区

本文链接：https://blog.csdn.net/magicharvey/article/details/20387059

版权

Hive数据仓库专栏收录该内容

13 篇文章 0 订阅

订阅专栏

新建一个有一个分区dt的表baseinfo。

hive> create table baseinfo(
    > id int,
    > name string
    > )partitioned by (dt string) row format delimited fields terminated by '\t' lines terminated by '\n' stored as textfile; 
OK
Time taken: 0.132 seconds

在linux下面建立一个文件/baseinfo.txt，文件内容为

1 qiu

2 liu

将文件中的内容导入baseinfo表中，设置分区的值为当前日期'2014-03-03'。

hive> load data local inpath '/baseinfo.txt' overwrite into table baseinfo partition(dt = '2014-03-03');
Copying data from file:/baseinfo.txt
Copying file: file:/baseinfo.txt
Loading data to table hbmsdb.baseinfo partition (dt=2014-03-03)
Deleted hdfs://hadoop00:9000/user/hive/warehous/hbmsdb.db/baseinfo/dt=2014-03-03
OK
Time taken: 0.442 seconds

在linux下面建立一个文件/baseinfo2.txt，文件内容为

3 zheng

4 yang

将文件中的内容导入baseinfo表中，设置分区的值为当期日期‘2014-03-04’。

hive> load data local inpath '/baseinfo.txt' overwrite into table baseinfo partition(dt = '2014-03-04');                                                      
Copying data from file:/baseinfo.txt
Copying file: file:/baseinfo.txt
Loading data to table hbmsdb.baseinfo partition (dt=2014-03-04)
OK
Time taken: 0.189 seconds

查看分区

hive> show partitions baseinfo;                                                                         
OK
dt=2014-03-03
dt=2014-03-04
Time taken: 0.054 seconds

查看数据

hive> select * from baseinfo;
OK
1       qiu     2014-03-03
2       liu     2014-03-03
3       zheng   2014-03-04
4       yang    2014-03-04
Time taken: 0.141 seconds
hive> select * from baseinfo where dt = '2014-03-03';
OK
1       qiu    2014-03-03
2       liu    2014-03-03
Time taken: 0.149 seconds

通过查询将内容保存在本地文件系统上

hive> insert overwrite  local directory '/home/hive' select * from baseinfo;

哈维先生

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
Hive表分区

新建一个有一个分区dt的表baseinfo。hive> create table baseinfo( > id int, > name string > )partitioned by (dt string) row format delimited fields terminated by '\t' lines terminated by '\n' stored
复制链接

扫一扫

专栏目录