hive 创建和删除库、表

最新推荐文章于 2024-04-07 19:42:46 发布

YannAdams

最新推荐文章于 2024-04-07 19:42:46 发布

阅读量1k

点赞数

分类专栏： Hive Spark 文章标签： Hive

本文链接：https://blog.csdn.net/u012280876/article/details/101296306

版权

Spark 同时被 2 个专栏收录

6 篇文章 1 订阅

订阅专栏

Hive

3 篇文章 0 订阅

订阅专栏

hive 创建和删除库、表

前言
hive 库操作
hive表操作
参考博客

前言

本文主要介绍hive 中操作库和表的语句。

hive 库操作

查看数据库：show databases
创建数据库：create database if not exists hive_testdb
使用某个数据库：use hive_testdb
查询数据库：show databases like ‘hive*’
显示数据库的信息：describe database hive_testdb
添加数据库备注：create database teacherdb comment
删除空数据库：drop database hive_testdb
强制删除数据库：drop database hive_testdb casecade

hive表操作

查看表的列表：show tables
创建表：create table if not exists hive_db.table_test(id string, name string, age int, …)
删除表：drop table if exists table_test
重命名表：alter table table_test rename to table1
修改表中列信息：alter table table_test change columns id tb_id int
增加列：alter table table_test add columns(class string commet “班级”)
删除或者替换列：alter table table_test replace columns(id string commet “备注”，name string commet “备注”);//这里是将所用列全部删除后再新建id和name列
复制表及数据：create table table_test1 as select * from table_test
导入表数据：load data local inpath ‘xxx(数据目录)/data/xxx.xxx’ overwrite into table hive_db.table_test
查询表：show tables in hive_db like “table*”
查看表信息：desc table_test
内部表与外部表的相互转换：
alter table table_test set tblproperties(“EXTERNAL”=“TRUE”)　　//内部表转换为外部表
alter table table_test set tblproperties(“EXTERNAL”=“FALSE”)　　//外部表转换为内部表
分区表（物理分区，逻辑上还是整表）
创建分区表(按照月份来分区)：create table table_partition(id string,name string) partitioned by (month string) row format delimited fields terminated by ‘\t’
上传数据到分区表：load data local inpath ‘xxx(数据目录)/data/xxx.xxx’ into table table_partition partition(month=“201909”)
查找分区表：select * from table_partition //查找分区表中的所有记录
select * from table_partition where month=“201909”　　//查找分区表中分区名201909中的所有记录
查看分区：show partitions table_partition
增加分区：alter table table_partition add partition (month=“201910”)
alter table table_partition add partition (month=“201911”), partition (month=“201912”)
删除分区：alter table table_partition drop partition(month=“201909”)
ps：二级分区指的是2个分区字段，按照字段的顺序来设置分区顺序，比如：partition(month=“201909”,day=“01”)就是一个二级分区，目录结构中day是month文件夹的子文件夹。

参考博客

hive的数据定义之创建数据库和表 - hdc520 - 博客园

YannAdams

关注

0
点赞
踩
1

收藏

觉得还不错? 一键收藏
0
评论
hive 创建和删除库、表

hive 创建和删除库、表前言hive 库操作hive表操作参考博客前言本文主要介绍hive 中操作库和表的语句。hive 库操作查看数据库：show databases创建数据库：create database if not exists hive_testdb使用某个数据库：use hive_testdb查询数据库：show databases like ‘hive*’显示数...
复制链接

扫一扫

专栏目录