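Hive Partitions
Purpose of partitioning: split a large dataset into smaller pieces so that a query scans less data and runs faster. For example, data can be partitioned by day, so that each day's job reads only the previous day's partition.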
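- Syntax for creating a partitioned table
A partition column must not have the same name as any regular column in the table.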
create table score(s_id string, c_id string, s_score int)
partitioned by (month string)
row format delimited fields terminated by '\t';
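As a quick check, and assuming the `score` table above was created successfully, describing the table should list `month` together with the regular columns. Because the partition column is exposed to queries like any other column, its name cannot collide with one of them. This is only a sketch, and the exact output layout depends on the Hive version:

```sql
-- The partition column is reported alongside the data columns,
-- and again in a separate partition information section.
describe score;
```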
- Create a table with multiple partition columns
create table score2 (s_id string,c_id string, s_score int)
partitioned by (year string, month string, day string)
row format delimited fields terminated by '\t';
- Load data into a partitioned table
load data local inpath '/hivedatas/score.csv' into table score
partition (month='201806');
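To confirm that the load created the partition, and to see how partitioning reduces the amount of data scanned, something like the following can be run. It is a minimal sketch that assumes the `score` table and the `month='201806'` partition from the statements above:

```sql
-- List the partitions Hive has registered for the table.
show partitions score;

-- Filtering on the partition column lets Hive read only the files
-- of that partition instead of scanning the whole table.
select s_id, c_id, s_score
from score
where month = '201806';
```

A query without a filter on `month` would still work, but it would have to read every partition.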
- Load data into a table with multiple partition columns
load data local inpath '/hivedatas/score.csv' into table score2 partition(year='2018',