Hive LOAD usage:
load data local inpath '/home/data/stg_activity_antirush_apply.txt'
overwrite into table stg_activity_antirush_apply;
The `local` keyword: without it, data is loaded from HDFS; with it, data is loaded from the local filesystem.
The `overwrite` keyword performs an overwrite load, replacing any data already in the table.
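For contrast, a load without `local` and without `overwrite` might look like the sketch below (path is illustrative). Note that a LOCAL load copies the source file, while a load from HDFS moves the file into the table's directory:

```sql
-- No LOCAL: the file is moved from its HDFS location into the table's directory.
-- No OVERWRITE: the rows are appended to whatever the table already holds.
load data inpath '/user/data/stg_activity_antirush_apply.txt'
into table stg_activity_antirush_apply;
```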
Dynamic partition settings
These can be set per session with SET statements, or configured permanently in hive-site.xml:
SET hive.exec.dynamic.partition=true;
SET hive.exec.dynamic.partition.mode=nonstrict;
SET hive.exec.max.dynamic.partitions.pernode=1000;
SET hive.exec.max.dynamic.partitions=1000;
<property>
    <name>hive.exec.dynamic.partition</name>
    <value>true</value>
    <description>Whether or not to allow dynamic partitions in DML/DDL.</description>
</property>
<property>
    <name>hive.exec.dynamic.partition.mode</name>
    <value>nonstrict</value>
    <description>
        In strict mode, the user must specify at least one static partition
        in case the user accidentally overwrites all partitions.
        In nonstrict mode all partitions are allowed to be dynamic.
    </description>
</property>
<property>
    <name>hive.exec.max.dynamic.partitions</name>
    <value>100000</value>
    <description>Maximum number of dynamic partitions allowed to be created in total.</description>
</property>
<property>
    <name>hive.exec.max.dynamic.partitions.pernode</name>
    <value>10000</value>
    <description>Maximum number of dynamic partitions allowed to be created in each mapper/reducer node.</description>
</property>
<property>
    <name>hive.exec.max.created.files</name>
    <value>100000</value>
    <description>Maximum number of HDFS files created by all mappers/reducers in a MapReduce job.</description>
</property>
insert overwrite table dw_activity_antirush_apply PARTITION(DateID = ${etlDate}) -- static partition, overwrite
select * from tablename;
insert into table dw_activity_antirush_apply PARTITION(DateID = ${etlDate}) -- static partition, append
select * from tablename;
insert overwrite table dw_activity_antirush_apply PARTITION(DateID) -- dynamic partition
select *, column_name as dateid from tablename;
For dynamic partitioning, Hive takes the partition value from the last column of the SELECT, matched by position rather than by name.
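Putting the settings and the insert together, a dynamic-partition load for a single session might look like the sketch below (the column names in the SELECT are illustrative):

```sql
-- Enable dynamic partitioning for this session only.
SET hive.exec.dynamic.partition=true;
SET hive.exec.dynamic.partition.mode=nonstrict;

-- The last SELECT column (apply_date) supplies the DateID partition value.
insert overwrite table dw_activity_antirush_apply PARTITION(DateID)
select a.apply_id, a.user_id, a.apply_date
from stg_activity_antirush_apply a;
```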
INSERT OVERWRITE replaces any data already in the target table or partition.
INSERT INTO appends to the table or partition; existing data is kept.
INSERT can target either a table or a partition; if the table is partitioned, the INSERT must specify which partition to write to, either statically or via dynamic partitioning.
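As a sketch of the difference between the two (the partition value is illustrative):

```sql
-- First run: partition 20240101 now contains exactly the selected rows.
insert overwrite table dw_activity_antirush_apply PARTITION(DateID = 20240101)
select * from tablename;

-- Second run: the same rows are appended, so the partition now holds duplicates.
insert into table dw_activity_antirush_apply PARTITION(DateID = 20240101)
select * from tablename;
```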