Importing JSON data into Hive (partitioned Hive tables)

Latest recommended article published 2022-09-05 19:06:53

SeaSky_Steven

Category: hive. Tags: hive, json

Original link: https://www.bbsmax.com/A/QW5YY36N5m/

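When you create an external partitioned table in Hive and the external data is in JSON format, how do you get the data into the table?

The JSON data itself does not need to contain the partition column. The partition only has to be reflected in the HDFS directory structure.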
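To illustrate, here is a minimal sketch that stages such a layout locally before it is uploaded to HDFS. The staging path `staging/new`, the partition value `2013010100`, and the `id` and `text` fields are all hypothetical and chosen only for this example. The records carry no `datehour` field; the partition value lives only in the directory name. A text-based JSON SerDe reads one JSON object per line, so the file is written that way.

```shell
#!/usr/bin/env bash
# Local staging directory mirroring the HDFS table directory (hypothetical path).
STAGE=staging/new
PART=datehour=2013010100   # hypothetical partition value, encoded in the directory name

mkdir -p "$STAGE/$PART"

# One JSON object per line; note that no record contains a datehour field.
printf '%s\n' \
  '{"id": 1, "text": "first tweet"}' \
  '{"id": 2, "text": "second tweet"}' \
  > "$STAGE/$PART/abc.json"

# Show the resulting layout.
find "$STAGE" -type f
```

A tree like this could then be copied to the table location with something like `hdfs dfs -put staging/new/datehour=2013010100 /tmp/new/`.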

This is all according to this guide: http://blog.cloudera.com/blog/2012/12/how-to-use-a-serde-in-apache-hive/

In /tmp/new I have a file abc.json

The CREATE EXTERNAL TABLE command runs properly, but the table does not pick up any data.

Note that for an external partitioned table, partitions have to be added manually.
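The original CREATE EXTERNAL TABLE statement is not reproduced here, so the following is only a sketch of what such a statement might look like. The columns `id` and `text` are hypothetical, the location `/tmp/new` is assumed from the question, and the SerDe class `org.apache.hive.hcatalog.data.JsonSerDe` is an assumption as well (it ships in the hive-hcatalog-core jar, which must be on Hive's classpath; the guide linked above uses its own SerDe class). The script is a dry run by default: it writes the DDL to a file and prints the `hive` command instead of executing it.

```shell
#!/usr/bin/env bash
# Dry run by default: commands prefixed with $RUN are printed, not executed.
# Invoke as `RUN= bash create_tweets.sh` to actually run them.
RUN=${RUN-echo}

# Write the DDL to a file. Columns and SerDe class are assumptions for illustration.
cat > create_tweets.hql <<'EOF'
CREATE EXTERNAL TABLE IF NOT EXISTS tweets (
  id BIGINT,
  text STRING
)
PARTITIONED BY (datehour INT)
ROW FORMAT SERDE 'org.apache.hive.hcatalog.data.JsonSerDe'
STORED AS TEXTFILE
LOCATION '/tmp/new';
EOF

# The partition column datehour is declared only in PARTITIONED BY,
# not in the column list, and does not appear in the JSON records.
$RUN hive -f create_tweets.hql
```

Creating the table this way registers no partitions, which is why a SELECT returns nothing until a partition is added.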

Steps:

1.) Run the create table statement.
2.) In the directory /tmp/new/, create a subdirectory datehour=<some int value> and put your .json file inside it. In other words, you only need to create a directory on HDFS whose name carries the partition information, place the data in that directory, and then add the partition.
3.) Run the alter table statement to add this partition to the metadata:
alter table tweets add partition(datehour=<some int value>);
4.) Now run the select statement.
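Steps 2 to 4 can be sketched as a small shell script. This assumes that `/tmp/new` is the table's location on HDFS, that `abc.json` already sits there, and that the `hdfs` and `hive` command-line tools are available; the partition value `2013010100` is hypothetical. As a precaution the script is a dry run by default and only prints the commands.

```shell
#!/usr/bin/env bash
# Dry run by default: commands prefixed with $RUN are printed, not executed.
# Invoke as `RUN= bash add_partition.sh` to actually run them.
RUN=${RUN-echo}

TABLE=tweets             # table name from the alter table statement
BASE_DIR=/tmp/new        # assumed table LOCATION on HDFS
DATEHOUR=2013010100      # hypothetical partition value
PART_DIR="$BASE_DIR/datehour=$DATEHOUR"

# Step 2: create the partition directory and move the JSON file into it.
$RUN hdfs dfs -mkdir -p "$PART_DIR"
$RUN hdfs dfs -mv "$BASE_DIR/abc.json" "$PART_DIR/"

# Step 3: register the partition in the metastore. With no explicit LOCATION,
# Hive looks for the data under <table location>/datehour=<value>.
$RUN hive -e "ALTER TABLE $TABLE ADD IF NOT EXISTS PARTITION (datehour=$DATEHOUR);"

# Step 4: query the newly added partition.
$RUN hive -e "SELECT * FROM $TABLE WHERE datehour=$DATEHOUR LIMIT 10;"
```

The `IF NOT EXISTS` clause is an addition to the statement in step 3; it makes the script safe to rerun for a partition that has already been registered.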
