Importing Data into a Hive Partitioned Table with Python

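Background for this post: I already had the data and wanted to analyze it with Hive, but getting it into Hive turned out to be a struggle.
First, a look at the data source.
First 10 rows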
NASDAQ,ABXA,2009-12-09,2.55,2.77,2.50,2.67,158500,2.67
NASDAQ,ABXA,2009-12-08,2.71,2.74,2.52,2.55,131700,2.55
NASDAQ,ABXA,2009-12-07,2.65,2.76,2.65,2.71,174200,2.71
NASDAQ,ABXA,2009-12-04,2.63,2.66,2.53,2.65,230900,2.65
NASDAQ,ABXA,2009-12-03,2.55,2.62,2.51,2.60,360900,2.60
NASDAQ,ABXA,2009-12-02,2.41,2.59,2.40,2.53,287700,2.53
NASDAQ,ABXA,2009-12-01,2.35,2.44,2.27,2.40,302000,2.40
NASDAQ,ABXA,2009-11-30,2.36,2.36,2.11,2.25,446100,2.25
NASDAQ,ABXA,2009-11-27,2.35,2.42,2.30,2.35,135200,2.35
NASDAQ,ABXA,2009-11-25,2.48,2.49,2.40,2.45,77500,2.45
Last 10 rows
NYSE,IX,1998-10-06,31.74,31.74,31.36,31.36,2000,25.43
NYSE,IX,1998-10-05,32.55,32.55,32.55,32.55,1400,26.39
NYSE,IX,1998-10-02,34.49,34.49,34.49,34.49,500,27.96
NYSE,IX,1998-09-30,34.61,34.61,34.61,34.61,800,28.06
NYSE,IX,1998-09-29,34.36,34.36,34.36,34.36,100,27.86
NYSE,IX,1998-09-24,32.49,32.49,32.49,32.49,200,26.34
NYSE,IX,1998-09-23,31.61,31.99,31.61,31.99,2200,25.93
NYSE,IX,1998-09-21,32.92,32.99,32.92,32.99,200,26.74
NYSE,IX,1998-09-17,32.80,32.92,32.80,32.86,1800,26.64
NYSE,IX,1998-09-16,32.67,32.80,32.67,32.74,2600,26.54

Data characteristics
Column 1 has only two distinct values: NASDAQ/NYSE
Column 2 has 742 distinct values, for example ABXA/AIPC/AAME/ATML/AVCA/ATAX
The data has 2075391 rows in total
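
Counts like the ones above can be checked with a short script. The following is a minimal sketch, assuming the data is a headerless comma-separated file laid out like the sample rows; the function name `profile_columns` is hypothetical and not part of the original post.

```python
import csv
import io
from collections import Counter


def profile_columns(lines):
    """Return (row count, per-exchange row counts, set of symbols)."""
    exchanges = Counter()
    symbols = set()
    total = 0
    for row in csv.reader(lines):
        if not row:  # skip blank lines
            continue
        exchanges[row[0]] += 1  # column 1: exchange
        symbols.add(row[1])     # column 2: ticker symbol
        total += 1
    return total, exchanges, symbols


# Small in-memory sample built from rows shown in the post.
sample = io.StringIO(
    "NASDAQ,ABXA,2009-12-09,2.55,2.77,2.50,2.67,158500,2.67\n"
    "NASDAQ,ABXA,2009-12-08,2.71,2.74,2.52,2.55,131700,2.55\n"
    "NYSE,IX,1998-10-06,31.74,31.74,31.36,31.36,2000,25.43\n"
)
total, exchanges, symbols = profile_columns(sample)
print(total, sorted(exchanges), sorted(symbols))
```

For the full data set, the same function can be given an open file handle (for example `open(path, newline="")`, where `path` is wherever the file lives) instead of the in-memory sample.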

Create two tables in Hive: stocks, which is partitioned, and stocks1, which is not

create_stocks1.sql

CREATE EXTERNAL TABLE IF NOT EXISTS stocks1
(
exchanges STRING,
symbol STRING,
ymd STRING,
price_open FLOAT,
price_high FLOAT,
price_low FLOAT,
price_close FLOAT,
volume INT,
price_adj_close FLOAT
)
ROW FORMAT DELIMITED FIELDS TERMINATED BY ',';

create_stocks.sql

CREATE EXTERNAL TABLE IF NOT EXISTS stocks
(
symbol STRING,
price_open FLOAT,
price_high FLOAT,
price_low FLOAT,
price_close FLOAT,
volume INT,
price_adj_close FLOAT
)
PARTITIONED BY (exchanges STRING,ymd STRING)
ROW FORMAT DELIMITED FIELDS TERMINATED BY ',';
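
In a partitioned table the partition columns are not stored inside the data files. Hive encodes them in the directory names, one `column=value` level per partition column. The sketch below illustrates that layout for one source row. The helper names `split_row` and `partition_path` and the table directory are assumptions made for illustration, not something taken from the original post.

```python
def split_row(line):
    """Split a source row into partition values and the remaining data columns."""
    fields = line.strip().split(",")
    exchanges, symbol, ymd = fields[0], fields[1], fields[2]
    # Data columns in the order declared for the stocks table.
    data_columns = [symbol] + fields[3:]
    return (exchanges, ymd), data_columns


def partition_path(table_dir, exchanges, ymd):
    """Build the partition directory in Hive's column=value naming scheme."""
    return f"{table_dir}/exchanges={exchanges}/ymd={ymd}"


row = "NASDAQ,ABXA,2009-12-09,2.55,2.77,2.50,2.67,158500,2.67"
(exchanges, ymd), data_columns = split_row(row)

# The table directory is a hypothetical example; the real one depends on the
# warehouse configuration or on the table's LOCATION.
path = partition_path("/user/hive/warehouse/stocks", exchanges, ymd)
print(path)
print(",".join(data_columns))
```

Because each distinct (exchanges, ymd) pair becomes its own directory, partitioning by date on a long price history produces a large number of partitions.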

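The reason for creating two tables is to first load the raw data into Hive (stocks1), then use dynamic partitioning to insert it into stocks. Because stocks is partitioned, queries that filter on the partition columns can run faster. I had only just started using Hive ...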
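
The dynamic-partition step described above could look roughly like the following. This is a sketch under stated assumptions, not the author's actual script: the function name `build_dynamic_partition_script` is hypothetical, and the generated HiveQL assumes stocks1 has already been loaded. The two `SET` properties are standard Hive settings for dynamic partitioning.

```python
# Column names taken from the two CREATE TABLE statements in the post.
DATA_COLUMNS = [
    "symbol", "price_open", "price_high", "price_low",
    "price_close", "volume", "price_adj_close",
]
PARTITION_COLUMNS = ["exchanges", "ymd"]


def build_dynamic_partition_script(source="stocks1", target="stocks"):
    """Return HiveQL that copies `source` into the partitioned `target`."""
    # With dynamic partitioning, the partition columns must come last in the
    # SELECT list, in the same order as in the PARTITION clause.
    select_list = ", ".join(DATA_COLUMNS + PARTITION_COLUMNS)
    partition_spec = ", ".join(PARTITION_COLUMNS)
    return "\n".join([
        "SET hive.exec.dynamic.partition=true;",
        "SET hive.exec.dynamic.partition.mode=nonstrict;",
        f"INSERT OVERWRITE TABLE {target} PARTITION ({partition_spec})",
        f"SELECT {select_list}",
        f"FROM {source};",
    ])


script = build_dynamic_partition_script()
print(script)
```

The printed script can be saved to a file and run with `hive -f`. With this many dates, the limits `hive.exec.max.dynamic.partitions` and `hive.exec.max.dynamic.partitions.pernode` may also need to be raised.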
