股票数据自动入库hive到Oracle

------import_stock_d.py-----------------------------------

#!/usr/bin/python

import tushare as ts
import os
import re
stocklistpath = '/home/cloudera/data/list/stocklist.txt'
savepath='/home/cloudera/data/data/'
openstock = open(stocklistpath,'r+')
for i in openstock:
    r = r"S[ZH]\d\d\d\d\d\d"
    stocklist = re.findall(r,i)


for i in stocklist:
    stocknum = i[2:8]
    df = ts.get_hist_data(stocknum)
    df.to_csv(savepath + i +'.txt')
    print i


for path,d,filelist in os.walk('/home/cloudera/data/data/'):
    for filename in filelist:
        filepath = os.path.join(path,filename)
        print filepath
        file = open(filepath,'r+')
        file.seek(0,0)
        filename1 = filename[0:8]+','
        print filename1
        for line in file.readlines():
           
            print file.writelines(filename1 + line)
        file.close()

--------------------------------StockRun.sh------------------------------------------------------

python /home/cloudera/python/import_stock_d.py

hadoop fs -put /home/cloudera/data/data /stock


hive -e "LOAD DATA INPATH '/stock/data/*' OVERWRITE INTO TABLE import_stock_d";


hive -e "insert overwrite table import_stock_d select * from import_stock_d where turnover is not null"


sqoop  export --table import_stock_d  -connect jdbc:oracle:thin:@192.168.1.10:1521:orcl  --username stock --password stock     --export-dir '/user/hive/warehouse/import_stock_d/*'  --input-fields-terminated-by ',' --input-lines-terminated-by '\n' --columns 'code,T_DATE,OPEN,HIGH,CLOSE,LOW,VOLUME,PRICE_CHANGE,P_CHANGE,MA5,MA10,MA20,V_MA5,V_MA10,V_MA20,TURNOVER'
  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值