Writing to Hive from Python: reading and writing Hive with Python

Latest recommended article published on 2023-07-09 21:53:03

Youzan Technology Team



Article link: https://blog.csdn.net/weixin_35318685/article/details/114956139


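This article describes how to read and write Hive data from Python with the pyhive library: create a connection with hive.Connection, run queries with pd.read_sql, and insert data by iterating over the rows of a DataFrame. Note that pyhive does not work on Windows; it works on CentOS and on macOS/Linux.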

Summary generated by CSDN using AI.

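I have recently been working on a project that needs to persist the results of an algorithm model to Hive.

I am currently using pyhive. Keep in mind that it cannot be used on Windows. I am running it on CentOS 6.5, and it is officially stated to work on macOS and Linux.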

from pyhive import hive
import pandas as pd

# from sqlalchemy import create_engine
# from pyspark.sql import sqlContext

# Replace the 'xxx' placeholders with your own host and user name
conn = hive.Connection(host='xxx', port=10000, username='xxx', database='default')
cur = conn.cursor()

# Read from Hive
dftt = pd.read_sql("select * from dw.ml_catalog limit 10", con=conn)
print(dftt)

# Test data
listpandas = [[456, 'test456'], [789, 'test456'], [123, 'test123'], [110, 'test110']]
# engine = create_engine('hive://xxx@xxx:10000/default')
df = pd.DataFrame(listpandas, columns=['id', 'name'])

# Use the approach below to write to Hive; to_sql currently has a bug and can only store a single statement
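One way to write the rows without to_sql is to loop over the DataFrame and issue one INSERT per row through the cursor. The sketch below is not the original author's code, only a minimal illustration of that idea. It assumes a DB-API cursor that accepts `%s` placeholders (the style PyHive's cursor uses) and a Hive version that supports `INSERT INTO ... VALUES`. The function names `insert_rows` and `insert_rows_batched` and the table name `default.demo_table` are hypothetical. The second function sends all rows in a single multi-row INSERT, which may be worth trying because each Hive INSERT statement can be slow.

```python
from typing import Any, Iterable, Sequence


def insert_rows(cursor: Any, table: str, rows: Iterable[Sequence[Any]]) -> int:
    """Insert rows one statement at a time; return the number of rows sent."""
    count = 0
    for row in rows:
        placeholders = ", ".join(["%s"] * len(row))
        # Values go in as parameters so the driver escapes them.
        # The table name cannot be parameterized, so it must come from trusted code.
        cursor.execute(f"INSERT INTO {table} VALUES ({placeholders})", tuple(row))
        count += 1
    return count


def insert_rows_batched(cursor: Any, table: str, rows: Iterable[Sequence[Any]]) -> int:
    """Insert all rows with a single multi-row INSERT; return the number of rows sent."""
    materialized = [tuple(row) for row in rows]
    if not materialized:
        return 0  # nothing to insert, so no statement is executed
    row_sql = "(" + ", ".join(["%s"] * len(materialized[0])) + ")"
    sql = f"INSERT INTO {table} VALUES " + ", ".join([row_sql] * len(materialized))
    # Flatten the rows into one parameter tuple matching the placeholders in order.
    params = tuple(value for row in materialized for value in row)
    cursor.execute(sql, params)
    return len(materialized)


# Hypothetical usage with the objects from the listing above
# (`cur` is the pyhive cursor, `df` is the test DataFrame):
#
#   rows = df.itertuples(index=False, name=None)
#   insert_rows(cur, "default.demo_table", rows)
```

The functions accept any iterable of row sequences, so they do not depend on pandas themselves. Passing plain Python ints, floats, and strings is the safest choice, since a driver may not know how to escape other value types.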
