1、安装pyhs2
A Python client driver for HiveServer2 is available at https://github.com/BradRuderman/pyhs2 (thanks, Brad). It includes all the required packages such as SASL and Thrift wrappers.The driver has been certified for use with Python 2.6 and newer.
pip install pyhs2
我的环境还需要提前安装VCForPython27,下载地址:https://www.microsoft.com/en-us/download/details.aspx?id=44266
2、参考代码:
import pyhs2
with pyhs2.connect(host='localhost',
port=10000,
authMechanism="PLAIN",
user='root',
password='test',
database='default') as conn:
with conn.cursor() as cur:
#Show databases
print cur.getDatabases()
#Execute query
cur.execute("select * from table")
#Return column info from query
print cur.getSchema()
#Fetch table results
for i in cur.fetch():
print i
参考地址:https://cwiki.apache.org/confluence/display/Hive/Setting+Up+HiveServer2#SettingUpHiveServer2-PythonClientDriver
也可以考虑pyhive,github地址:https://github.com/dropbox/PyHive