Hive 3.1.1 High-Availability Cluster Setup Notes (Integrated with ZooKeeper)

I. Overview

          1. Hive

                  Three nodes: hdp01, hdp02, hdp03

          2. ZooKeeper

                  Five nodes: hdp04, hdp05, hdp06, hdp07, hdp08

          3. Hadoop

                 Eight nodes: NameNode on hdp01, hdp02
                              DataNode on hdp03, hdp04, hdp05, hdp06, hdp07, hdp08

 

II. Setup Steps

           1. Install MySQL

                   rpm -ivh MySQL-server-5.6.26-1.linux_glibc2.5.x86_64.rpm
                   rpm -ivh MySQL-client-5.6.26-1.linux_glibc2.5.x86_64.rpm

                  The install may complain that perl is missing; if so:

                  yum install perl

          Note:

                  After logging in for the first time, configure remote login for root right away; otherwise further changes will be needed later.
          How to do that is recorded in another post:

                https://mp.csdn.net/postedit/89081368
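
                For reference, a minimal sketch of enabling remote root login, assuming the root/root credentials used in this setup (the metastore database itself is created automatically later, thanks to createDatabaseIfNotExist=true in the JDBC URL):

                        mysql -u root -p

                        -- inside the mysql shell:
                        -- allow root to connect from any host (tighten the host pattern in production)
                        GRANT ALL PRIVILEGES ON *.* TO 'root'@'%' IDENTIFIED BY 'root' WITH GRANT OPTION;
                        FLUSH PRIVILEGES;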

           2. Install Hive and integrate it with MySQL and ZooKeeper

                 The unpacking steps are omitted here; the work is mainly in the configuration files, so only Hive's configuration files and the relevant settings are covered below.

                        cp hive-env.sh.template hive-env.sh
                        cp hive-default.xml.template hive-site.xml

                 Edit hive-env.sh

                        Adjust the paths to your own environment; mine look like this:

                        HADOOP_HOME=/usr/hadoop/hadoop-2.8.1/
                        HIVE_CONF_DIR=/root/app/apache-hive-3.1.1-bin/conf
                        HIVE_AUX_JARS_PATH=/root/app/apache-hive-3.1.1-bin/lib
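
                 It also helps to put Hive on the PATH so that the hiveserver2 and beeline commands in section III can be run directly; a sketch, assuming the same install path, appended to /etc/profile (then run source /etc/profile):

                        export HIVE_HOME=/root/app/apache-hive-3.1.1-bin
                        export PATH=$PATH:$HIVE_HOME/bin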

                Edit hive-site.xml

   

-----------------------------------------Database integration (MySQL metastore)-----------------------------------------
<property>
<name>javax.jdo.option.ConnectionURL</name>
<value>jdbc:mysql://hdp01:3306/hivedb?createDatabaseIfNotExist=true</value>
<description>JDBC connect string for a JDBC metastore</description>
</property>
 
<property>
<name>javax.jdo.option.ConnectionDriverName</name>
<value>com.mysql.jdbc.Driver</value>
<description>Driver class name for a JDBC metastore</description>
</property>
 

--------------------------------------------Hive working directories--------------------------------------------------

 It is best to set these directory values explicitly: when the variables in the default values cannot be resolved, the generated file names come out garbled. I set mine as follows:

 <property>
    <name>hive.exec.local.scratchdir</name>
    <value>/root/app/apache-hive-3.1.1-bin/tmp/hiveuser</value>
    <description>Local scratch space for Hive jobs</description>
  </property>
  <property>
    <name>hive.downloaded.resources.dir</name>
    <value>/root/app/apache-hive-3.1.1-bin/tmp/${hive.session.id}_resources</value>    
    <description>Temporary local directory for added resources in the remote file system.</description>
  </property>
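
These local directories are not always created automatically; a small sketch of creating the scratch space up front on each Hive node (the per-session *_resources directories then appear under tmp/ on their own):

   mkdir -p /root/app/apache-hive-3.1.1-bin/tmp/hiveuser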

-------------------------------------------Metastore schema settings (important: without these, the MySQL integration may fail)------------

<property>
    <name>datanucleus.schema.autoCreateAll</name>
    <value>true</value>
    <description>Auto creates necessary schema on a startup if one doesn't exist. Set this to false, after creating it once.To enable auto create also set hive.metastore.schema.verification=false. Auto creation is not recommended for production use cases, run schematool command instead.</description>
  </property>
  <property>
    <name>hive.metastore.schema.verification</name>
    <value>false</value>
    <description>
      Enforce metastore schema version consistency.
      True: Verify that version information stored in is compatible with one from Hive jars.  Also disable automatic
            schema migration attempt. Users are required to manually migrate schema after Hive upgrade which ensures
            proper metastore schema migration. (Default)
      False: Warn if the version information stored in metastore doesn't match with one from in Hive jars.
    </description>
  </property>
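
As the descriptions above note, auto-creating the schema is not recommended for production; the alternative they refer to is Hive's schematool, which initializes the metastore schema in MySQL once. A sketch (run after hive-site.xml points at MySQL and the JDBC driver jar is in place):

   # initialize the metastore schema once
   schematool -dbType mysql -initSchema
   # after a Hive upgrade:
   # schematool -dbType mysql -upgradeSchema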

--------------------------------------MySQL username and password for the metastore connection---------------------------

<property>
    <name>javax.jdo.option.ConnectionUserName</name>
    <value>hiveuser</value>
    <description>Username to use against metastore database</description>
  </property>

<property>
    <name>javax.jdo.option.ConnectionPassword</name>
    <value>123456789</value>
    <description>password to use against metastore database</description>
  </property>
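
Because the connection uses hiveuser instead of root, that account has to exist in MySQL with privileges on the metastore database; a minimal sketch, assuming the hivedb name from the JDBC URL above:

   -- in the mysql shell, as root
   CREATE USER 'hiveuser'@'%' IDENTIFIED BY '123456789';
   GRANT ALL PRIVILEGES ON hivedb.* TO 'hiveuser'@'%';
   FLUSH PRIVILEGES;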

---------------------------------------Other settings--------------------------------------------------

 <property>
    <name>hive.querylog.location</name>
    <value>/root/app/apache-hive-3.1.1-bin/tmp/qrylog</value>
    <description>Location of Hive run time structured log file</description>
  </property>

---------------------------------------ZooKeeper---------------------------------------------------

 <property>
    <name>hive.zookeeper.quorum</name>
    <value>hdp04:2181,hdp05:2181,hdp06:2181,hdp07:2181,hdp08:2181</value>
    <description>
      List of ZooKeeper servers to talk to. This is needed for:
      1. Read/write locks - when hive.lock.manager is set to
      org.apache.hadoop.hive.ql.lockmgr.zookeeper.ZooKeeperHiveLockManager,
      2. When HiveServer2 supports service discovery via Zookeeper.
      3. For delegation token storage if zookeeper store is used, if
      hive.cluster.delegation.token.store.zookeeper.connectString is not set
      4. LLAP daemon registry service
      5. Leader selection for privilege synchronizer
    </description>
  </property>

 <property>
    <name>hive.server2.support.dynamic.service.discovery</name>
    <value>true</value>
    <description>Whether HiveServer2 supports dynamic service discovery for its clients. To support this, each instance of HiveServer2 currently uses ZooKeeper to register itself, when it is brought up. JDBC/ODBC clients should use the ZooKeeper ensemble: hive.zookeeper.quorum in their connection string.</description>
  </property>
  <property>
    <name>hive.server2.zookeeper.namespace</name>
    <value>hiveserver2_zk</value>
    <description>The parent node in ZooKeeper used by HiveServer2 when supporting dynamic service discovery.</description>
  </property>
  <property>
    <name>hive.server2.zookeeper.publish.configs</name>
    <value>true</value>
    <description>Whether we should publish HiveServer2's configs to ZooKeeper.</description>
  </property>
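
Once the HiveServer2 instances are running (section III), their registration under this namespace can be checked with the ZooKeeper CLI; a sketch, run against any ZooKeeper node:

   zkCli.sh -server hdp04:2181
   # inside the zk shell: each live HiveServer2 instance appears as a child znode
   ls /hiveserver2_zk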

--------------------------------------HiveServer2 logging and Thrift service-----------------------------------------------

<property>
    <name>hive.server2.logging.operation.log.location</name>
    <value>/root/app/apache-hive-3.1.1-bin/tmp/operation_logs</value>
    <description>Top level directory where operation logs are stored if logging functionality is enabled</description>
  </property>

 <property>
    <name>hive.server2.thrift.client.user</name>
    <value>root</value>
    <description>Username to use against thrift client</description>
  </property>
  <property>
    <name>hive.server2.thrift.client.password</name>
    <value>root</value>
    <description>Password to use against thrift client</description>
  </property>

<property>
    <name>hive.server2.thrift.port</name>
    <value>10000</value>
    <description>Port number of HiveServer2 Thrift interface when hive.server2.transport.mode is 'binary'.</description>
  </property>

<property>
    <name>hive.server2.transport.mode</name>
    <value>binary</value>
    <description>
      Expects one of [binary, http].
      Transport mode of HiveServer2.
    </description>
  </property>
  <property>
    <name>hive.server2.thrift.bind.host</name>
    <value>hdp01</value>
    <description>Bind host on which to run the HiveServer2 Thrift service.</description>
  </property>
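
Note that hive.server2.thrift.bind.host is per node: the value above is for hdp01, and on the other Hive nodes it should name that node instead. For example, on hdp02:

<property>
    <name>hive.server2.thrift.bind.host</name>
    <value>hdp02</value>
</property>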

------------------------------------------------HDFS-related settings-------------------------------------------

<property>
    <name>hive.exec.scratchdir</name>
    <value>/tmp/hive</value>
    <description>HDFS root scratch dir for Hive jobs which gets created with write all (733) permission. For each connecting user, an HDFS scratch dir: ${hive.exec.scratchdir}/&lt;username&gt; is created, with ${hive.scratch.dir.permission}.</description>
  </property>
  <property>
    <name>hive.repl.rootdir</name>
    <value>/user/hive/repl/</value>
    <description>HDFS root dir for all replication dumps.</description>
  </property>
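
These HDFS paths can be created ahead of time with the expected permissions; a sketch (the /user/hive/warehouse path is the default hive.metastore.warehouse.dir and is included here as an assumption):

   hdfs dfs -mkdir -p /tmp/hive
   hdfs dfs -chmod 733 /tmp/hive
   hdfs dfs -mkdir -p /user/hive/warehouse
   hdfs dfs -chmod g+w /user/hive/warehouse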

   hdfs-site.xml configuration

<property>
<name>dfs.webhdfs.enabled</name>
<value>true</value>
</property>

core-site.xml configuration. This part is very important. Note that the 'root' inside the property names below is the actual user name used to log in to HDFS; if it is wrong, you will get errors when accessing Hive.

<property>
     <name>hadoop.proxyuser.root.hosts</name>
     <value>*</value>
   </property>
   <property>
    <name>hadoop.proxyuser.root.groups</name>
    <value>*</value>
   </property>
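
After editing core-site.xml, either restart HDFS/YARN or refresh the proxy-user settings in place; a sketch:

   hdfs dfsadmin -refreshSuperUserGroupsConfiguration
   yarn rmadmin -refreshSuperUserGroupsConfiguration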

 

             Once the configuration is complete, Hive still needs the MySQL JDBC driver jar; see the sketch below.
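
             The driver jar goes into Hive's lib directory on each Hive node; a sketch, with the file name as a placeholder for whichever mysql-connector-java version you downloaded:

                   # adjust the jar name to your actual download
                   cp mysql-connector-java-*.jar /root/app/apache-hive-3.1.1-bin/lib/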

  

 III. Commands

      1. Start the service in the background. Run this on each Hive node (hdp01, hdp02, hdp03) so that all instances register in ZooKeeper:

          nohup hiveserver2 --hiveconf hive.root.logger=DEBUG,console 1> hive.log 2>&1 &
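
      A quick way to confirm that the service came up before trying a client (HiveServer2 shows up as a RunJar process in jps):

          jps                           # look for a RunJar process
          netstat -ntlp | grep 10000    # the hive.server2.thrift.port configured above
          tail -f hive.log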

     2. Client access. Launch beeline and enter the following to connect:

       !connect jdbc:hive2://hdp04:2181,hdp05:2181,hdp06:2181,hdp07:2181,hdp08:2181/;serviceDiscoveryMode=zooKeeper;zooKeeperNamespace=hiveserver2_zk root  "root"
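
     Once connected, a few throwaway statements confirm that the metastore and the HDFS directories are wired up correctly (smoke_test is just an illustrative table name):

       show databases;
       create table smoke_test (id int);
       insert into smoke_test values (1);
       select * from smoke_test;
       drop table smoke_test;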
