Big Data Cluster Setup: Installing Hive2 on Linux (1)

2401_84182636

Published 2024-05-16 04:22:58

Article link: https://blog.csdn.net/2401_84182636/article/details/138935999

IV. Hive Configuration

cd $HIVE_HOME/conf

touch hive-env.sh hive-site.xml

chmod +x hive-env.sh

1. Configure hive-env.sh

export HADOOP_HEAPSIZE=4096

export JAVA_HOME=/usr/java/jdk1.8

export HADOOP_HOME=/usr/local/hadoop/hadoop

export HIVE_HOME=/usr/local/hadoop/hive

export HIVE_CONF_DIR=/usr/local/hadoop/hive/conf

export HBASE_HOME=/usr/local/hadoop/hbase

export SPARK_HOME=/usr/local/hadoop/spark

export ZOO_HOME=/usr/local/hadoop/zookeeper
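
Because hive-env.sh hard-codes several installation paths, a quick existence check can catch a typo before Hive is started. The following is a minimal sketch; the helper name check_dirs is hypothetical and is not part of Hive or Hadoop.

```shell
#!/bin/sh
# Report which of the given directories do not exist.
# Prints one "MISSING: <path>" line per absent directory and
# returns non-zero if at least one directory is missing.
check_dirs() {
  missing=0
  for dir in "$@"; do
    if [ ! -d "$dir" ]; then
      echo "MISSING: $dir"
      missing=1
    fi
  done
  return "$missing"
}

# Example call with the paths exported in hive-env.sh:
# check_dirs /usr/java/jdk1.8 /usr/local/hadoop/hadoop /usr/local/hadoop/hive
```

Optional components such as HBase or Spark can be left out of the call if they are not installed on the node.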

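2. Configure hive-site.xml

<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
<configuration>
  <property>
    <name>hive.metastore.warehouse.dir</name>
    <value>/user/hive/warehouse</value>
    <description>Location of the default database for the warehouse</description>
  </property>
  <property>
    <name>hive.exec.local.scratchdir</name>
    <value>/usr/local/hadoop/hive/tmp</value>
    <description>Local scratch space for Hive jobs</description>
  </property>
  <property>
    <name>hive.downloaded.resources.dir</name>
    <value>/usr/local/hadoop/hive/tmp/resources</value>
    <description>Temporary local directory for added resources in the remote file system.</description>
  </property>
  <property>
    <name>hive.querylog.location</name>
    <value>/user/hadoop/hive/logs</value>
    <description>Location of Hive run time structured log file</description>
  </property>
  <property>
    <name>hive.metastore.schema.verification</name>
    <value>false</value>
    <description>
      Enforce metastore schema version consistency.
      True: Verify that the version information stored in the metastore is compatible with the one from the Hive jars.
            Also disables automatic schema migration attempts. Users are required to manually migrate the schema
            after a Hive upgrade, which ensures proper metastore schema migration. (Default)
      False: Warn if the version information stored in the metastore doesn't match the one from the Hive jars.
    </description>
  </property>
  <property>
    <name>hive.metastore.db.type</name>
    <value>mysql</value>
    <description>
      Expects one of [derby, oracle, mysql, mssql, postgres].
      Type of database used by the metastore. Information schema &amp; JDBCStorageHandler depend on it.
    </description>
  </property>
  <property>
    <name>javax.jdo.option.ConnectionURL</name>
    <!-- Inside XML the ampersand must be written as &amp; -->
    <value>jdbc:mysql://hadoop001:3306/hive?createDatabaseIfNotExist=true&amp;useSSL=false</value>
  </property>
  <property>
    <name>javax.jdo.option.ConnectionDriverName</name>
    <!-- com.mysql.jdbc.Driver matches Connector/J 5.1.x; Connector/J 8.x names the class com.mysql.cj.jdbc.Driver -->
    <value>com.mysql.jdbc.Driver</value>
    <description>Driver class name for a JDBC metastore</description>
  </property>
  <property>
    <name>javax.jdo.option.ConnectionUserName</name>
    <value>hive</value>
    <description>Username to use against metastore database</description>
  </property>
  <property>
    <name>javax.jdo.option.ConnectionPassword</name>
    <value>hive</value>
    <description>Password to use against metastore database</description>
  </property>
  <property>
    <name>datanucleus.schema.autoCreateAll</name>
    <value>true</value>
    <description>
      Auto creates the necessary schema on startup if one doesn't exist. Set this to false after creating it once.
      To enable auto create, also set hive.metastore.schema.verification=false. Auto creation is not recommended
      for production use cases; run the schematool command instead.
    </description>
  </property>
  <property>
    <name>hive.server2.thrift.bind.host</name>
    <value>hadoop001</value>
    <description>Bind host on which to run the HiveServer2 Thrift service.</description>
  </property>
  <property>
    <name>hive.server2.thrift.port</name>
    <value>10000</value>
    <description>Port number of HiveServer2 Thrift interface when hive.server2.transport.mode is 'binary'.</description>
  </property>
  <property>
    <name>hive.metastore.uris</name>
    <!-- Left empty in the source; with no URI set, Hive uses a local (embedded) metastore -->
    <value/>
    <description>Thrift URI for the remote metastore. Used by metastore client to connect to remote metastore.</description>
  </property>
  <property>
    <name>hive.server2.logging.operation.log.location</name>
    <value>/usr/local/hadoop/hive/tmp/operation_logs</value>
    <description>Top level directory where operation logs are stored if logging functionality is enabled</description>
  </property>
  <property>
    <name>hive.server2.webui.host</name>
    <value>hadoop001</value>
    <description>The host address the HiveServer2 WebUI will listen on</description>
  </property>
  <property>
    <name>hive.server2.webui.port</name>
    <value>10002</value>
    <description>The port the HiveServer2 WebUI will listen on. This can be set to 0 or a negative integer to disable the web UI</description>
  </property>
  <property>
    <name>hive.server2.webui.max.threads</name>
    <!-- The value is missing in the source; 50 is Hive's default and is used here as an assumption -->
    <value>50</value>
    <description>The max HiveServer2 WebUI threads</description>
  </property>
  <property>
    <name>hive.server2.webui.use.ssl</name>
    <value>false</value>
    <description>Set this to true for using SSL encryption for HiveServer2 WebUI.</description>
  </property>
  <property>
    <name>hive.server2.thrift.client.user</name>
    <value>root</value>
    <description>Username to use against thrift client</description>
  </property>
  <property>
    <name>hive.server2.thrift.client.password</name>
    <value>root</value>
    <description>Password to use against thrift client</description>
  </property>
</configuration>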
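
To confirm which value Hive will actually pick up for a given setting, the file can be queried from the shell. This is a rough sketch that assumes hive-site.xml uses the standard property/name/value layout; get_hive_prop is a hypothetical helper, and it is a line-based lookup rather than a real XML parser, so escaped characters such as `&amp;` are printed as written.

```shell
#!/bin/sh
# Print the <value> of a named property from a Hadoop-style XML file.
# Usage: get_hive_prop <file> <property-name>
# Prints nothing if the property is absent or has no <value> element.
get_hive_prop() {
  file="$1"
  prop="$2"
  awk -v want="$prop" '
    /<name>/ {
      # Strip everything around the property name on this line.
      name = $0
      sub(/.*<name>/, "", name)
      sub(/<\/name>.*/, "", name)
      found = (name == want)
    }
    found && /<value>/ {
      # First <value> after the matching <name> belongs to that property.
      value = $0
      sub(/.*<value>/, "", value)
      sub(/<\/value>.*/, "", value)
      print value
      exit
    }
  ' "$file"
}

# Example call on the Hive node:
# get_hive_prop "$HIVE_HOME/conf/hive-site.xml" hive.server2.thrift.port
```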

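V. Initialize Hive

1. Copy the MySQL JDBC driver into the Hive lib directory

cd $HIVE_HOME/lib

# Download only ONE of the two connectors below, matching your MySQL server version; keeping both jars in lib can cause driver conflicts.

# For MySQL 5.x:

wget https://repo1.maven.org/maven2/mysql/mysql-connector-java/5.1.47/mysql-connector-java-5.1.47.jar

# For MySQL 8.x:

wget https://repo1.maven.org/maven2/mysql/mysql-connector-java/8.0.16/mysql-connector-java-8.0.16.jar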

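2. Create a MySQL user and grant privileges

-- Create the hive user with password hive

CREATE USER 'hive'@'%' IDENTIFIED BY 'hive';

-- Grant all privileges to the hive user (the password is already set by CREATE USER, and MySQL 8 no longer accepts IDENTIFIED BY in GRANT)

GRANT ALL PRIVILEGES ON *.* TO 'hive'@'%' WITH GRANT OPTION;

-- Reload the privilege tables

FLUSH PRIVILEGES;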

3. Start ZooKeeper and the Hadoop cluster

zkServer.sh start

hdfs --daemon start zkfc

start-all.sh

4. Create the Hive directories in HDFS and set permissions

hadoop fs -mkdir -p /tmp

hadoop fs -mkdir -p /user/hive/warehouse

hadoop fs -chmod g+w /tmp

hadoop fs -chmod g+w /user/hive/warehouse
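
If this step may be run more than once, the directory creation can be made safe to repeat. The sketch below wraps the same `hadoop fs` commands in a hypothetical helper named ensure_hdfs_dir; it assumes the `hadoop` command is on the PATH and that HDFS is already running.

```shell
#!/bin/sh
# Create an HDFS directory only if it is absent, then make it group-writable.
# Usage: ensure_hdfs_dir <hdfs-path>
ensure_hdfs_dir() {
  dir="$1"
  # "hadoop fs -test -d" exits with 0 when the path exists and is a directory.
  if ! hadoop fs -test -d "$dir"; then
    hadoop fs -mkdir -p "$dir"
  fi
  hadoop fs -chmod g+w "$dir"
}

# Example calls on the cluster:
# ensure_hdfs_dir /tmp
# ensure_hdfs_dir /user/hive/warehouse
```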

5. Initialize the Hive metastore database

schematool -dbType mysql -initSchema
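
Running `-initSchema` a second time against an already initialized database typically fails, so it can help to check first. The sketch below is one possible guard; init_metastore_schema is a hypothetical helper, and it assumes that `schematool -info` exits with a non-zero status when no schema is present, which is worth verifying on your Hive version.

```shell
#!/bin/sh
# Initialize the metastore schema only when "schematool -info" cannot find one.
# Usage: init_metastore_schema [db-type]   (db-type defaults to mysql)
init_metastore_schema() {
  db_type="${1:-mysql}"
  if schematool -dbType "$db_type" -info >/dev/null 2>&1; then
    echo "metastore schema already present, skipping initSchema"
  else
    schematool -dbType "$db_type" -initSchema
  fi
}

# Example call on the Hive node:
# init_metastore_schema mysql
```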

6. Inspect the database created by the Hive initialization

VI. Start Hive

1. Start the Hive client

hive

SHOW DATABASES;

CREATE DATABASE db01;

USE db01;

set hive.cli.print.current.db=true;

CREATE TABLE pokes (foo INT, bar STRING);

CREATE TABLE invites (foo INT, bar STRING) PARTITIONED BY (ds STRING);

SHOW TABLES;

SHOW TABLES '.*s';

DESCRIBE invites;

2. View the Hive directory in the HDFS web UI

http://hadoop001:9870/explorer.html#/user/hive/warehouse/db01.db

3. Start the HiveServer2 service

nohup hiveserver2 > /dev/null 2>&1 &

HiveServer2 serves multiple clients concurrently across multiple threads, and it also accepts JDBC connections.

JDBC driver: org.apache.hive.jdbc.HiveDriver

JDBC URL: jdbc:hive2://hadoop001:10000/dbname
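
One way to exercise the JDBC endpoint from the command line is Beeline, the JDBC client shipped with Hive. The sketch below builds the URL and passes a statement to `beeline`; the helper names are hypothetical, and the database db01 and the root/root credentials are assumptions taken from the earlier examples in this article.

```shell
#!/bin/sh
# Build a HiveServer2 JDBC URL from host, port and database name.
hive_jdbc_url() {
  host="$1"
  port="$2"
  db="$3"
  echo "jdbc:hive2://${host}:${port}/${db}"
}

# Run one SQL statement through Beeline.
# Usage: run_hive_sql <jdbc-url> <user> <password> <sql>
run_hive_sql() {
  url="$1"
  user="$2"
  pass="$3"
  sql="$4"
  beeline -u "$url" -n "$user" -p "$pass" -e "$sql"
}

# Example call once HiveServer2 is listening on port 10000:
# run_hive_sql "$(hive_jdbc_url hadoop001 10000 db01)" root root 'SHOW TABLES;'
```

HiveServer2 can take a little while to open its port after startup, so a connection refused error right after launching it does not necessarily mean the configuration is wrong.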

4. View the Hive log

tail -n 300 /tmp/root/hive.log
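
When the log is long, filtering for error lines is often quicker than reading the tail. The sketch below assumes the default log location of /tmp/<user>/hive.log (the path above is the one for the root user); recent_hive_errors is a hypothetical helper.

```shell
#!/bin/sh
# Show the most recent ERROR/FATAL lines of a Hive log, with line numbers.
# Usage: recent_hive_errors [log-file] [max-lines]
recent_hive_errors() {
  log_file="${1:-/tmp/$(id -un)/hive.log}"
  count="${2:-20}"
  grep -n -E 'ERROR|FATAL' "$log_file" | tail -n "$count"
}

# Example calls:
# recent_hive_errors
# recent_hive_errors /tmp/root/hive.log 50
```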
