简介:Apache Hive是数据仓库。通过Hive可以使用HQL语言查询存放在HDFS上的数据。HQL是一种类SQL语言,这种语言最终被转化为Map/Reduce. 虽然Hive提供了SQL查询功能,但是Hive不能够进行交互查询--因为它是基于MapReduce算法。
划重点:hive不用和其他组件一样进行分布式集群安装
Note, Hive is not something you install on worker nodes. Hive is a Hadoop client. Just run Hive according to the instructions you see at the Hive site.
Hive不需要安装到每个HADOOP节点,它是一个HADOOP客户端,仅在需要的地方安装即可
1,安装JDK
2,安装Hadoop 集群。参考博文:https://blog.csdn.net/u010916338/article/details/78493056
3,配置JDK和Hadoop的环境变量
4,安装MySQL
5,下载并解压hive(不用管是Hadoop几群的哪个节点)
6,进入apache-hive-1.2.2-bin/conf目录下配置hive-site.xml文件
<configuration>
<!--配置MySQL链接地址:IP和数据库名,
hive启动后会在MySQL中创建一个名字叫做hive的数据库,用来保存元数据信息-->
<property>
<name>javax.jdo.option.ConnectionURL</name>
<value>jdbc:mysql://big07:3306/hive?createDatabaseIfNotExist=true</value>
</property>
<property>
<name>javax.jdo.option.ConnectionDriverName</name>
<value>com.mysql.jdbc.Driver</value>
</property>
<property>
<name>javax.jdo.option.ConnectionUserName</name>
<value>root</value>
</property>
<property>
<name>javax.jdo.option.ConnectionPassword</name>
<value>root</value>
</property>
<property>
<name>hive.metastore.schema.verification</name>
<value>false</value>
</property>
<property>
<name>hive.server2.thrift.port</name>
<value>10000</value>
</property>
<property>
<name>hive.server2.thrift.bind.host</name>
<value>big07</value>
</property>
</configuration>
7,把MySQL驱动加入到apache-hive-1.2.2-bin/lib中。下载地址:链接:https://pan.baidu.com/s/1-q4LNql1hBg0WxPxLnyrfQ
提取码:45v4
8,进入到apache-hive-1.2.2-bin/bin目录,运行:sh hive (或者执行:./hive)
9,报错
(1)javax.jdo.JDODataStoreException: Required table missing : "`VERSION`" in Catalog "" Schema "". DataNucleus requires this table to perform its persistence operations. Either your MetaData is incorrect, or you need to enable "datanucleus.schema.autoCreateTables"
--初始化数据库
|
(2)java.sql.SQLException: Access denied for user 'root'@'localhost' (using password: YES)
MySQL没有赋予权限,参考博文,https://blog.csdn.net/u010916338/article/details/89765613