Connecting Kettle to Hive for data import:
First, copy all Hive-related jar files from hive/lib into the lib directory of the hdp25 directory under Kettle.
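The copy step can be scripted. The following is a minimal sketch, not the only way to do it: the function name copy_hive_jars, the variables HIVE_HOME and KETTLE_SHIM_LIB, and the hive*.jar pattern used to pick "Hive-related" jars are all assumptions made for illustration. Adjust them to your own installation.

```shell
#!/bin/sh
# Copy every jar whose name starts with "hive" from a Hive lib directory
# into a Kettle Hadoop-configuration lib directory.
# Both arguments are placeholders for your own installation paths.
copy_hive_jars() {
    src_dir=$1   # e.g. "$HIVE_HOME/lib"
    dst_dir=$2   # e.g. the lib directory of the hdp25 configuration
    mkdir -p "$dst_dir" || return 1
    for jar in "$src_dir"/hive*.jar; do
        # Skip the unexpanded pattern when no jar matches.
        [ -e "$jar" ] || continue
        cp "$jar" "$dst_dir"/ || return 1
    done
}

# Run only when both (hypothetical) variables are set.
if [ -n "${HIVE_HOME:-}" ] && [ -n "${KETTLE_SHIM_LIB:-}" ]; then
    copy_hive_jars "$HIVE_HOME/lib" "$KETTLE_SHIM_LIB"
fi
```

Restart Kettle after copying so that the new jars are picked up.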
Configuration changes:
Change the following property values in plugin.properties:
# here see the config.properties file in that configuration's directory.
active.hadoop.configuration=hdp25
# Path to the directory that contains the available Hadoop configurations
hadoop.configurations.path=hadoop-configurations
# Version of Kettle to use from the Kettle HDFS installation directory. This can be set globally here or overridden per job
# as a User Defined property. If not set we will use the version of Kettle that is used to submit the Pentaho MapReduce job.
pmr.kettle.installation.id=
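If you prefer to make this change from a script instead of an editor, a small helper like the one below could do it. This is only a sketch: set_property is a hypothetical helper, not part of Kettle, and the path to plugin.properties must be supplied by you.

```shell
#!/bin/sh
# Set key=value in a properties file: replace the existing line for the key,
# or append a new line if the key is not present yet.
set_property() {
    file=$1
    key=$2
    value=$3
    tmp_file="$file.tmp.$$"
    awk -v k="$key" -v v="$value" '
        # A line that starts with "key=" is rewritten with the new value.
        index($0, k "=") == 1 { print k "=" v; found = 1; next }
        { print }
        END { if (!found) print k "=" v }
    ' "$file" > "$tmp_file" && mv "$tmp_file" "$file"
}
```

For example, `set_property /path/to/plugin.properties active.hadoop.configuration hdp25` selects the hdp25 configuration and leaves comments and other keys untouched.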
Fixing the permission problem:
Add the following to Hadoop's core-site.xml file:
<property>
<name>hadoop.proxyuser.root.groups</name>
<value>*</value>
</property>
<property>
<name>hadoop.proxyuser.root.hosts</name>
<value>*</value>
</property>
Starting hiveserver2
Start hiveserver2 in the background:
nohup hive --service hiveserver2 &
Restart Hadoop, and the connection to Hive should then work.
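Before configuring the connection in Kettle, it can help to confirm that hiveserver2 accepts JDBC connections. The sketch below builds the JDBC URL and, if beeline is available, runs a trivial query. It assumes hiveserver2 listens on its default port 10000 and that you connect as root; hive_jdbc_url and HIVE_HOST are hypothetical names introduced for this example.

```shell
#!/bin/sh
# Build a HiveServer2 JDBC URL from host, port, and database name.
hive_jdbc_url() {
    host=${1:-localhost}
    port=${2:-10000}        # 10000 is the default HiveServer2 port
    database=${3:-default}
    printf 'jdbc:hive2://%s:%s/%s\n' "$host" "$port" "$database"
}

# Optional connectivity check; runs only when HIVE_HOST (hypothetical) is set
# and beeline is on the PATH.
if [ -n "${HIVE_HOST:-}" ] && command -v beeline >/dev/null 2>&1; then
    beeline -u "$(hive_jdbc_url "$HIVE_HOST")" -n root -e 'show databases;'
fi
```

The same host, port, and database values go into the Kettle database connection dialog.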
Checking which ports are in use:
netstat -tunlp |grep 22
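Note that a plain grep 22 also matches any line that merely contains "22", such as a PID or port 10022. A stricter variant is sketched below; it assumes the Linux netstat layout where the local address is the fourth column, and filter_port is a hypothetical helper name. Port 10000 is used in the example because it is the hiveserver2 default.

```shell
#!/bin/sh
# Print only the netstat lines whose local address ends in ":<port>".
filter_port() {
    awk -v port="$1" '$4 ~ (":" port "$")'
}

# Example: show what is listening on port 10000, if netstat is installed.
if command -v netstat >/dev/null 2>&1; then
    netstat -tunlp 2>/dev/null | filter_port 10000
fi
```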
If the proxy user is bigdata rather than root, the corresponding core-site.xml entries are:
<property>
<name>hadoop.proxyuser.bigdata.groups</name>
<value>*</value>
</property>
<property>
<name>hadoop.proxyuser.bigdata.hosts</name>
<value>*</value>
</property>
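The proxy-user entries for root and bigdata differ only in the user name, so they can be generated for any account. The following is a small sketch; proxyuser_properties is a hypothetical helper, and its output still has to be pasted inside the configuration element of core-site.xml by hand.

```shell
#!/bin/sh
# Print the two hadoop.proxyuser.* property blocks for the given user.
proxyuser_properties() {
    user=$1
    for suffix in groups hosts; do
        printf '<property>\n<name>hadoop.proxyuser.%s.%s</name>\n<value>*</value>\n</property>\n' \
            "$user" "$suffix"
    done
}

# Example: print the entries for the bigdata user.
proxyuser_properties bigdata
```

The value * allows the user to impersonate from any host and for any group; restrict it if your cluster requires tighter control.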
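After starting Hive with
hive --service metastore &
hive --service hiveserver2 &
jps showed a day later that the two RunJar processes had disappeared.
Cause: a process started with only & is shut down automatically when the account logs out.
Fix: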
nohup hive --service metastore &
nohup hive --service hiveserver2 &
Or, to append both stdout and stderr to log files (the file redirection must come before 2>&1):
nohup hive --service metastore >> /home/data/appcom/hive-2.3.3/log/hive_metastore.log 2>&1 &
nohup hive --service hiveserver2 >> /home/data/appcom/hive-2.3.3/log/hive_server2.log 2>&1 &
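To notice a vanished service sooner than "a day later", the jps check can be automated. The sketch below counts RunJar entries in jps output; count_runjar is a hypothetical helper, and the threshold of 2 assumes that only the metastore and hiveserver2 run as RunJar on this machine.

```shell
#!/bin/sh
# Count the lines of jps output (read from stdin) that name a RunJar process.
count_runjar() {
    # grep -c prints 0 but exits non-zero when nothing matches,
    # so the exit status is deliberately ignored.
    grep -c 'RunJar$' || true
}

# Warn when fewer than two RunJar processes are alive (jps must be on the PATH).
if command -v jps >/dev/null 2>&1; then
    running=$(jps | count_runjar)
    if [ "$running" -lt 2 ]; then
        echo "Only $running RunJar process(es) found; metastore or hiveserver2 may be down." >&2
    fi
fi
```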