Connecting Spark to MySQL
Copy the mysql-connector jar into the spark/jars/ directory.
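For example (the connector jar name/version and the Spark install path are assumptions; adjust them to your environment):
# copy the MySQL JDBC driver into Spark's jars directory (assumed jar name and Spark path)
cp mysql-connector-java-5.1.38-bin.jar /opt/soft/spark/jars/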
Connecting from IDEA
The code is as follows:
package nj.zb.kb11

import org.apache.spark.sql.{DataFrame, SparkSession}

object DataFrameToMysql {
  def main(args: Array[String]): Unit = {
    val spark: SparkSession = SparkSession.builder().appName("sparktohive")
      .master("local[*]").config("hive.metastore.uris", "thrift://192.168.146.222:9083")
      .enableHiveSupport() // enables Hive support (not strictly needed for the JDBC read)
      .getOrCreate()

    // JDBC connection parameters for the MySQL emp database
    val url = "jdbc:mysql://192.168.146.222:3306/emp"
    val user = "root"
    val password = "1"
    val properties = new java.util.Properties()
    properties.setProperty("user", user)
    properties.setProperty("password", password)
    properties.setProperty("driver", "com.mysql.jdbc.Driver")

    // read the emp table into a DataFrame
    val tableDF: DataFrame = spark.read.jdbc(url, "emp", properties)
    tableDF.printSchema()
    tableDF.show()

    // aggregate and write the result back to MySQL as table tttt
    import org.apache.spark.sql.functions._
    val frame: DataFrame = tableDF.agg(max("RETENTION"))
    frame.write.jdbc(url, "tttt", properties)
  }
}
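By default, write.jdbc fails if the target table already exists. A minimal sketch of controlling that behaviour with an explicit save mode, reusing frame, url and properties from the code above (SaveMode.Overwrite is just one of the options):
import org.apache.spark.sql.SaveMode

// overwrite the target table instead of failing when it already exists
frame.write.mode(SaveMode.Overwrite).jdbc(url, "tttt", properties)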
Connecting Spark to Hive
Configuration file
In the Hive installation directory, edit the hive-site.xml file under conf/ and add the following:
<property>
  <name>hive.server2.thrift.client.user</name>
  <value>root</value>
  <description>Username to use against thrift client</description>
</property>
<property>
  <name>hive.server2.thrift.client.password</name>
  <value>root</value>
  <description>Password to use against thrift client</description>
</property>
<property>
  <name>hive.metastore.uris</name>
  <value>thrift://192.168.146.222:9083</value>
</property>
After saving and exiting, copy the file into the spark/conf/ directory.
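For example (the Hive conf path follows the /opt/soft layout used by the service commands below; the Spark home is an assumption):
# copy the Hive client configuration into Spark's conf directory (Spark path assumed)
cp /opt/soft/hive/conf/hive-site.xml /opt/soft/spark/conf/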
Starting the Hive services
nohup /opt/soft/hive/bin/hive --service metastore &      # start the Hive Metastore (metadata) service
nohup /opt/soft/hive/bin/hive --service hiveserver2 &    # start the HiveServer2 service
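A quick way to confirm both services are listening before connecting (assuming netstat is available; 9083 is the metastore port configured above, 10000 is HiveServer2's default port):
# check that the Metastore (9083) and HiveServer2 (10000) ports are open
netstat -nltp | grep -E '9083|10000'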
Connecting from IDEA
package nj.zb.kb11

import org.apache.spark.sql.SparkSession

// read Hive data with Spark
object SparkToHive {
  def main(args: Array[String]): Unit = {
    val spark: SparkSession = SparkSession.builder().appName("sparktohive")
      .master("local[*]").config("hive.metastore.uris", "thrift://192.168.146.222:9083")
      .enableHiveSupport() // required for the Hive connection
      .getOrCreate()

    // list all databases visible through the Hive metastore
    spark.sql("show databases").collect().foreach(println)
  }
}
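The same session can also query and write Hive tables directly; a minimal sketch (test.employee is the table used in the spark-shell example below, and test.employee_backup is a hypothetical target table):
// read an existing Hive table into a DataFrame
val empDF = spark.table("test.employee")
empDF.show()

// persist the DataFrame back to Hive as a managed table (hypothetical table name)
empDF.write.saveAsTable("test.employee_backup")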
Connecting from spark-shell
This is the simplest approach: once the configuration file is in place, start spark-shell and enter the command directly:
spark.table("test.employee")
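A slightly fuller spark-shell session might look like this (the spark session object is predefined in the shell; test.employee is the table from the command above):
scala> spark.sql("show databases").show()
scala> spark.table("test.employee").printSchema()
scala> spark.table("test.employee").show()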