spark 写入hive出错。 mysql驱动版本低。
spark写入Hive找不到表
16/12/21 17:57:36 INFO SparkSqlParser: Parsing command: dtw.dtw_sbt_adm_disease_result
Exception in thread “main” org.apache.spark.sql.catalyst.analysis.NoSuchDatabaseException: Database ‘dtw’ not found;
at org.apache.spark.sql.catalyst.catalog.ExternalCatalog.requireDbExists(ExternalCatalog.scala:37)
at org.apache.spark.sql.catalyst.catalog.InMemoryCatalog.tableExists(InMemoryCatalog.scala:271)
at org.apache.spark.sql.catalyst.catalog.SessionCatalog.tableExists(SessionCatalog.scala:255)
at org.apache.spark.sql.DataFrameWriter.saveAsTable(DataFrameWriter.scala:359)
at org.apache.spark.sql.DataFrameWriter.saveAsTable(DataFrameWriter.scala:354)
at com.hulb.sql.HiveOpeTest
.directInsertHive(HiveOpeTest.scala:245)atcom.hulb.sql.HiveOpeTest
.main(HiveOpeTest.scala:102)
at com.hulb.sql.HiveOpeTest.main(HiveOpeTest.scala)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:498)
at org.apache.spark.deploy.SparkSubmit.org。apache
spark
deploy
SparkSubmit
runMain(SparkSubmit.scala:736)atorg.apache.spark.deploy.SparkSubmit
.doRunMain
1(SparkSubmit.scala:185)atorg.apache.spark.deploy.SparkSubmit
.submit(SparkSubmit.scala:210)
at org.apache.spark.deploy.SparkSubmit$.main(SparkSubmit.scala:124)
at org.apache.spark.deploy.SparkSubmit.main(SparkSubmit.scala)
原因
没有添加hive支持
解决办法:
.enableHiveSupport()
spark sql
处理空字符串 或者null 或者NULL
不能用 !=”” 会把 所有都丢弃,返回结果为空。
用row=> !row.getString(1).equals(“”) 替代。