Spark SQL 和 Hive 的交互

最新推荐文章于 2023-03-08 17:54:06 发布

ManBeCool

最新推荐文章于 2023-03-08 17:54:06 发布

阅读量925

点赞数

文章标签： Spark SQL Hive

本文链接：https://blog.csdn.net/sinat_34763749/article/details/81282376

版权

Spark SQL能够读写Hive表，兼容大部分Hive函数和特性，但不支持如桶分区、UNION类型等特定Hive功能。文章讨论了两者之间的兼容性和不支持的Hive特性。

摘要由CSDN通过智能技术生成

Spark SQL可以读写Hive表

Spark SQL also supports reading and writing data stored in Apache Hive. However, since Hive has a large number of dependencies, these dependencies are not included in the default Spark distribution. If Hive dependencies can be found on the classpath, Spark will load them automatically. Note that these Hive dependencies must also be present on all of the worker nodes, as they will need access to the Hive serialization and deserialization libraries (SerDes) in order to access data stored in Hive.
Ref: https://spark.apache.org/docs/2.2.0/sql-programming-guide.html#hive-tables