spark连接hive

最新推荐文章于 2022-11-21 16:01:13 发布

Cnnnnnnnnn，

最新推荐文章于 2022-11-21 16:01:13 发布

阅读量150

点赞数

本文链接：https://blog.csdn.net/weixin_43698185/article/details/103312311

版权

一、读取Hive中的数据加载成DataFrame

HiveContext是SQLContext的子类，连接Hive建议使用HiveContext。

由于本地没有Hive环境，要提交到集群运行，提交命令：

./spark-submit 
--master spark://node1:7077,node2:7077 
--executor-cores 1 
--executor-memory 2G 
--total-executor-cores 1
--class com.bjsxt.sparksql.dataframe.CreateDFFromHive 
/root/test/HiveTest.jar

java

SparkConf conf = new SparkConf();
conf.setAppName("hive");
JavaSparkContext sc = new JavaSparkContext(conf);
//HiveContext是SQLContext的子类。
HiveContext hiveContext = new HiveContext(sc);
hiveContext.sql("USE spark");
hiveContext.sql("DROP TABLE IF EXISTS student_infos");
//在hive中创建student_infos表
hiveContext.sql("CREATE TABLE IF NOT EXISTS student_infos (name STRING,age INT) row format delimited fields terminated by '\t' ");
hiveContext.sql("load data loc

最低0.47元/天解锁文章

Cnnnnnnnnn，

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
spark连接hive

一、读取Hive中的数据加载成DataFrameHiveContext是SQLContext的子类，连接Hive建议使用HiveContext。由于本地没有Hive环境，要提交到集群运行，提交命令：./spark-submit --master spark://node1:7077,node2:7077 --executor-cores 1 --executor-memory ...
复制链接

扫一扫