Several Ways to Join in Hive

mustbesomebody

Category: Hadoop

Article link: https://blog.csdn.net/QQ331948781/article/details/63254859

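This article walks through several join operations in Hive, including how to handle the NULL rows that appear when an imported file contains empty lines, and covers left, right, and full joins and how they are used. Worked examples show how to delete the NULL rows and explain how each type of JOIN differs in which rows it keeps.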

The experiments below try out several join operations commonly used in Hive.

First, create two files to load into the tables:

hadoop@master:~/17$ cat data1
1,a
2,b
3,c
4,d
5,e
8,u
9,r

hadoop@master:~/17$ cat data2
1,aa
2,gg
7,www
19,ee

Steps:

1. Create the Hive tables
create table a(id int, name string) row format delimited fields terminated by ',';
create table b(id int, name string) row format delimited fields terminated by ',';

2. Load the data
load data local inpath '/home/hadoop/17/data1' into table a;
load data local inpath '/home/hadoop/17/data2' into table b;
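
With both tables loaded, the join types this article covers can be tried directly. The queries below are a minimal sketch, not the author's original statements; the column aliases a_name and b_name are illustrative, and the row counts in the comments assume the tables hold exactly the rows listed in data1 and data2, with no blank lines.

```sql
-- Inner join: only ids present in both tables (1 and 2) are returned.
select a.id, a.name as a_name, b.name as b_name
from a join b on a.id = b.id;

-- Left outer join: all 7 rows of a are kept; b's columns are NULL
-- for ids that exist only in a (3, 4, 5, 8, 9).
select a.id, a.name as a_name, b.name as b_name
from a left outer join b on a.id = b.id;

-- Right outer join: all 4 rows of b are kept; a's columns are NULL
-- for ids that exist only in b (7, 19).
select b.id, a.name as a_name, b.name as b_name
from a right outer join b on a.id = b.id;

-- Full outer join: rows from both sides are kept, 9 rows in total
-- (2 matched, 5 only in a, 2 only in b).
select a.id as a_id, b.id as b_id, a.name as a_name, b.name as b_name
from a full outer join b on a.id = b.id;
```

Hive does not guarantee the order of the returned rows unless the query adds an order by clause.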

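Note:

If the imported file contains empty lines, rows with NULL values will appear in the table. For the int column, test for them with is NULL or is not NULL. The original note suggests testing string columns with ='NULL' or !='NULL'; that comparison only matches the literal text 'NULL', so a genuinely missing string value should also be tested with is NULL or is not NULL.

To delete the NULL rows, you can do this: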
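
A quick way to see whether any rows came from blank lines is to filter on the id column. This is a small sketch against table a as created above; it assumes nothing beyond the two columns id and name.

```sql
-- Rows produced by blank lines have no parseable id, so id is NULL.
select * from a where id is null;

-- Count the rows that carry a real id.
select count(*) from a where id is not null;
```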

insert overw
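
The statement above is cut off in the source. One common way to drop such rows, offered here as an assumption about what a complete statement could look like and not as the author's exact text, is to overwrite the table with only the rows whose id is not NULL.

```sql
-- Assumed sketch: rewrite table a, keeping only rows with a real id.
insert overwrite table a
select * from a where id is not null;

-- The same idea applied to table b.
insert overwrite table b
select * from b where id is not null;
```

Because insert overwrite replaces the table's contents, it is worth checking the select on its own before running the full statement.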
