Spark的LeftOuterJoin详解

最新推荐文章于 2023-08-01 07:51:21 发布

乐成阿宝

最新推荐文章于 2023-08-01 07:51:21 发布

阅读量4.7k

点赞数 1

分类专栏： SPARK 文章标签： spark

本文链接：https://blog.csdn.net/weixin_42881395/article/details/107227531

版权

一、RDD的LeftOuterJoin操作
1.1 RDD的LeftOuterJoin方法定义在Spark中，LeftOutJoin的方法源码定义如下：

/**   * Perform a left outer join of `this` and `other`. For each element (k, v) in `this`, the   * resulting RDD will either contain all pairs (k, (v, Some(w))) for w in `other`, or the   * pair (k, (v, None)) if no elements in `other` have key k. Hash-partitions the output   * using the existing partitioner/parallelism level.   */ 
 def leftOuterJoin[W](other: RDD[(K, W)]): RDD[(K, (V, Option[W]))] = self.withScope {
       leftOuterJoin(other, defaultPartitioner(self, other))  }   

/**   * Perform a left outer join of `this` and `other`. For each element (k, v) in `this`, the   * resulting RDD will either contain all pairs (k, (v, Some(w))) for w in `other`, or the   * pair (k, (v, None)) if no elements in `other` have key k. Hash-partitions the output   * into `numPartitions` partitions.   */  
def leftOuterJoin[W](      other: RDD[(K, W)],      numPartitions: Int): RDD[(K, (V, Option[W]))] = self.withScope {
       leftOuterJoin(other, new HashPartitioner</

最低0.47元/天解锁文章

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

乐成阿宝

关注关注

1
点赞
踩
4

收藏

觉得还不错? 一键收藏
0
评论
Spark的LeftOuterJoin详解

一、RDD的LeftOuterJoin操作 1.1 RDD的LeftOuterJoin方法定义在Spark中，LeftOutJoin的方法源码定义如下：/** * Perform a left outer join of `this` and `other`. For each element (k, v) in `this`, the * resulting RDD will either contain all pairs (k, (v, Some(w))) for w in `
复制链接

扫一扫