【云星数据---Apache Flink实战系列(精品版)】：Apache Flink批处理API详解与编程实战015--DateSet实用API详解015

最新推荐文章于 2023-03-08 20:16:21 发布

李国华技术博客

最新推荐文章于 2023-03-08 20:16:21 发布

阅读量7k

点赞数

分类专栏： bigdata cloudcomputing flink 文章标签： apache api string 批处理编程

本文链接：https://blog.csdn.net/liguohuabigdata/article/details/78557871

版权

bigdata 同时被 3 个专栏收录

187 篇文章 2 订阅

订阅专栏

cloudcomputing

183 篇文章 0 订阅

订阅专栏

flink

86 篇文章 57 订阅

订阅专栏

DateSet的API详解十五

getParallelism

def getParallelism: Int

Returns the parallelism of this operation.

获取DataSet的并行度。

执行程序：

//1.创建一个 DataSet其元素为String类型
val input0: DataSet[String] = benv.fromElements("A", "B", "C")

//2.获取DataSet的并行度。
input0.getParallelism

执行结果：

res98: Int = 1

setParallelism

def setParallelism(parallelism: Int): DataSet[T]

Sets the parallelism of this operation. This must be greater than 1.

设置DataSet的并行度，设置的并行度必须大于1

执行程序：

//1.创建一个 DataSet其元素为String类型
val input0: DataSet[String] = benv.fromElements("A", "B", "C")

//2.设置DataSet的并行度。
input0.setParallelism(2)

//3.获取DataSet的并行度。
input0.getParallelism

执行结果：

res102: Int = 2

writeAsText

def writeAsText(filePath: String, writeMode: WriteMode = null): DataSink[T]

Writes this DataSet to the specified location.

将DataSet写出到存储系统。不同的存储系统写法不一样。

hdfs文件路径：
    hdfs:///path/to/data
本地文件路径：
    file:///path/to/data

执行程序：

//1.创建 DataSet[Student]
case class Student(age: Int, name: String,height:Double)
val input: DataSet[Student] = benv.fromElements(
Student(16,"zhangasn",194.5),
Student(17,"zhangasn",184.5),
Student(18,"zhangasn",174.5),
Student(16,"lisi",194.5),
Student(17,"lisi",184.5),
Student(18,"lisi",174.5))

//2.将DataSet写出到存储系统
input.writeAsText("hdfs:///output/flink/dataset/testdata/students.txt")

//3.执行程序
benv.execute()

hadoop web ui中的执行效果：

这里写图片描述

terminal中查看文件效果：

这里写图片描述

李国华技术博客

关注

0
点赞
踩
1

收藏

觉得还不错? 一键收藏
0
评论
【云星数据---Apache Flink实战系列(精品版)】：Apache Flink批处理API详解与编程实战015--DateSet实用API详解015

DateSet的API详解十五getParallelismdef getParallelism: IntReturns the parallelism of this operation.获取DataSet的并行度。执行程序：//1.创建一个 DataSet其元素为String类型val input0: DataSet[String] = benv.fromElements("A", "B", "
复制链接

扫一扫