记录:Flink 检查点和状态后端在实际生产中用得特别多,通过学习记录,如果有不对的地方大家多多指教
1 Flink checkpoint实战篇
1.1 Flink配置
jobmanager.rpc.address: dw501
jobmanager.rpc.port: 6123
jobmanager.memory.process.size: 1600m
taskmanager.memory.process.size: 1728m
taskmanager.numberOfTaskSlots: 1
parallelism.default: 1
state.backend: filesystem
state.checkpoints.dir: hdfs://dw501:9820/flink-demo
state.savepoints.dir: hdfs://dw501:9820/flink-savepoints
jobmanager.execution.failover-strategy: region
jobmanager.archive.fs.dir: hdfs://dw501:9820/completed-jobs/
historyserver.web.address: dw501
historyserver.web.port: 8082
historyserver.archive.fs.dir: hdfs://dw501:9820/completed-jobs/
historyserver.archive.fs.refresh-interval: 10000
上面的这些配置文件,大家可以可选也可以进行单独的其它的配置项
1.2 Flink wordcount
package com.wmy.example.wordcount;
import lombok.AllArgsConstructor;
import lombok.Data;
import lombok.NoArgsConstructor;
import org.apache.flink.api.common.functions.FlatMapFunction;
import org.apache.flink.api.common.restartstrategy.RestartStrategies;
import org.apache.flink.api.common.time.Time;
import org.apache.flink.api.common.typeinfo.TypeInformation;
import org.apache.flink.streaming.api.CheckpointingMode;
import org.apache.flink.streaming.api.d