Spark报错处理系列之:Could not execute broadcast in 300 secs. You can increase the timeout for broadcasts via spark.sql.broadcastTimeout or disable broadcast join by setting spark.sql.autoBroadcastJoinThreshold to -1
一、完整报错
- 23/12/15 18:44:36 ERROR FileFormatWriter: Aborting job 34ec90b0-cff3-4eb6-ae00-552f511f8d4b.
org.apache.spark.SparkException: Could not execute broadcast in 300 secs. You can increase the timeout for broadcasts via spark.sql.broadcastTimeout or disable broadcast join by setting spark.sql.autoBroadcastJoinThreshold to -1
at org.apache.spark.sql.execution.adaptive.BroadcastQueryStageExec$$anon 1. r u n ( Q u
博客详细介绍了Spark在执行广播操作时遇到的超时问题,分析了`spark.sql.broadcastTimeout`参数的含义和作用,以及广播数据过大或网络环境导致的超时原因。提供了解决方案,包括调整超时时间和关闭自动Broadcast Join,并给出了官方配置文档链接。
订阅专栏 解锁全文
215

被折叠的 条评论
为什么被折叠?



