(1)spark报错:run() received nonzero return code 1 while executing!
self.df_judgedoc_info_sample = self.session.read.option("multiLine", True).load(
self.judgedoc_info_sample_table_input, format="csv", schema=self.judgedoc_info_schema, delimiter=',',
escape='"')
去掉后面的repartition(1)
出错现象:会打印出许多汉字,报索引出界,有可能是repartition(1)之后,一个节点不够用导致的;
(2)spark报错2:
原表当中有些字段是这样的:
spark报错
最新推荐文章于 2022-05-04 18:33:41 发布