Spark
Sinsa_SI
数据玩家|编程教练|自由人 的自媒体。
分享编程、数据、风控、反欺诈、励志等方面经验和知识。
展开
-
【解决方案】pyspark 绘图报错:_tkinter.TclError: no display name and no $DISPLAY environment variable
问题描述matplotlib画图失败(pyspark、pyspark3),报错如下:no display name and no $DISPLAY environment variableTraceback (most recent call last): File "<stdin>", line 21, in plot_with_labels File "/usr/in...原创 2019-10-31 19:11:18 · 2358 阅读 · 1 评论 -
[解决办法] Cannot have map type columns in DataFrame which calls set operations(intersect, except, etc.)
[解决办法] Sql执行错误:org.apache.spark.sql.AnalysisException: Cannot have map type columns in DataFrame which calls set operations(intersect, except, etc.), but the type of column extend_value is map&lt;stri...原创 2018-09-29 15:29:40 · 4855 阅读 · 2 评论 -
[解决办法] Invalid PythonUDF <lambda>(), requires attributes from more than one child.
[解决办法] Invalid PythonUDF (), requires attributes from more than one child.报题中的错误,解决办法:在过滤过程前 加 df.cache() (这里的 df 为过滤的 DataFrame)The sequence of steps that causes this are:join two dataframes A a...原创 2018-10-17 19:51:47 · 1919 阅读 · 0 评论 -
[解决方案]spark 2.4 报错:grouping expressions sequence is empty, *** is not an aggregate function.
一、报错详情codeselect id , content_mapfrom test_db.test_tbhaving content_map is not null errorgrouping expressions sequence is empty, and 'test_db.test_tb.`id`' is not an aggregate function. Wrap ...原创 2019-03-13 11:08:29 · 17223 阅读 · 0 评论