pyspark读取ES数据报数组错误
...
20/12/03 10:56:04 WARN Resource: Detected type name in resource [media_index/media]. Type names are deprecated and will be removed in a later release.
20/12/03 10:56:12 WARN ScalaRowValueReader: Field 'tags' is backed by an array but the associated Spark Schema does not reflect this;
(use es.read.field.as.array.include/exclude)
20/12/03 10:56:12 WARN ScalaRowValueReader: Field 'vod_cat_tags' is backed by an array but the associated Spark Schema does not reflect this;
(use es.read.field.as.array.include/exclude)
20/12/03 10:56:12 WARN ScalaRowValueReader: Field 'vod_cat_tags' is backed by an array but the associated Spark Schema does not reflect this;
(use es.read.field.as.array.include/exclude)
20/12/03 10:56:12 WARN ScalaRowValueReader: Field 'vod_cat_tags' is backed by an array but the associated Spark Schema does not reflect this;
(use es.read.field.as.array.include/exclude)
20/12/03 10:56:12 WARN ScalaRowValueReader: Field 'vod_cat_tags' is backed by an array but the associated Spark Schema does not reflect this;
(use es.read.field.as.array.include/exclude)
20/12/03 10:56:12 ERROR Executor: Exception in task 0.0 in stage 17.0 (TID 17)
org.elasticsearch.hadoop.rest.EsHadoopParsingException: org.elasticsearch.hadoop.EsHadoopIllegalStateException: Field 'vod_cat_tags.level5_name' not found; typically this occurs with arrays which are not mapped as single value
at org.elasticsearch.hadoop.serialization.ScrollReader.readHit(ScrollReader.java:514)
at org.elasticsearch.hadoop.serialization.ScrollReader.read(ScrollReader.java:292)
at org.elasticsearch.hadoop.serialization.ScrollRe