How do I define the data types for the following data using StructType in Spark Java?
sam|mars|1234567|"report": {"Details": [{"subject": "science","grade": "A","remark": "good"},{"subject": "maths","grade": "E","remark": "excellent"},{"subject": "geography","grade": "E","remark": "excellent"}]}
harry|venus|987654|"report": {"Details": [{"subject": "science","grade": "O","remark": "outstanding"},{"subject": "history","grade": "A","remark": "good"}]}
The fields are: NAME, ADDRESS, ID, REPORTCARD
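For reference, here is a minimal plain-Java sketch of how one such line might be split into those four fields before it is turned into a Row. The class name `ReportLineSplitter` and the method `splitLine` are hypothetical, not part of Spark. Wrapping the last field in braces is an assumption, made only so that REPORTCARD becomes a standalone JSON object. Each resulting value could then presumably be declared as a `StringType` column with `DataTypes.createStructField` (with ID alternatively as a numeric type), but that Spark part is not shown here.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical helper, not part of Spark: splits one pipe-delimited record.
class ReportLineSplitter {
    // Field order taken from the question: NAME, ADDRESS, ID, REPORTCARD.
    static final List<String> FIELD_NAMES =
            Arrays.asList("NAME", "ADDRESS", "ID", "REPORTCARD");

    static String[] splitLine(String line) {
        // Limit the split to 4 parts so a '|' inside the REPORTCARD text
        // would stay in the last field instead of creating extra columns.
        String[] parts = line.split("\\|", FIELD_NAMES.size());
        if (parts.length != FIELD_NAMES.size()) {
            throw new IllegalArgumentException(
                    "Expected " + FIELD_NAMES.size() + " fields but got " + parts.length);
        }
        // Assumption: the raw field looks like  "report": {...}  which is not
        // a complete JSON document, so wrap it in braces to make it one.
        parts[3] = "{" + parts[3].trim() + "}";
        return parts;
    }
}
```

Calling `ReportLineSplitter.splitLine(line)` on the first sample row would yield `sam`, `mars`, `1234567`, and the brace-wrapped report text, in that order.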
I have the following code:
JavaRDD<Row> row = javaRDD.map(new Function<String, Row>() {
@Override
public Row call(String line) throws Exception {
return RowFactory