Java Spark Dataset: how to convert a custom Java class to a Spark Dataset

I can't figure out how to convert a List of Test objects to a Dataset in Spark.

This is my class:

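import java.util.ArrayList;
import java.util.List;

public class Test {
    public String a;
    public String b;

    public Test(String a, String b) {
        this.a = a;
        this.b = b;
    }

    // Returns both fields as a list.
    public List<String> getList() {
        List<String> l = new ArrayList<>();
        l.add(this.a);
        l.add(this.b);
        return l;
    }
}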

Solution

The code you posted in the comments for creating a DataFrame is correct. The problem is in how Test is defined: that code only works with Java Beans, and your Test class is not one. A Java Bean needs a public no-argument constructor and a getter and setter for each property, whereas Test has only public fields and a two-argument constructor. Once you fix that, you can use the following code to create a DataFrame:

Dataset<Row> dataFrame = spark.createDataFrame(listOfTestClasses, Test.class);

and these lines to create a typed Dataset:

Encoder<Test> encoder = Encoders.bean(Test.class);

Dataset<Test> dataset = spark.createDataset(listOfTestClasses, encoder);
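
For reference, here is a minimal sketch of what Test might look like once it is rewritten as a Java Bean. The original answer does not show the fixed class, so this is only one possible shape. The assumptions are: the fields become private, a public no-argument constructor and getter/setter pairs are added, the class implements Serializable (as the bean example in the Spark documentation does), and getList() is renamed to the hypothetical toList() so that it is not mistaken for a getter of a bean property named "list".

```java
import java.io.Serializable;
import java.util.ArrayList;
import java.util.List;

// Sketch of Test as a Java Bean (an assumption, not the asker's final code).
public class Test implements Serializable {
    private String a;
    private String b;

    // Public no-argument constructor, required by the JavaBean convention.
    public Test() {
    }

    // Convenience constructor kept from the original class.
    public Test(String a, String b) {
        this.a = a;
        this.b = b;
    }

    // Getter/setter pair for property "a".
    public String getA() {
        return a;
    }

    public void setA(String a) {
        this.a = a;
    }

    // Getter/setter pair for property "b".
    public String getB() {
        return b;
    }

    public void setB(String b) {
        this.b = b;
    }

    // Hypothetical rename of getList(): without the "get" prefix this
    // method is not treated as a bean property getter.
    public List<String> toList() {
        List<String> l = new ArrayList<>();
        l.add(a);
        l.add(b);
        return l;
    }
}
```

With a class shaped like this, the createDataFrame and createDataset calls above should produce columns a and b, one row per Test object in the list.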
