sparksql之通过 structType 创建 DataFrames（编程接口）

最新推荐文章于 2024-03-17 10:02:47 发布

qq_35561207

最新推荐文章于 2024-03-17 10:02:47 发布

阅读量865

点赞数

分类专栏：大数据组件

本文链接：https://blog.csdn.net/qq_35561207/article/details/103037917

版权

import org.apache.avro.generic.GenericData.StringType
import org.apache.spark.rdd.RDD
import org.apache.spark.sql.types.{IntegerType, StructField, StructType}
import org.apache.spark.sql.{DataFrame, Row, SQLContext}
import org.apache.spark.{SparkConf, SparkContext}
import org.spark_project.dmg.pmml.True

object TestDataFrame2 {
  def main(args: Array[String]): Unit = {
    val conf=new SparkConf().setAppName("TestDataFrame2").setMaster("local")
    val sc=new SparkContext(conf)
    val SQLContext=new SQLContext(sc)
    //将本地的数据读入RDD
    val fileRDD=sc.textFile("D:\\Users\\shashahu\\Desktop\\work\\spark-2.4.4\\examples\\src\\main\\resources\\people.txt")
    //将RDD数据映射成Row,需要import org.apache.spark.sql.row
    val rowRDD:RDD[Row]=fileRDD.map(line=>{
      val fields=line.split(",")
      Row(fields(0),fields(1).trim.toInt)
    })

    //创建StructType来定义结构
    val structType: St

最低0.47元/天解锁文章

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

qq_35561207

关注关注

0
点赞
踩
1

收藏

觉得还不错? 一键收藏
0
评论
sparksql之通过 structType 创建 DataFrames（编程接口）

import org.apache.avro.generic.GenericData.StringTypeimport org.apache.spark.rdd.RDDimport org.apache.spark.sql.types.{IntegerType, StructField, StructType}import org.apache.spark.sql.{DataFrame, ...
复制链接

扫一扫