Spark SQL: Writing Out DataFrame Data and Reading/Writing Databases (Using MySQL as an Example)

Latest recommended article published on 2024-04-27 21:28:43

2401_83973995


Category: 2024 Programmer Learning. Tags: database, spark, sql

Article link: https://blog.csdn.net/2401_83973995/article/details/137743104


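from pyspark.sql import SparkSession
from pyspark.sql import functions as F
from pyspark.sql.types import StructType, StringType, IntegerType

# The start of this statement is cut off in the source; SparkSession.builder is assumed here
spark = SparkSession.builder.\
    appName('write').\
    master('local[*]').\
    getOrCreate()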

sc = spark.sparkContext

# 1. Read the file
schema = StructType().add('user_id', StringType(), nullable=True).\
    add('movie_id', IntegerType(), nullable=True).\
    add('rank', IntegerType(), nullable=True).\
    add('ts', StringType(), nullable=True)

df = spark.read.format('csv').\
    option('sep', '\t').\
    option('header', False).\
    option('encoding', 'utf-8').\
    schema(schema=schema).\
    load('../input/u.data')

# Write as text: the text format can only write a single column, so the df must first be converted into a single-column df
df.select(F.concat_ws('---', 'user_id', 'movie_id', 'rank', 'ts')).\
    write.\
    mode('overwrite').\
    format('text').\
    save('../output/sql/text')

# write csv
df.write.mode('overwrite').\
    format('csv').\
    opti
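
The listing breaks off at this point in the source. For the database part named in the title, the following is a minimal sketch, not the author's code, of how a DataFrame might be written to MySQL and read back through Spark's JDBC data source using `options`. The helper names (`mysql_jdbc_options`, `write_to_mysql`, `read_from_mysql`), the host, database, table name, and credentials are all hypothetical, and the sketch assumes the MySQL Connector/J jar is available on Spark's classpath.

```python
def mysql_jdbc_options(host, port, database, table, user, password):
    """Build the option dict for Spark's JDBC data source (hypothetical helper)."""
    return {
        'url': f'jdbc:mysql://{host}:{port}/{database}',
        'dbtable': table,      # target (or source) table name
        'user': user,
        'password': password,
    }


def write_to_mysql(df, options, mode='overwrite'):
    """Write a DataFrame to MySQL; 'overwrite' replaces the table contents."""
    df.write.mode(mode).format('jdbc').options(**options).save()


def read_from_mysql(spark, options):
    """Read a MySQL table back as a DataFrame; the schema comes from the table itself."""
    return spark.read.format('jdbc').options(**options).load()


# Placeholder connection details, assumed for illustration only
opts = mysql_jdbc_options('localhost', 3306, 'demo_db', 'movie_data', 'root', 'secret')
```

With a running MySQL instance, `write_to_mysql(df, opts)` followed by `read_from_mysql(spark, opts).show()` would round-trip the `df` loaded earlier. Unlike the CSV read, no schema needs to be passed when reading over JDBC, because the column types are taken from the database table.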
