The spark.read.jdbc method

Latest recommended article published 2024-09-18 11:34:41

楓尘林间

Category: Spark

Article link: https://blog.csdn.net/bowenlaw/article/details/98612971

 jdbc(url, table, column=None, lowerBound=None, upperBound=None, numPartitions=None, predicates=None, properties=None)

    Construct a DataFrame representing the database table named table accessible via JDBC URL url and connection properties.

    Partitions of the table will be retrieved in parallel if either column or predicates is specified. lowerBound, upperBound and numPartitions are needed when column is specified.

    If both column and predicates are specified, column will be used.

    Note

    Don’t create too many partitions in parallel on a large cluster; otherwise Spark might crash your external database systems.
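
The two partitioning modes described above can be sketched in Python as follows. This is only an illustrative sketch, not part of the PySpark documentation: the helper names range_predicates, read_by_column and read_with_predicates, the column name "id", and the bounds used are all hypothetical. The only Spark call used is spark.read.jdbc with the keyword arguments from the signature above; the spark argument is assumed to be an existing SparkSession.

```python
def range_predicates(column, lower, upper, num_partitions):
    """Build WHERE-clause strings that split [lower, upper) into contiguous ranges.

    Hypothetical helper for the `predicates` argument: each returned string
    is meant to define one partition. Rows outside [lower, upper) match none
    of the predicates, so they would not be read.
    """
    if num_partitions < 1 or upper <= lower:
        raise ValueError("need num_partitions >= 1 and upper > lower")
    # Ceiling division: the width of each range.
    stride = -(-(upper - lower) // num_partitions)
    predicates = []
    start = lower
    while start < upper:
        end = min(start + stride, upper)
        predicates.append(f"{column} >= {start} AND {column} < {end}")
        start = end
    # May contain fewer than num_partitions entries when the span is small.
    return predicates


def read_by_column(spark, url, table, properties):
    # Column mode: lowerBound, upperBound and numPartitions must all be
    # given together with column. "id" and the bounds are assumed values.
    return spark.read.jdbc(
        url, table,
        column="id", lowerBound=0, upperBound=1000, numPartitions=4,
        properties=properties,
    )


def read_with_predicates(spark, url, table, predicates, properties):
    # Predicates mode: pass the list of WHERE-clause strings instead of column.
    return spark.read.jdbc(url, table, predicates=predicates, properties=properties)
```

For example, range_predicates("id", 0, 10, 4) yields four ranges, the last one being "id >= 9 AND id < 10". Keeping num_partitions small is in line with the note above about not opening too many parallel connections to the database.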

    Parameters

            url – a JDBC URL of the form jdbc:subprotocol:subname

            table – the name of the table

            column – the name of an integer column that will be used for partitioning; if this parameter is specified, then numPartitions, lowerBound