使用Pandas的read_html方法读取网页Table表格数据

最新推荐文章于 2024-01-15 16:04:59 发布

彭世瑜

最新推荐文章于 2024-01-15 16:04:59 发布

阅读量7k

点赞数 5

本文为博主原创文章，欢迎转载，请注明出处

本文链接：https://blog.csdn.net/mouday/article/details/105278570

版权

本文通过一个小实例，说明使用Pandas的read_html方法读取网页Table表格数据

要读取的网页表格数据
http://vip.stock.finance.sina.com.cn/q/go.php/vComStockHold/kind/jjzc/index.phtml

在这里插入图片描述
完整代码

# -*- coding: utf-8 -*-

import pandas as pd

# 数据出现省略号
pd.set_option('display.width', None)

url = 'http://vip.stock.finance.sina.com.cn/q/go.php/vComStockHold/kind/jjzc/index.phtml'

# 可能有多个表格，我们取第一个
df = pd.read_html(url)[0]
# print(data)

# 保存数据
df.to_csv('./data.csv', encoding='utf-8')