import requests
import pandas as pd
from bs4 import BeautifulSoup
import time
import csv
# Build the target page URL
def get_url(url):
    """Build the Bank of China exchange-rate listing URL for page 2.

    The ``url`` parameter is ignored (kept only for backward compatibility
    with existing callers). The loop over ``range(2, 3)`` yields a single
    value (i == 2) and returns on the first iteration, so the result is
    always the page-2 index URL.

    Returns:
        str: "http://www.boc.cn/sourcedb/whpj/index_2.html"
    """
    # NOTE(review): fixed typographic quotes (“ ”) that made this a
    # SyntaxError in the original paste.
    for i in range(2, 3):
        url = "http://www.boc.cn/sourcedb/whpj/index_" + str(i) + ".html"
        return url
# Fetch the web page
def get_html(url):
    """Download *url* and return its decoded HTML text.

    On any request failure this prints a notice and implicitly returns
    None — preserving the original best-effort behavior.

    Args:
        url: Address of the page to fetch.

    Returns:
        str | None: The response body on success, None on failure.
    """
    try:
        r = requests.get(url, timeout=3.5)
        r.raise_for_status()
        # Use the content-sniffed encoding so the Chinese page text
        # (served as GBK/GB2312) decodes correctly instead of mojibake.
        r.encoding = r.apparent_encoding
        html = r.text
        return html
    except requests.RequestException:
        # Narrowed from a bare `except:` so KeyboardInterrupt/SystemExit
        # still propagate; only network/HTTP errors are swallowed here.
        print('无法爬取')
# Parse the web page
def get_data(name_lsts, html):
    """Extract the table header cells from *html*.

    Appends one list — the string content of every ``<th>`` element, in
    document order — to *name_lsts*, which is mutated in place.

    Args:
        name_lsts: Accumulator list; receives one list of header strings.
        html: Raw HTML text of the exchange-rate page.
    """
    soup = BeautifulSoup(html, "html.parser")
    # Removed the unused `content = soup.prettify()` — it re-rendered the
    # whole document on every call and its result was never read.
    # Each <th> holds a column name; th.string is None for nested markup,
    # matching the original element-by-element append behavior.
    name_lst = [th.string for th in soup.find_all('th')]
    name_lsts.append(name_lst)
def get_da(data_lsts,html):
soup=BeautifulSoup(html,“html.parser”)
cont