借鉴了某博主@Yunheeee,发现写的很不错,帮助我完成了此次作业,感谢!
紧接着,我跟着它的代码尝试了一下,大体如下:
首先引入库
import requests
import numpy as np
import pandas as pd
接着设置url的网站
url='https://datachart.500.com/ssq/history/newinc/history.php?start=03001'
#获取历史所有双色球数据
response = requests.get(url)
response.encoding = 'utf-8'
re_text = response.text
#网页数据解析
re=re_text.split('<tbody id="tdata">')[1].split('</tbody>')[0]
result=re.split('<tr class="t_tr1">')[1:]
all_numbers=[]
for i in result:
each_numbers=[]
i=i.replace('<!--<td>2</td>-->','')
each=i.split('</td>')[:-1]
for j in each:
each_numbers.append