话不多说,直接上代码!
import requests
from bs4 import BeautifulSoup
from urllib import parse
import time
headers = {“User-Agent”:“Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/84.0.4147.105 Safari/537.36 Edg/84.0.522.52”}
def get_html(url):
html = requests.get(url,headers=headers)
if html.status_code==200:
print(“获取页面成功”)
parse_html(html.text)
else:
print(“ERROR”,html.text)
return
def parse_html(content):
soup = BeautifulSoup(content,‘lxml’)
trs = soup.select(‘table tbody tr’)
for tr in trs:
title = tr.select_one(‘td a’).text
url = tr.select_one(‘td a’)[‘href’]
如果你也是看准了Python,想自学Python,在这里为大家准备了丰厚的免费学习大礼包,带大家一起学习,给大家剖析Python兼职、就业行情前景的这些事儿。