1 爬取菜谱网站
目标:爬取热门菜谱清单,内含:菜名、原材料、详细烹饪流程的URL。
url:http://www.xiachufang.com/explore/
import requests
from bs4 import BeautifulSoup
url = 'http://www.xiachufang.com/explore/'
sv = {
'user-agent': 'Moziller/5.0'}
r = requests.get(url, headers=sv)
r.encoding = 'utf-8'
html = r.text
soup = BeautifulSoup(html, 'html.parser')
items = soup.find_all('div', class_='info pure-u')
print(r.status_code)
ls=[]
for item in items:
name = item.find('a