爬虫入门学习笔记3
get()和post()的数据请求
- 在get()里面叫paramas
- 在post()里面叫data
尝试动态页面的爬取(豆瓣排名)
老规矩,先附上源代码:
import requests
import json
url='https://movie.douban.com/j/chart/top_list?'
header={
"User-Agent":"Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/87.0.4280.88 Safari/537.36"}
params = {
'type':'5',
'interval_id':'100:90',
'action':'',
'start': '0',
'limit':'20'
}
response=requests.get(url=url,headers=header,params=params).content.decode()
res=json.loads(response)
for i in res:
rating = i['rating'][