Python爬取猫眼榜单

最新推荐文章于 2022-01-01 12:18:34 发布

菜鸡_王大仙

最新推荐文章于 2022-01-01 12:18:34 发布

阅读量160

点赞数

文章标签： Python

本文链接：https://blog.csdn.net/qq_20253377/article/details/96483444

版权

import urllib.request
import urllib.parse

url = "http://maoyan.com/board/4?"
headers ={“User-Agent”:“Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/75.0.3770.100 Safari/537.36”}

i = 1
while 1:
offset = (i-1)*10
parms = {‘offset’:offset}
parms = urllib.parse.urlencode(parms)
urls = url + parms
request = urllib.request.Request(urls,headers =headers)
response = urllib.request.urlopen(request)
html = response.read().decode(“utf-8”)

with open("第%d页.html" % i, 'a', encoding='utf-8') as f:
    print("正在写入第%d页" % i)
    f.write(html)
    print("第%d页写入完成" % i)

# if  not response:
#     print("爬取已完成，爬虫自动关闭")
#     break
num = input("是否继续爬取（y/n）

最低0.47元/天解锁文章

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

菜鸡_王大仙

关注关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
Python爬取猫眼榜单

import urllib.requestimport urllib.parseurl = "http://maoyan.com/board/4?"headers ={“User-Agent”:“Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/75.0.3770.1...
复制链接

扫一扫