爬虫学习第二天ajax请求

最新推荐文章于 2024-07-12 16:42:46 发布

ChinaGeographer

最新推荐文章于 2024-07-12 16:42:46 发布

阅读量120

点赞数 1

分类专栏： python爬虫学习文章标签：爬虫

本文链接：https://blog.csdn.net/weixin_45547832/article/details/99967874

版权

python爬虫学习专栏收录该内容

6 篇文章 0 订阅

订阅专栏

爬虫学习第二天ajax请求

目标抓取豆瓣网动态页面的电影目录
代码如下

from urllib.request import Request,urlopen
from fake_useragent import UserAgent
base_url = "https://movie.douban.com/j/chart/top_list?type=5&interval_id=100%3A90&action=&start={}&limit=20"
i = 0
while True:
    headers = {
        "User-Agent": UserAgent().chrome
    }
    url = base_url.format(i * 20)
    request = Request(url,headers=headers)
    response =urlopen(request)
    info = response.read().decode()
    print(info)
    if info==""or info is None:
        break
    i += 1

中间遇到的问题有：
1、请求模块Request写掉了导致报错：NameError: name ‘Request’ is not defined
2、在代码运行的时候最后出现了很多的：[] 问了大佬，大佬们说是应该匹配到了，不知道博客里面的大佬有没有知道原因的，该怎么处理这个问题的。
注：小菜鸟一枚刚接触python，代码照着视频里老师写的。

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

ChinaGeographer

关注关注

1
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
爬虫学习第二天ajax请求

爬虫学习第二天ajax请求目标抓取豆瓣网动态页面的电影目录代码如下from urllib.request import Request,urlopenfrom fake_useragent import UserAgentbase_url = "https://movie.douban.com/j/chart/top_list?type=5&interval_id=100%3A9...
复制链接

扫一扫