关于爬虫的request的时间问题

最新推荐文章于 2023-07-10 03:42:47 发布

天天码怪

最新推荐文章于 2023-07-10 03:42:47 发布

阅读量600

点赞数

分类专栏：爬虫 Python

本文链接：https://blog.csdn.net/qq_38832624/article/details/94736327

版权

Python 同时被 2 个专栏收录

10 篇文章 0 订阅

订阅专栏

爬虫

6 篇文章 0 订阅

订阅专栏

def get_content(url):
    # try:
    resp = requests.get(url, headers=header, timeout=0.5)
    resp.encoding = 'utf-8'
    html = resp.text
    bs = BeautifulSoup(html, "html.parser")
    # except:
    #     bs = "死链"
    # print(bs)
    # a = input("pause")
    return str(bs)

原本的代码是包括注释的，因为源数据中有很多死链，所以我设置了timeout为0.5。结果大多数百度都爬不到了，最后去了注释成功了

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

天天码怪

关注关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
关于爬虫的request的时间问题

def get_content(url): # try: resp = requests.get(url, headers=header, timeout=0.5) resp.encoding = 'utf-8' html = resp.text bs = BeautifulSoup(html, "html.parser") # except: ...
复制链接

扫一扫