【爬虫基础】第10讲 urlerror的使用及捕获异常

娜年花开666

已于 2024-04-09 13:12:00 修改

阅读量355

点赞数 3

分类专栏： # 爬虫基础文章标签： Python 爬虫

于 2024-03-28 15:44:12 首次发布

本文链接：https://blog.csdn.net/a272329874a/article/details/137113715

版权

爬虫基础专栏收录该内容

22 篇文章 3 订阅

订阅专栏

URLError是Python中的一个异常类，用于处理与URL相关的错误。它是urllib.error模块中的一个类。

URLError通常在以下情况下被引发：

网络连接问题：例如无法连接到服务器、超时等。
URL不正确：例如无效的URL、无法解析主机名等。
服务器错误：例如服务器返回500错误。

以下是使用URLError处理URL连接错误的示例.我们尝试打开一个不存在的URL，并使用try-except语句来捕获可能发生的URLError异常。

如果包含code属性，则说明是服务器错误。

from urllib.request import Request,urlopen
from fake_useragent import UserAgent
from urllib.error import URLError

url='http://127.0.0.1:81/1123/'
headers = {
    'User-Agent' : UserAgent().chrome
}
req = Request(url,headers=headers)
try:
    resp = urlopen(req)
    print(resp.read().decode())
except URLError as e:
    # print(e)
    if e.args:
        print(e.args[0].errno)
    else:
        print(e.code)
print('爬取完毕')

代码执行结果

如果URLError包含reason属性，则说明是网络连接问题；

代码实现：

from urllib.request import Request,urlopen
from fake_useragent import UserAgent
from urllib.error import URLError

url='http://127.0.0.1:5000/zendao/'
headers = {
    'User-Agent' : UserAgent().chrome
}
req = Request(url,headers=headers)
try:
    resp = urlopen(req)
    print(resp.read().decode())
except URLError as e:
    # print(e)
    if e.args:
        print(e.args[0].errno)
    else:
        print(e.code)
print('爬取完毕')

执行结果：