创建爬虫-----爬虫异常处理：

最新推荐文章于 2024-10-16 11:04:34 发布

yyq675886993

最新推荐文章于 2024-10-16 11:04:34 发布

阅读量368

点赞数 1

分类专栏： python爬虫文章标签：爬虫异常处理

本文链接：https://blog.csdn.net/yyq675886993/article/details/73996290

版权

python爬虫专栏收录该内容

5 篇文章 0 订阅

订阅专栏

爬虫异常处理：

from urllib.request import urlopen
from urllib.error import HTTPError,URLError
from bs4 import BeautifulSoup
def getTitle(url):
      try:
         html=urlopen(url)
      except(HTTPError,URLError) as e:
         return None
      try:
         bsObj=BeautifulSoup(html.read())
         title=bsObj.body.h1
      except AttributeError as e:
         return None
      return title
title=getTitle("http://www.pythonscraping.com/pages/pages1.html")
if title==None:
     print("title could not be found")
else:
    print(title)