Python 网络爬虫权威指南第一章练习

最新推荐文章于 2024-09-09 23:28:21 发布

学技术的翻译小白

最新推荐文章于 2024-09-09 23:28:21 发布

阅读量132

点赞数

分类专栏：爬虫文章标签： python

本文链接：https://blog.csdn.net/Laurencenter/article/details/112097306

版权

爬虫专栏收录该内容

11 篇文章 0 订阅

订阅专栏

获取网页的标题：

from urllib.request import urlopen
from urllib.error import URLError
from bs4 import BeautifulSoup


def get_title(url):
    try:
        html = urlopen(url)
    except URLError as e:
        return None
    try:
        bs = BeautifulSoup(html.read(), 'html.parser')
        title = bs.body.h1
    except AttributeError as e:
        return None
    return title


my_title = get_title('https://www.alibabacloud.com/zh/'
                     'knowledge/what-is-cloud-computing?spm=a3c0i.243649.2033761600.2.a974d9130g0iYV')
if my_title is None:
    print('Title could not be found.')
else:
    print(my_title)