Python web scraping exam question notes: getting the content of class="title"

Latest recommended article published on 2022-09-18 19:16:36

Wang.T


Category: python. Tags: python, web scraping, BeautifulSoup, requests, class=title

Article link: https://blog.csdn.net/qq_39360985/article/details/94437306


This post records a Python web scraping exam question: fetch the HTML from a given URL and use the BeautifulSoup library to extract the text inside the p tag whose class is 'title'.


A note on a simple scraping question from a Python exam:
Given a URL, fetch the HTML content and extract the text of the p tag with class="title".

import requests
from bs4 import BeautifulSoup

html = """
<html><head><title>The Dormouse's story</title></head>
<body>
<p class="title" name="dromouse"><b>The Dormouse's story</b></p>
<p class="story">Once upon a time there were three little sisters; and their names were
<a href="http://example.com/elsie" class="sister" id="link1"><!-- Elsie --></a>,
<a href="http://example.com/lacie" class="sister" id="link2">Lacie</a> and
<a href="http://example.com/tillie" class="sister" id="link3">Tillie</a>;
and they lived at the bottom of a well.</p>
<p class="story">...</p>"""

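# Parse the HTML string with Python's built-in parser
bsObj = BeautifulSoup(html, "html.parser")
print(bsObj.title)         # the whole <title> tag
print(bsObj.title.string)  # only the text inside <title>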
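
Note that bsObj.title returns the <title> element of the page, while the question asks for the text of the p tag with class="title". Below is a minimal sketch of that step, not necessarily the answer the exam expected. The helper names fetch_html and extract_title_paragraphs, the timeout value, and the shortened sample string are assumptions made for illustration; the calls themselves (requests.get, find_all with class_, get_text) are standard requests and BeautifulSoup APIs.

```python
import requests
from bs4 import BeautifulSoup


def fetch_html(url, timeout=10):
    """Download a page and return its HTML text (hypothetical helper)."""
    response = requests.get(url, timeout=timeout)
    response.raise_for_status()  # raise on 4xx/5xx instead of parsing an error page
    return response.text


def extract_title_paragraphs(html):
    """Return the text of every <p> tag whose class includes "title"."""
    soup = BeautifulSoup(html, "html.parser")
    # class_ is used because "class" is a reserved word in Python
    return [p.get_text(strip=True) for p in soup.find_all("p", class_="title")]


# Shortened stand-in for the page used in the post (assumed sample data)
sample = (
    '<body><p class="title" name="dromouse"><b>The Dormouse\'s story</b></p>'
    '<p class="story">...</p></body>'
)

titles = extract_title_paragraphs(sample)
print(titles)
```

With a real page, the two helpers would be chained as extract_title_paragraphs(fetch_html(url)), where url is whatever address the question supplies. Returning a list rather than a single string keeps the sketch usable when a page has several p tags with class="title" or none at all.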