Python爬虫包 BeautifulSoup 学习（七） children等应用

最新推荐文章于 2024-08-20 14:41:57 发布

SuPhoebe

最新推荐文章于 2024-08-20 14:41:57 发布

阅读量9.2k

点赞数 2

分类专栏： Python & Django开发文章标签： bs4 爬虫

本文链接：https://blog.csdn.net/u013007900/article/details/54630489

版权

本文介绍了PythonBeautifulSoup库中处理HTML元素子节点的方法，包括.tag.contents用于获取子节点列表，.tag.children遍历直接子节点，.tag.descendants遍历所有后代节点。同时，讲解了.tag.string属性用于获取单一文本子节点，以及.strings和.stripped_strings用于处理多个字符串子节点并去除空白内容。

摘要由CSDN通过智能技术生成

所使用的html为：

html_doc = """ 
<html>
<head><title>The Dormouse's story</title></head> 
<p class="title"><b>The Dormouse's story</b></p> 
<p class="story">Once upon a time there were three little sisters; and their names were 
<a href="http://example.com/elsie" class="sister" id="link1"><