Python 网络爬虫权威指南 (Web Scraping with Python), Section 2.2: Selecting Tags

学技术的翻译小白

Category: Web scraping. Tags: python

Article link: https://blog.csdn.net/Laurencenter/article/details/112098537

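This article explains techniques for selecting tags in Python web scraping, including how to use XPath and CSS selectors to extract page data efficiently, and discusses selection strategies and practical examples for pages with complex structure.

Summary generated automatically by CSDN.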

from urllib.request import urlopen
from bs4 import BeautifulSoup

html = urlopen('http://www.pythonscraping.com/pages/warandpeace.html')
bs = BeautifulSoup(html.read(), 'html.parser')

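# find_all() returns a list of every tag that matches
# Assumption: the source line is cut off after `bs`; the call below is a likely
# completion that selects the <span class="green"> tags on this page
namelist = bs.find_all('span', {'class': 'green'})
for name in namelist:
    # get_text() strips the markup and keeps only the text inside the tag
    print(name.get_text())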
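
To try find_all() without a network connection, here is a minimal sketch that runs the same kind of selection on an inline HTML string. The SAMPLE_HTML snippet, the green_names helper, and the assumption that names sit inside <span class="green"> tags are all illustrative and not taken from the original post.

```python
from bs4 import BeautifulSoup

# Hypothetical stand-in for the downloaded page (not the real page content)
SAMPLE_HTML = """
<html><body>
<p><span class="red">Some quoted dialogue goes here.</span>
<span class="green">Anna Pavlovna</span> greeted
<span class="green">Prince Vasili</span>.</p>
</body></html>
"""


def green_names(html):
    """Return the text of every <span class="green"> tag in the given HTML."""
    soup = BeautifulSoup(html, 'html.parser')
    # First argument: tag name; second: a dict of attributes the tag must have
    tags = soup.find_all('span', {'class': 'green'})
    # find_all() yields Tag objects in document order; keep only their text
    return [tag.get_text() for tag in tags]


names = green_names(SAMPLE_HTML)
print(names)
```

When nothing matches, find_all() returns an empty list rather than raising an error, so the result can be looped over safely without a prior check.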