Python Web Scraping: Getting the Child Tags Under a Tag

weixin_33755554

Tags: python, web scraping, development tools

Original link: http://www.cnblogs.com/littlebob/p/9219841.html


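import re

# `soup` is assumed to be a BeautifulSoup object already built from the page's HTML.
# Match every <div> whose class attribute contains "msg".
thr_msgs = soup.find_all('div', class_=re.compile('msg'))

for i in thr_msgs:
    print(i)
    # Select the first <em> inside this div; select() always returns a list.
    first = i.select('em:nth-of-type(1)')
    print(first)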



>>>

<div class='\"msg\"'><em>佛山</em><em>1-3年</em><em>大专</em></div>
[<em>佛山</em>]
<div class='\"msg\"'><em>南京</em><em>3-5年</em><em>本科</em></div>
[<em>南京</em>]
<div class='\"msg\"'><em>南阳</em><em>1-3年</em><em>大专</em></div>
[<em>南阳</em>]
<div class='\"msg\"'><em>深圳</em><em>1年以内</em><em>本科</em></div>
[<em>深圳</em>]

>>>
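For readers who want to try this without the original page, here is a minimal, self-contained sketch. The `html` string is hypothetical sample markup modeled on the output above (with a plain `class="msg"` attribute), and the variable names `rows` and `first_cities` are only illustrative. It assumes `beautifulsoup4` is installed and uses the built-in `html.parser`.

```python
import re
from bs4 import BeautifulSoup

# Hypothetical sample markup, modeled on the output shown above.
html = """
<div class="msg"><em>佛山</em><em>1-3年</em><em>大专</em></div>
<div class="msg"><em>南京</em><em>3-5年</em><em>本科</em></div>
"""

soup = BeautifulSoup(html, "html.parser")
msg_divs = soup.find_all("div", class_=re.compile("msg"))

# Collect the text of every direct <em> child of each div.
rows = []
for div in msg_divs:
    ems = div.find_all("em", recursive=False)  # direct children only
    rows.append([em.get_text(strip=True) for em in ems])

# select_one() returns the first matching element itself (or None),
# whereas select() returns a list of matches.
first_cities = [
    div.select_one("em:nth-of-type(1)").get_text(strip=True)
    for div in msg_divs
]

print(rows)
print(first_cities)
```

When only one child is needed, `select_one()` avoids indexing into the list that `select()` returns. When all children are needed, `find_all(..., recursive=False)` restricts the search to direct children instead of all descendants.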

The code can be downloaded from my GitHub: https://github.com/FightingBob/-Web-Crawler-training. If you find it useful, please give it a star. Thanks!

Reposted from: https://www.cnblogs.com/littlebob/p/9219841.html
