Using XPath in Python to get the content of web page tags

darling331

Tags: python html xpath js selenium

Article link: https://blog.csdn.net/brightgreat/article/details/124263968

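Getting the content of a specific HTML tag

Open the browser's developer tools, copy the XPath of the target element, and append /text() to it to get the element's text content. For example, the copied path //*[@id="sonsyuanwen"]/div[1]/h1 becomes //*[@id="sonsyuanwen"]/div[1]/h1/text().

This is very convenient for web scraping.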
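To show what appending /text() changes without depending on a live page, here is a minimal sketch that parses an inline HTML string. The HTML fragment and the variable names are made up for illustration; only the XPath pattern comes from the text above.

```python
from lxml import etree

# Hypothetical HTML fragment standing in for a real page (an assumption for this sketch).
SAMPLE_HTML = """
<html><body>
  <div id="sonsyuanwen">
    <div><h1>Sample Title</h1></div>
  </div>
</body></html>
"""

tree = etree.HTML(SAMPLE_HTML)

# Without /text(), xpath() returns a list of element objects.
elements = tree.xpath('//*[@id="sonsyuanwen"]/div[1]/h1')

# With /text(), xpath() returns a list of strings (the element's direct text nodes).
texts = tree.xpath('//*[@id="sonsyuanwen"]/div[1]/h1/text()')

# The result is always a list, so guard against an empty match before indexing.
title = texts[0].strip() if texts else None
print(elements[0].tag, title)
```

Because xpath() returns a list even for a single match, checking for an empty result avoids an IndexError when the page layout changes.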

# -*- coding: utf-8 -*-
# @ModuleName: test005
# @Function: 
# @Author: darling
# @Time: 2022-04-18 13:58

import requests
from lxml import etree


def get_url():
    # Download the page
    resource = requests.get('https://so.gushiwen.cn/shiwenv_444df93c9bdf.aspx')
    # Parse the HTML text into an element tree
    html = etree.HTML(resource.text)
    # xpath() returns a list; /text() selects the text nodes of the matched element
    title = html.xpath('//*[@id="sonsyuanwen"]/div[1]/h1/text()')
    neir = html.xpath('//*[@id="contson444df93c9bdf"]/text()')
    print(title, neir)
    return resource


if __name__ == "__main__":
    res = get_url()
    # Prints the Response object's repr, which shows the HTTP status code
    print(res)
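One detail worth noting about the script above: /text() returns only the direct text nodes of the matched element, one string per fragment, so an element whose body is split by tags such as <br> comes back as several list items, and text inside nested tags is skipped. The sketch below, using a made-up HTML fragment and a hypothetical id="content", shows two common ways to handle this: cleaning the list of fragments, or using the XPath string() function to collect all descendant text.

```python
from lxml import etree

# Hypothetical fragment (an assumption): body text split by <br> plus a nested <span>.
SAMPLE_HTML = """
<div id="content">
  first line<br>
  second line<br>
  <span>nested text</span>
</div>
"""

tree = etree.HTML(SAMPLE_HTML)

# /text() yields only the direct child text nodes, including whitespace-only ones.
parts = tree.xpath('//*[@id="content"]/text()')

# Strip each fragment and drop the empty ones to get clean lines.
lines = [p.strip() for p in parts if p.strip()]

# string(...) concatenates all descendant text, including text in nested elements.
full = tree.xpath('string(//*[@id="content"])')

print(lines)
print(full.split())
```

Which form fits better depends on the page: the fragment list preserves line boundaries, while string() is simpler when nested markup should be included.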
