python如何进行xpath解析

最新推荐文章于 2022-04-20 16:21:56 发布

代码演奏家

最新推荐文章于 2022-04-20 16:21:56 发布

阅读量540

点赞数

文章标签： python xpath html

本文链接：https://blog.csdn.net/weixin_45312417/article/details/106011681

版权

基本代码

1.头文件

from lxml import etree

解析html字符串，使用的lxml.etree.HTML进行解析

htmlElement = etree.HTML(tengxun)
print(htmlElement.xpath("//div/a/h4/text()"))# 打印岗位信息
print(etree.tostring(htmlElement, encoding='utf-8').decode('utf-8')) # 打印整个html内容

解析标准的html文件，使用的是lxml.etree.parse进行解析
本例的拉钩是不标准的，腾讯是标准的html 结构

htmlElement = etree.parse("lagou.html")
<

最低0.47元/天解锁文章

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

代码演奏家

关注关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
python如何进行xpath解析

基本代码1.头文件from lxml import etree解析html字符串，使用的lxml.etree.HTML进行解析htmlElement = etree.HTML(tengxun)print(htmlElement.xpath("//div/a/h4/text()"))# 打印岗位信息print(etree.tostring(htmlElement, encoding='utf-8').decode('utf-8')) # 打印整个html内容解析标准的html文件，使
复制链接

扫一扫