用xpath出现Element 一堆字符怎么办？ python

最新推荐文章于 2022-08-01 19:59:26 发布

远啊远学python

最新推荐文章于 2022-08-01 19:59:26 发布

阅读量3.9k

点赞数 2

分类专栏：积累文章标签： python html 正则表达式 xpath

本文链接：https://blog.csdn.net/dsfgdgsdf/article/details/104544835

版权

积累专栏收录该内容

0 篇文章 0 订阅

订阅专栏

print()打印之后出现这样的字符
[<Element p at 0x10263c300>, <Element p at 0x101562940>, <Element p at 0x1014d2fc0>, <Element p at 0x102669e40>, <Element p at 0x102669e80>, <Element p at 0x1025abc40>, <Element p at 0x101d83a80>, <Element p at 0x102684580>, <Element p at 0x1026845c0>, <Element p at 0x101d86fc0>, <Element p at 0x102684540>, <Element p at 0x102684640>, <Element p at 0x102684680>, <Element p at 0x1026846c0>, <Element p at 0x102684700>, <Element p at 0x102684780>, <Element p at 0x1026847c0>, <Element p at 0x102684800>, <Element p at 0x102684840>, <Element p at 0x102684880>, <Element p at 0x1026848c0>, <Element p at 0x102684600>, <Element p at 0x102684900>, <Element p at 0x102684940>, <Element p at 0x102684980>, <Element p at 0x1026849c0>, <Element p at 0x102684a00>, <Element p at 0x102684a40>, <Element p at 0x102684a80>, <Element p at 0x102684ac0>, <Element p at 0x102684b00>, <Element p at 0x102684740>, <Element p at 0x102684b80>, <Element p at 0x102684bc0>]

晚上遇到用xpath清洗数据时候，一直出现这样的数据，看着好像没解码，但是加上.text和decode（）都不行

# 解析得到的信息
resq = requests.get(url, headers=headers).text

html = etree.HTML(resq)

result = html.xpath('//div//p')

print(result)

最后求救别人才得到解决方案：

xpath('//div//p'后边要加上‘/text()’

改成这样就行了：

# 解析得到的信息
resq = requests.get(url, headers=headers).text

html = etree.HTML(resq)

result = html.xpath('//div//p/text()')

print(result)

远啊远学python

关注

2
点赞
踩
12

收藏

觉得还不错? 一键收藏
3
评论
复制链接

分享到 QQ

分享到新浪微博

扫一扫

专栏目录