python selenium判断网页是否包含关键字_Python和Selenium-获取不包括子节点文本的文本...

最新推荐文章于 2024-07-06 02:54:23 发布

weixin_39734987

最新推荐文章于 2024-07-06 02:54:23 发布

阅读量838

点赞数

文章标签： python selenium判断网页是否包含关键字

在Python3中使用web scraping时，如果遇到包含子节点的元素，如何只提取直接文本而不包括子节点的内容？原始HTML结构为'VIVEGRPNHen,la.'解决方案是通过删除子节点文本，从所有文本中提取直接父节点的文本。

摘要由CSDN通过智能技术生成

Using Python 3.

Supposing:

text

other

If I do:

elem = driver.find_element_by_xpath("//whatever")

elem.text contains "text other"

If I do:

elem = driver.find_element_by_xpath("//whatever/text()[normalize-space()]")

elem is not Webelement.

How my I proceed to grab only "text" (and not "other")?

Id est: grab only text in direct node, not the child nodes.

UPDATE:

Original HTML is:

VIVEGRPN

Hen, la.

解决方案

You can remove the child node text from the all text

all_text = driver.find_element_by_xpath("//whatever").text

child_text = driver.find_element_by_xpath("//subchild").text

parent_text = all_text.replace(child_text, '')

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

关注关注