关于selenium中text返回空值的原因

最新推荐文章于 2024-01-02 11:17:12 发布

Zzers

最新推荐文章于 2024-01-02 11:17:12 发布

阅读量4.1k

点赞数 17

文章标签： selenium python

本文链接：https://blog.csdn.net/JustZzer/article/details/111355510

版权

关于selenium中text返回空值的原因

这几天闲来无事，在做爬虫的过程中遇到了一个小问题，获取值的xpath正确，可以正常获取到标签属性。但是获取文本的时候却获取到了空值。
错误代码如下：

from selenium import webdriver
Driver = webdriver.Chrome(executable_path='D:\ALL_WorkSpace\PycharmProject\Selenium_Study\chromedriver.85.0.4183.87')
Driver.get("**网站**")
Informations = Driver.find_elements_by_xpath("//div[@class='s-main-slot s-result-list s-search-results sg-row']/div[@data-uuid]")
for info in Informations:
    Price = info.find_element_by_xpath(".//a/span[@class='a-price']/span").text
    print(Price+"----")

错误例图：
在这里插入图片描述
如果将值存入列表方便观看就类似于—>["","","",""]

解决方法：

将info.find_element_by_xpath(".//a/span[@class='a-price']/span").text改成info.find_element_by_xpath(".//span[@class='a-offscreen']").get_attribute('textContent')就可以正常获取到标签里的文本了。

原因：.text的适用范围的问题
requests对象的get和post方法都会返回一个Response对象，这个对象里面存的是服务器返回的所有信息，包括响应头，响应状态码等。其中返回的网页部分会存在.content和.text两个对象中。

两者区别在于，content中间存的是字节码，而text中存的是Beautifulsoup根据猜测的编码方式将content内容编码成字符串。

所以简而言之，.text是现成的字符串，.content还要编码，但是.text不是所有时候显示都正常，这是就需要用.content进行手动编码。

更详细的解释参考：https://zhidao.baidu.com/question/941417472703558372.html

Zzers

关注

17
点赞
踩
26

收藏

觉得还不错? 一键收藏
5
评论
关于selenium中text返回空值的原因

https://blog.csdn.net/qq_42804678/article/details/91345725?ops_request_misc=%25257B%252522request%25255Fid%252522%25253A%252522160826145016780277868650%252522%25252C%252522scm%252522%25253A%25252220140713.130102334.pc%25255Fall.%252522%25257D&request_id=16
复制链接

扫一扫