selenium用过的方法

最新推荐文章于 2024-07-11 09:08:58 发布

锅炉房刘大爷

最新推荐文章于 2024-07-11 09:08:58 发布

阅读量237

点赞数

分类专栏： python3 文章标签： selenium

本文链接：https://blog.csdn.net/u012067766/article/details/109069208

版权

python3 专栏收录该内容

28 篇文章 1 订阅

订阅专栏

本文详细介绍了如何利用Selenium进行网页元素的定位、属性获取及文本内容读取，包括设置Chrome浏览器选项、处理页面加载超时、设置headless模式等。此外，还分享了在遇到页面加载缓慢时停止加载页面的方法，为自动化测试提供了实用技巧。

摘要由CSDN通过智能技术生成

通过xpath获取某个element：

img_element = driver.find_element_by_xpath('//img[@class="a-image-img"]')

获取element的某个属性值：

img_url = img_element.get_attribute('data-src')

获取element的文本内容：

content = img_element.text

driver的一些设置：

options = webdriver.ChromeOptions()
# 设置为开发者模式，防止网站识别
options.add_experimental_option('excludeSwitches', ['enable-automation'])
options.add_experimental_option("useAutomationExtension", False)
# 加载驱动程序
# driver = webdriver.Chrome(executable_path='./chromedriver', options=options)

driver = webdriver.Chrome(options=options)
driver.execute_cdp_cmd("Page.addScriptToEvaluateOnNewDocument", {
            "source": """
                Object.defineProperty(navigator, 'webdriver', {
                  get: () => undefined
                })
              """
        })

# 设置页面超时时间
driver.set_page_load_timeout(60)
driver.set_script_timeout(60)

driver设置headless：

cp = Options()
cp.add_argument('--headless')
# linux下需要加下面两个参数才能正常运行Chrome
cp.add_argument('--disable-gpu')
cp.add_argument('--no-sandbox')
driver = webdriver.Chrome(options=cp)

driver退出：

driver.close()
driver.quit()

WebDriverWait，这里发给别人的总结：here

停止加载的方法：

有时请求一个页面，由于部分资源响应慢，selenium可能会报timeout的错误，此时可以用下面的方法停止加载：

try:
    driver.get(url)
except Exception as e:
    driver.execute_script('window.stop ? window.stop() : document.execCommand("Stop");')

参考：here

锅炉房刘大爷

关注

0
点赞
踩
1

收藏

觉得还不错? 一键收藏
0
评论
复制链接

分享到 QQ

分享到新浪微博

扫一扫

专栏目录