记录一次反selenium爬虫经历

最新推荐文章于 2023-11-08 16:13:59 发布

super_admin_123

最新推荐文章于 2023-11-08 16:13:59 发布

阅读量466

点赞数

文章标签：爬虫 selenium python

本文链接：https://blog.csdn.net/l15031138244/article/details/127890126

版权

1.起因：使用selenium爬取某网站，第一次爬取成功了，时隔半个月在执行脚本发现翻页不好使，打开控制台发现几个错误，然后使用默认谷歌浏览器打开就没问题，猜想是反爬虫了。

2.解决方案：

options = webdriver.ChromeOptions()
#使用chrome开发者模式
options.add_argument("--disable-blink-features=AutomationControlled")
#禁用启用Blink运行时的功能
options.add_argument("--disable-blink-features=AutomationControlled")
#Selenium执行cdp命令
driver = webdriver.Chrome(options=options)
    driver.execute_cdp_cmd("Page.addScriptToEvaluateOnNewDocument", {
        "source": """
                    Object.defineProperty(navigator, 'webdriver', {
                      get: () => undefined
                    })
                  """
    })

按照如上设置再次执行脚本则可以继续访问了。

参考文献：selenium被识别的解决方法_HelloW先生的博客-CSDN博客_selenium爬虫被识别

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

super_admin_123

关注关注

0
点赞
踩
2

收藏

觉得还不错? 一键收藏
0
评论
记录一次反selenium爬虫经历

1.起因：使用selenium爬取某网站，第一次爬取成功了，时隔半个月在执行脚本发现翻页不好使，打开控制台发现几个错误，然后使用默认谷歌浏览器打开就没问题，猜想是反爬虫了。按照如上设置再次执行脚本则可以继续访问了。
复制链接

扫一扫