python+selenium+PIL自动登陆网站

最新推荐文章于 2023-04-19 19:48:41 发布

帅气的搬砖工

最新推荐文章于 2023-04-19 19:48:41 发布

阅读量437

点赞数 1

分类专栏：爬虫 python 文章标签： python selenium PIL pytesseract 自动登陆

本文链接：https://blog.csdn.net/snail_youth/article/details/92687312

版权

本文记录了使用Python结合selenium和PIL库，完全模拟手动登录网站的过程，包括两种方式：直接写入cookie和动态模拟操作。虽然写入cookie的方式可能因cookie变化导致登录失败，但完整模拟登录则能更稳定地完成任务。

摘要由CSDN通过智能技术生成

近期要爬取一个网站的数据，嗯？需要登陆才能爬取，那怎么办呢？突然灵光一闪，百度了一下发现python+selenium+PIL可以解决这个问题，为了以后需要使用的时候能给做到有资料可查，在这里就做下简单的记录吧！

一、写入cookie的形式

这种方式有个弊端，就是可能标识的cookie会变，在下次登陆中不能登陆成功。

from selenium import webdriver
#引入selenium模块
opt = webdriver.ChromeOptions()
opt.set_headless()
#设置不在前台打开chrome浏览器
driver = webdriver.Chrome('G:/py_2019\Reptile/Reptile001/chrome/chromedriver.exe',options=opt)
#使用chrome引擎，并指定chromedriver所在位置
driver.maximize_window()
#chrome浏览器窗口最大化
cookies1 = {'httpOnly': True, 'path': '/', 'secure': False, 'name': 'JSESSIONID', 'domain': 'www.xxxxx.org', 'value': 'xxxxxxxxxxxxxxxxxxxx'}
cookies2 = {'httpOnly': False, 'name': 'loginname', 'path': '/', 'secure': False, 'expiry': 1