python爬取网站验证码并保存

最新推荐文章于 2023-03-10 11:26:12 发布

coolsunxu

最新推荐文章于 2023-03-10 11:26:12 发布

阅读量3.4k

点赞数

分类专栏： Python 文章标签： python selenium 验证码

本文链接：https://blog.csdn.net/coolsunxu/article/details/80956624

版权

Python 专栏收录该内容

55 篇文章 2 订阅

订阅专栏

from selenium import webdriver
from PIL import Image
import pytesseract

driver=webdriver.Firefox()
driver.get('网址')
driver.implicitly_wait(10)

for i in range(0,10):
	driver.find_element_by_id('captchaImg').click()
	driver.save_screenshot(r'E:\code_full.png')
	href=driver.find_element_by_xpath('//*[@id="captchaImg"]')
	left = href.location['x']
	top = href.location['y']
	elementWidth = href.location['x'] + href.size['width']
	elementHeight = href.location['y'] + href.size['height']
	picture = Image.open(r'E:\code_full.png')
	picture = picture.crop((left, top, elementWidth, elementHeight))
	picture.save(r'E:\\'+str(i)+'.png')

注意xpath和id,name需要根据自己要爬的网站结构进行编写