python3验证码识别,借助于AI来实现验证码识别,内含python3示例

本帖最后由 derain 于 2020-4-28 10:02 编辑

借助于AI工具来实现验证码识别,内含python3示例

验证码识别的场景十分常见

本文主要讨论作为普通开发者(缺乏/没有Ai学术(教育/实践)背景)的前提下,来低成本快速实现验证码识别

本次测试的验证码主要有两种

1.   无干扰的纯数字验证码

55fd2b2273b5a8b4531f72773c469d6e.gif

LoginImg.jpg (1.17 KB, 下载次数: 1)

2020-4-28 09:58 上传

2.    有干扰的数字加字母验证码

55fd2b2273b5a8b4531f72773c469d6e.gif

b.png (5.78 KB, 下载次数: 1)

2020-4-28 09:59 上传

1.百度AI大脑https://ai.baidu.com/tech/ocr/general

55fd2b2273b5a8b4531f72773c469d6e.gif

2.png (266.24 KB, 下载次数: 0)

2020-4-28 09:51 上传

55fd2b2273b5a8b4531f72773c469d6e.gif

2.png (279.24 KB, 下载次数: 0)

2020-4-28 09:52 上传

下边我用python3来示例在https://console.bce.baidu.com/ai/?fromai=1#/ai/ocr/app/list这里新建应用

55fd2b2273b5a8b4531f72773c469d6e.gif

2.png (39.4 KB, 下载次数: 0)

2020-4-28 09:52 上传

记录appid, apikey, secret key

import requests

import base64

import shortuuid

from pprint import pprint

#填上自己的app 信息

appid = ""

key = ""

secret = ""

def Token():

host = 'https://aip.baidubce.com/oauth/2.0/token?grant_type=client_credentials&client_id={}&client_secret={}'.format(key, secret)

response = requests.get(host)

# if response:

# pprint(response.json())

return response.json()['access_token']

token =  Token()

request_url = "https://aip.baidubce.com/rest/2.0/ocr/v1/general_basic"

f = open('./code/code.png', 'rb')

img = base64.b64encode(f.read())

params = {"image":img,"language_type":"CHN_ENG"}

# access_token = '[调用鉴权接口获取的token]'

request_url = request_url + "?access_token=" + token

headers = {'content-type': 'application/x-www-form-urlencoded'}

response = requests.post(request_url, data=params, headers=headers)

pprint (response.json())

2    腾讯AIhttps://ai.qq.com/product/ocr.shtml#common

55fd2b2273b5a8b4531f72773c469d6e.gif

2.png (467.57 KB, 下载次数: 0)

2020-4-28 09:54 上传

55fd2b2273b5a8b4531f72773c469d6e.gif

2.png (351.97 KB, 下载次数: 0)

2020-4-28 09:54 上传

腾讯ocr示例在这里新建应用https://ai.qq.com/console/application/create-app

55fd2b2273b5a8b4531f72773c469d6e.gif

2.png (32.04 KB, 下载次数: 0)

2020-4-28 09:55 上传

55fd2b2273b5a8b4531f72773c469d6e.gif

2.png (42.26 KB, 下载次数: 0)

2020-4-28 09:55 上传

记录以上app信息 APP_ID,APP_Key

import base64, hashlib, json, random, string, time

from urllib import parse

import requests

from pprint import pprint

# 填写app信息

app_id = ""

app_key = ""

def GetAccessToken(formdata, app_key):

dic = sorted(formdata.items(), key=lambda d: d[0])

sign = parse.urlencode(dic) + '&app_key=' + app_key

m = hashlib.md5()

m.update(sign.encode('utf8'))

return m.hexdigest().upper()

def RecogniseGeneral(app_id, time_stamp, nonce_str, image, app_key):

host = 'https://api.ai.qq.com/fcgi-bin/ocr/ocr_generalocr'

formdata = {'app_id': app_id, 'time_stamp': time_stamp, 'nonce_str': nonce_str, 'image': image}

app_key = app_key

sign = GetAccessToken(formdata=formdata, app_key=app_key)

formdata['sign'] = sign

try:

r = requests.post(url=host, data=formdata, timeout=20)

except requests.exceptions.ReadTimeout:

r = requests.post(url=host, data=formdata, timeout=20)

if (r.status_code == 200):

return r.json()

else:

print(r.text)

def Recognise(img_path):

with open(file=img_path, mode='rb') as file:

base64_data = base64.b64encode(file.read())

nonce = ''.join(random.sample(string.digits + string.ascii_letters, 32))

stamp = int(time.time())

recognise = RecogniseGeneral(app_id=app_id, time_stamp=stamp, nonce_str=nonce, image=base64_data,

app_key=app_key)

# for k, v in recognise.items():

#     print(k, v)

return recognise

img_path = "./code/code.png"

response = Recognise(img_path)

pprint(response)

code = response['data']['item_list'][0]['itemstring'].replace(" ", "")

print(code)

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值