Inverted Chinese text in the Python editor: how to handle Zhihu's inverted-Chinese-character captcha with Python plus manual recognition...

Latest recommended article published on 2021-02-21 16:43:06

weixin_39856589


# Log in to Zhihu by saving the captcha image locally and typing the answer in by hand
import urllib.request
import urllib.parse
import time
import http.cookiejar

# Do not use https://www.zhihu.com/#signin here, because redirects are not supported
webUrl = "https://www.zhihu.com/login/email"

# Request headers; only the mobile Chrome User-Agent is active, the rest are kept for reference
webheader = {
    # 'Accept': 'text/html, application/xhtml+xml, */*',
    # 'Accept-Language': 'zh-CN',
    # 'User-Agent': 'Mozilla/5.0 (Windows NT 6.1; WOW64; Trident/7.0; rv:11.0) like Gecko',
    'User-Agent': 'Mozilla/5.0 (Linux; Android 6.0; Nexus 5 Build/MRA58N) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/56.0.2924.87 Mobile Safari/537.36',
    # 'User-Agent': 'Mozilla/5.0 (iPod; U; CPU iPhone OS 4_3_3 like Mac OS X; en-us) AppleWebKit/533.17.9 (KHTML, like Gecko) Version/5.0.2 Mobile/8J2 Safari/6533.18.5',
    # 'DNT': '1',
    # 'Connection': 'Keep-Alive'
}

# Form fields sent with the login request; '_xsrf' and 'captcha' are filled in at run time
postData = {
    'email': 'put your account here',
    'captcha_type': 'cn',
    'password': 'put your password here',
    '_xsrf': '',
    'captcha': ''
}

localStorePath = "directory where you want to save the captcha image"

if __name__ == '__main__':
    # Create a CookieJar instance to hold the session cookies
    cookie = http.cookiejar.CookieJar()
    # Build a cookie-aware opener and install it globally, so that the
    # urlopen() calls below all share the same cookie jar
    handler = urllib.request.HTTPCookieProcessor(cookie)
    opener = urllib.request.build_opener(handler)
    urllib.request.install_opener(opener)

    captcha_url = 'https://www.zhihu.com/captcha.gif?r=%d&type=login&lang=cn' % (time.time() * 1000)
    # Without lang=cn, the server returns a "letters + digits" captcha instead:
    # captcha_url = 'http://www.zhihu.com/captcha.gif?r=%d&type=login' % (time.time() * 1000)

    # Fetching the captcha image this way does not work!
    # urllib.request.urlretrieve(captcha_url, localStorePath + 'myCaptcha.gif')

    # Download the captcha image with urlopen and save it to disk
    req = urllib.request.Request(url=captcha_url, headers=webheader)
    content = urllib.request.urlopen(req)
    # content = opener.open(req)
    captcha_name = 'D:/Python学习/crawler_learning/知乎登录专题研究/知乎验证码图片/myNewCaptcha.gif'
    content = content.read()
    with open(captcha_name, 'wb') as f:
        f.write(content)

    # Open the saved image, read the captcha yourself, and type the answer in
    postData['captcha'] = input('Enter the captcha: ')
    # get_xsrf() is not defined in this script, so a hard-coded token is used instead
    # postData['_xsrf'] = get_xsrf()
    postData['_xsrf'] = 'fa5ae712244bd4287e371801052003fc'
    print(postData['_xsrf'])

    # Send the form data to the server with urlopen to log in
    postData_encoded = urllib.parse.urlencode(postData).encode('utf-8')
    req = urllib.request.Request(url=webUrl, data=postData_encoded, headers=webheader)
    webPage = urllib.request.urlopen(req)
    # webPage = opener.open(req)
    data = webPage.read().decode('utf-8')
    print(data)

    # Keep a copy of the server's response for inspection
    with open("D:/知乎服务器反馈的内容.txt", mode='w', encoding='utf-8') as dataFile:
        dataFile.write(data)
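
The script calls `get_xsrf()` only in a commented-out line and never defines it, falling back to a hard-coded token. The following is a minimal sketch of what such a helper might look like, assuming the token is exposed as an `<input name="_xsrf" value="...">` element in the page HTML; that markup, the default URL, and the names `XsrfParser` and `extract_xsrf` are assumptions for illustration, not something the original code confirms.

```python
import urllib.request
from html.parser import HTMLParser


class XsrfParser(HTMLParser):
    """Records the value of the first <input name="_xsrf"> element seen."""

    def __init__(self):
        super().__init__()
        self.token = None

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs for the current tag
        if tag == "input" and self.token is None:
            attr_map = dict(attrs)
            if attr_map.get("name") == "_xsrf":
                self.token = attr_map.get("value")


def extract_xsrf(html):
    """Return the _xsrf token found in the HTML text, or None if absent."""
    parser = XsrfParser()
    parser.feed(html)
    return parser.token


def get_xsrf(url="https://www.zhihu.com/", headers=None):
    """Hypothetical helper: fetch a page and pull the token out of it.

    Uses urllib.request.urlopen, so it goes through whatever opener was
    installed with install_opener() and shares its cookie jar.
    """
    req = urllib.request.Request(url=url, headers=headers or {})
    with urllib.request.urlopen(req) as resp:
        return extract_xsrf(resp.read().decode("utf-8", errors="replace"))
```

If the assumption about the markup holds, `postData['_xsrf'] = get_xsrf(headers=webheader)` could replace the hard-coded value. Splitting out `extract_xsrf` keeps the parsing step checkable against a saved page without touching the network.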
