import requests import re url = 'http://www.heibanke.com/lesson/crawler_ex01/' n = 0 for i in range(30): n = i+1 headers = { 'Host': 'www.heibanke.com', 'Referer': 'http: // www.heibanke.com / lesson / crawler_ex01 /', 'Content - Type': 'application / x - www - form - urlencoded', } form_data ={'csrfmiddlewaretoken': 'NbRHuYRzAxSf8xvGGVWevYuYY6zDA2L9', 'username':'', 'password': n} results = requests.post(url, data=form_data, headers=headers).text getdata = re.findall(r'<h3>(.+)</h3>', results) print(n, '\n', getdata)
运行结果:
........
26
['您输入的密码错误, 请重新输入']
27
['恭喜! 用户成功闯关, 继续你的爬虫之旅吧']
28
['您输入的密码错误, 请重新输入']
.........