python构建随机ip请求头以及headers
上文我们提到已经搭建了一个 ip池了,那么我们该如何调用呢
获取随机ip
方法一(mysql)
如果你把ip存取到了数据库(mysql等),你可以调用这个方法
def getip():
db = pymysql.connect(
host="****",#数据库
port=3306,
user="****",#用户
passwd="****",#密码
db="****",#表名
charset='utf8'
)
cursor= db.cursor()
sql='SELECT * FROM IP地址 AS t1 JOIN (SELECT ROUND(RAND() * ((SELECT MAX(useid) FROM `IP地址`)-(SELECT MIN(useid) FROM IP地址))+(SELECT MIN(useid) FROM IP地址)) AS useid) AS t2 WHERE t1.useid >= t2.useid ORDER BY t1.useid LIMIT 1'
#IP地址是你存取IP地址的表
cursor.execute(sql)
results = cursor.fetchall()
db.commit()
return results[0][2]
db.close()
方法二(文档)
如果你把ip存取到了文档(可以从数据库导出),你可以调用这个方法
def getipd():
workbook = xlrd.open_workbook("ip地址.xlsx")
worksheet = workbook.sheet_by_index(0)
ncols = worksheet.cell_value(random.randint(*, ****), 2)#**代表你要调用的范围,具体你得看你文档数据分布
return ncols
获取随机headers
这里只是列出几个随机headers,你也可以加多几个进去
def getheaders():
userAgentList = [
'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/70.0.3538.77 Safari/537.36',
'Mozilla/5.0 (Windows NT 6.2; WOW64) AppleWebKit/537.36 (KHTML like Gecko) Chrome/44.0.2403.155 Safari/537.36',
'Mozilla/5.0 (Windows NT 6.1) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/41.0.2228.0 Safari/537.36',
'Mozilla/5.0 (Macintosh; Intel Mac OS X 10_10_1) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/41.0.2227.1 Safari/537.36',
'Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/41.0.2227.0 Safari/537.36',
'Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/41.0.2227.0 Safari/537.36',
'Mozilla/5.0 (Windows NT 6.3; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/41.0.2226.0 Safari/537.36',
'Mozilla/5.0 (Windows NT 6.4; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/41.0.2225.0 Safari/537.36',
'Mozilla/5.0 (Windows NT 6.3; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/41.0.2225.0 Safari/537.36',
'Mozilla/5.0 (Windows NT 5.1) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/41.0.2224.3 Safari/537.36',
'Mozilla/5.0 (Windows NT 10.0) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/40.0.2214.93 Safari/537.36',
'Mozilla/5.0 (Macintosh; Intel Mac OS X 10_10_1) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/37.0.2062.124 Safari/537.36',
'Mozilla/5.0 (Windows NT 6.3; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/37.0.2049.0 Safari/537.36',
'Mozilla/5.0 (Windows NT 4.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/37.0.2049.0 Safari/537.36',
'Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/36.0.1985.67 Safari/537.36',
'Mozilla/5.0 (Windows NT 5.1) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/36.0.1985.67 Safari/537.36',
'Mozilla/5.0 (X11; OpenBSD i386) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/36.0.1985.125 Safari/537.36'
]
return random.choice(userAgentList)
重点
现在两个方法写好了,那么该如何调用
response = requests.get(网址,headers={'User-Agent':getheaders()},proxies={'http':getipd()})
用这个方法就可以构造随机ip请求头了