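Key points (environment: Python 3.5; extra libraries: bs4 and requests):
1. Get IP addresses and port numbers from a free proxy list: http://www.xicidaili.com/nn/
2. Use a proxy: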
import urllib.request
proxy_support = urllib.request.ProxyHandler({'http': 'ip:port'})
opener = urllib.request.build_opener(proxy_support)
urllib.request.install_opener(opener)
3. Use urlopen to test whether each IP works: http://ip.chinaz.com/getip.aspx
4. Return a list of dicts, each holding a usable IP and port
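Steps 2 to 4 can be sketched roughly as follows. This is a minimal illustration, not the code from this post: the function names make_opener, is_usable and usable_proxies are hypothetical, the {"ip": ..., "port": ...} dict shape is an assumption, and the opener is used directly instead of being installed globally with install_opener.

```python
import http.client
import urllib.request


def make_opener(ip, port):
    """Build an opener that routes HTTP requests through ip:port."""
    handler = urllib.request.ProxyHandler({"http": "{}:{}".format(ip, port)})
    return urllib.request.build_opener(handler)


def is_usable(ip, port, test_url="http://ip.chinaz.com/getip.aspx", timeout=3):
    """Return True if test_url can be fetched through the proxy in time."""
    try:
        with make_opener(ip, port).open(test_url, timeout=timeout) as resp:
            return resp.getcode() == 200
    except (OSError, ValueError, http.client.HTTPException):
        # URLError, socket.timeout and connection errors are OSError subclasses
        return False


def usable_proxies(candidates, check=is_usable):
    """Keep only the (ip, port) pairs for which check(ip, port) is true.

    The dict layout is an assumption made for this sketch.
    """
    return [{"ip": ip, "port": port} for ip, port in candidates if check(ip, port)]
```

The check parameter is there so the filtering logic can be tried with a stand-in function before any real proxy is contacted; with the default, every candidate triggers one network request through that proxy.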
The code:
from urllib.request import urlopen
import re
import requests
from bs4 import BeautifulSoup as bs
from urllib import request
import socket
# Set the default socket timeout to 3 seconds
socket.setdefaulttimeout(3)
# Request the xicidaili URL and get the response
def request_to_get(url):
    hearder = {
        "Accept":"text/html,application/xhtml+xml,application/xml;q=0.9,image/webp,image/apng,*/*;q=0.8",
        "Accept-Encoding":"gzip, deflate",
"A