Getting Free Proxies

小跟班在这里

Last modified 2022-02-07 23:16:03


Category: python tools. Tags: http, proxy pattern, network protocols

First published 2022-02-07 23:13:48

Article link: https://blog.csdn.net/weixin_43818544/article/details/122816225


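How it works: the function scrapes a free proxy listing site and returns the fastest entries, meaning those with a listed response time of 2 seconds or less, sorted fastest first. If the first site yields nothing, it falls back to a second listing site. Each entry is an (IP, protocol) tuple. To see how to use it, print the function's return value and take a look.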

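import re
import requests

# Module-level cache so the listing sites are scraped at most once per process.
PROXY_IPS = []

# Dotted IPv4 pattern; the dots are escaped so they only match a literal ".".
IP_PATTERN = r"[0-9]{1,3}\.[0-9]{1,3}\.[0-9]{1,3}\.[0-9]{1,3}"


def get_proxy_ips() -> list:
    """Return (ip, protocol) tuples scraped from free proxy listing sites."""
    global PROXY_IPS
    if not PROXY_IPS:
        contents = requests.get("https://www.kuaidaili.com/free/inha/", timeout=10).text
        ips = re.findall(f'<td data-title="IP">({IP_PATTERN})</td>', contents)
        # The Chinese attribute values come from the page's HTML:
        # "类型" is the protocol column, "响应速度" the response time, "秒" means seconds.
        protocols = re.findall('<td data-title="类型">(HTTP|HTTPS)</td>', contents)
        speeds = re.findall('<td data-title="响应速度">(.*?)秒</td>', contents)
        # Convert the times to float before sorting; sorted as strings, "10" would come before "2".
        http_results = sorted(
            {(i, p): float(s) for i, p, s in zip(ips, protocols, speeds)}.items(),
            key=lambda x: x[1],
        )
        # Keep only proxies with a listed response time of 2 seconds or less.
        PROXY_IPS = [i[0] for i in http_results if i[1] <= 2]
        if not PROXY_IPS:
            # Fallback source; re.S lets ".*?" span line breaks between table cells.
            PROXY_IPS = re.findall(
                f"<td>({IP_PATTERN})</td>.*?<td>(HTTP|HTTPS)</td>",
                requests.get("https://ip.jiangxianli.com/?anonymity=1", timeout=10).text,
                re.S,
            )
    return PROXY_IPS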
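
To get a feel for what the return value looks like without hitting the network, here is a minimal sketch of the parse, sort, and filter step run against a hand-written HTML snippet. The helper name `parse_proxy_rows`, the `max_seconds` parameter, and the sample rows (private 10.0.0.x addresses, not real proxies) are all made up for illustration; the real page markup may differ from what these patterns assume.

```python
import re

# Hypothetical sample rows shaped like the cells the patterns look for.
# These are placeholder addresses, not real proxy data.
SAMPLE_HTML = """
<tr><td data-title="IP">10.0.0.1</td><td data-title="类型">HTTP</td><td data-title="响应速度">3.5秒</td></tr>
<tr><td data-title="IP">10.0.0.2</td><td data-title="类型">HTTPS</td><td data-title="响应速度">0.4秒</td></tr>
<tr><td data-title="IP">10.0.0.3</td><td data-title="类型">HTTP</td><td data-title="响应速度">1.2秒</td></tr>
"""

IP_PATTERN = r"[0-9]{1,3}\.[0-9]{1,3}\.[0-9]{1,3}\.[0-9]{1,3}"


def parse_proxy_rows(html, max_seconds=2.0):
    """Return (ip, protocol) tuples, fastest first, no slower than max_seconds."""
    ips = re.findall(f'<td data-title="IP">({IP_PATTERN})</td>', html)
    protocols = re.findall('<td data-title="类型">(HTTP|HTTPS)</td>', html)
    speeds = re.findall('<td data-title="响应速度">(.*?)秒</td>', html)
    # Map each (ip, protocol) pair to its response time as a number.
    timed = {(ip, proto): float(s) for ip, proto, s in zip(ips, protocols, speeds)}
    # Sort by response time, then drop anything slower than the threshold.
    ranked = sorted(timed.items(), key=lambda item: item[1])
    return [pair for pair, seconds in ranked if seconds <= max_seconds]


result = parse_proxy_rows(SAMPLE_HTML)
print(result)  # [('10.0.0.2', 'HTTPS'), ('10.0.0.3', 'HTTP')]
```

Note that the tuples carry only an IP and a protocol. The patterns do not capture a port, so the port would have to be scraped separately before an entry could be used as a proxy address.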
