metrics.roc_curve()输出的tpr或fpr的结果为nan

最新推荐文章于 2023-12-13 17:02:19 发布

会飞的猩猩。

最新推荐文章于 2023-12-13 17:02:19 发布

阅读量1.1w

点赞数 4

分类专栏：机器学习 Python 文章标签： machine learning metrics.roc_curve

本文链接：https://blog.csdn.net/zx1245773445/article/details/82909515

版权

Python 同时被 2 个专栏收录

44 篇文章 8 订阅

订阅专栏

机器学习

34 篇文章 3 订阅

订阅专栏

在用metrics.roc_curve()函数计算tpr的时候，出现tpr为nan的情况，主要是因为label里面没有正样本的标签。

下面是roc_curve()里面的一段源码，其中tps[-1]存放的是所有的正样本。同理，如果fpr出现nan的情况是因为label里面没有负样本。

 if fps[-1] <= 0:
        warnings.warn("No negative samples in y_true, "
                      "false positive value should be meaningless",
                      UndefinedMetricWarning)
        fpr = np.repeat(np.nan, fps.shape)
    else:
        fpr = fps / fps[-1]

    if tps[-1] <= 0:
        warnings.warn("No positive samples in y_true, "
                      "true positive value should be meaningless",
                      UndefinedMetricWarning)
        tpr = np.repeat(np.nan, tps.shape)
    else:
        tpr = tps / tps[-1]

我们来看个?，正样本的标签为2，负样本的标签为1：

from sklearn import metrics
import numpy as np

y = np.array([1,1,1,1])
scores = scores = np.array([0.1, 0.4, 0.35, 0.8])
fpr, tpr, thresholds = metrics.roc_curve(y, scores, pos_label=2)

因为样本中没有怎样本，所以tpr的值就为nan