【机器学习实战】分类算法评估之ROC曲线绘制（多模型对比）

最新推荐文章于 2025-03-20 18:23:03 发布

想做一只快乐的修狗

最新推荐文章于 2025-03-20 18:23:03 发布

阅读量1.5k

点赞数 2

文章标签：机器学习分类 python

本文链接：https://blog.csdn.net/weixin_44109827/article/details/129727291

版权

该代码示例展示了如何使用Python的scikit-learn库训练Logistic回归、决策树和随机森林三种分类器，并计算它们在ROC曲线下的面积(AUC)。通过对1000个样本的二分类问题进行训练和测试，绘制了各个模型的ROC曲线，以及随机猜测的基准线，以评估模型的性能。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

1. 代码

import numpy as np
import matplotlib.pyplot as plt
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.tree import DecisionTreeClassifier
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import roc_curve, roc_auc_score
from sklearn.model_selection import train_test_split

# 生成样本数据
X, y = make_classification(n_samples=1000, n_features=10, n_classes=2, random_state=42)

# 将数据集分为训练集和测试集
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=42)

# 初始化分类器
classifiers = [
    ('Logistic Regression', LogisticRegression()),
    ('Decision Tree', DecisionTreeClassifier()),
    ('Random Forest', RandomForestClassifier())
]

# 遍历每个分类器，训练并绘制ROC曲线
for name, classifier in classifiers:
    classifier.fit(X_train, y_train)
    y_pred_proba = classifier.predict_proba(X_test)[:,1]
    fpr, tpr, _ = roc_curve(y_test, y_pred_proba)
    auc = roc_auc_score(y_test, y_pred_proba)
    plt.plot(fpr, tpr, label=f'{name} (AUC = {auc:.2f})')

# 绘制基准线
plt.plot([0, 1], [0, 1], 'k--', label='Random Guess')

# 设置图例、标题、坐标轴标签等信息
plt.legend()
plt.title('Receiver Operating Characteristic (ROC) Curve')
plt.xlabel('False Positive Rate')
plt.ylabel('True Positive Rate')
plt.show()