Logistic Regression for Cancer Classification

Latest recommended article published on 2023-12-15 16:24:33

Cocktail_py

Article link: https://blog.csdn.net/Cocktail_py/article/details/103041686

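Logistic regression is, despite its name, a classification model: it passes a linear combination of the input features through the sigmoid function to estimate the probability that a sample belongs to the positive class.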
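
To make the idea concrete, here is a minimal sketch of how a logistic regression model turns features into a class label. The weights, bias, and the 0.5 threshold are hypothetical values picked by hand for illustration; a real model learns the weights from training data.

```python
import math


def sigmoid(z: float) -> float:
    """Map any real number to the open interval (0, 1), avoiding overflow."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def predict_label(features, weights, bias, threshold=0.5):
    """Return (label, probability); label is 1 when probability >= threshold."""
    z = sum(w * x for w, x in zip(weights, features)) + bias
    probability = sigmoid(z)
    label = 1 if probability >= threshold else 0
    return label, probability


# Hypothetical weights and bias, chosen by hand for illustration only
weights = [0.8, -0.4]
bias = -0.5

label, probability = predict_label([2.0, 1.0], weights, bias)
print(label, round(probability, 3))
```

Lowering the threshold makes the model flag more samples as positive, which is one common way to trade precision for recall in screening-style problems.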

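Typical applications:
Ad click-through prediction
Spam email detection
Disease diagnosis
Financial fraud detection
Fake account detection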

I. Case background

# -*- coding: utf-8 -*-
# @Time    : 2019/11/13 07:16
# @Author  :

import pandas as pd
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.linear_model import LogisticRegression

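import ssl
# Workaround: skip HTTPS certificate verification so the dataset download does
# not fail on machines without up-to-date root certificates (not for production)
ssl._create_default_https_context = ssl._create_unverified_context

# 1. Load the data
# NOTE: the original post never defines `data`. The URL and column names below
# are an ASSUMPTION (the UCI Breast Cancer Wisconsin (Original) dataset), inferred
# from the "?" placeholders, the "Class" column, and the 1:10 feature slice.
names = ["Sample code number", "Clump Thickness", "Uniformity of Cell Size",
         "Uniformity of Cell Shape", "Marginal Adhesion",
         "Single Epithelial Cell Size", "Bare Nuclei", "Bland Chromatin",
         "Normal Nucleoli", "Mitoses", "Class"]
data = pd.read_csv(
    "https://archive.ics.uci.edu/ml/machine-learning-databases/"
    "breast-cancer-wisconsin/breast-cancer-wisconsin.data",
    names=names,
)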

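# 2. Basic data processing
# 2.1 Handle missing values: they are marked with "?", so convert them to NaN
#     and drop the affected rows (np.NaN was removed in NumPy 2.0; use np.nan)
data = data.replace(to_replace="?", value=np.nan)
data = data.dropna()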
# 2.2 Select the features (columns at positions 1 to 9) and the target
x = data.iloc[:, 1:10]
y = data["Class"]

# 2.3 Split into training and test sets (fixed seed for reproducibility)
x_train, x_test, y_train, y_test = train_test_split(x, y, random_state=22)

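# 3. Feature engineering (standardization)
# Fit the scaler on the training set only, then reuse its mean and standard
# deviation on the test set so no test information leaks into training
transfer = StandardScaler()
x_train = transfer.fit_transform(x_train)
x_test = transfer.transform(x_test)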

# 4. Machine learning (logistic regression)
estimator = LogisticRegression()
estimator.fit(x_train, y_train)

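# 5. Model evaluation
# In a cancer example like this one, accuracy is not the main concern. What
# matters is whether every cancer patient among the samples is detected,
# i.e. the recall on the malignant class.
y_predict = estimator.predict(x_test)
# score() returns the mean accuracy on the test set
print(estimator.score(x_test, y_test))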
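
The evaluation comment above says that detecting every cancer patient matters more than accuracy. The sketch below shows why the two numbers can disagree. The labels are made up for illustration, and the coding 2 = benign, 4 = malignant is an assumption about the dataset, not something the script above states.

```python
def accuracy(y_true, y_pred):
    """Fraction of all samples that were predicted correctly."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def recall(y_true, y_pred, pos_label):
    """Fraction of actual positives that the model predicted as positive."""
    pairs = list(zip(y_true, y_pred))
    true_pos = sum(1 for t, p in pairs if t == pos_label and p == pos_label)
    false_neg = sum(1 for t, p in pairs if t == pos_label and p != pos_label)
    actual_pos = true_pos + false_neg
    return true_pos / actual_pos if actual_pos else 0.0


# Made-up labels for illustration; assumed coding: 2 = benign, 4 = malignant
y_true = [2, 2, 2, 2, 2, 2, 2, 2, 4, 4]
y_pred = [2, 2, 2, 2, 2, 2, 2, 2, 4, 2]

print("accuracy:", accuracy(y_true, y_pred))                      # 0.9
print("malignant recall:", recall(y_true, y_pred, pos_label=4))   # 0.5
```

Here 90% of the predictions are correct, yet half of the malignant cases are missed. With scikit-learn, `sklearn.metrics.recall_score(y_test, y_predict, pos_label=4)` or `sklearn.metrics.classification_report(y_test, y_predict)` should report the same per-class recall for the real test set, assuming the labels are coded as above.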