七、(4)逻辑回归——二分类法,预测乳腺癌数据
乳腺癌数据集下载地址:https://archive.ics.uci.edu/ml/machine-learning-databases/breast-cancer-wisconsin/breast-cancer-wisconsin.data
下载的数据为data格式,直接改文件名为csv查看数据内容即可。最后一行为目标值。2代表正常,4代表癌症。
由于官网给的数据集没有每列的名称,需要我们自己添加,代码中会写出添加步骤。
代码如下:
# -*- coding: utf-8 -*-
"""
Created on Sun May 26 21:34:29 2019
@author: sun
"""
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.metrics import classification_report
from sklearn.externals import joblib
import pandas as pd
import numpy as np