线性分类的Jupyter实践

最新推荐文章于 2022-12-07 16:09:44 发布

_wensea_

最新推荐文章于 2022-12-07 16:09:44 发布

阅读量217

点赞数

分类专栏：笔记

本文链接：https://blog.csdn.net/weixin_48547489/article/details/115256186

版权

笔记专栏收录该内容

34 篇文章 1 订阅

订阅专栏

一、准备工作

1、安装Anaconda
可以看我上一个博客，也可以直接在这里下载
2、下载实验所需的包
在创建的虚拟环境中安装自己需要的包。
在这里插入图片描述
点击Open Terminal
之后分别运行下列命令：

pip install -i https://pypi.tuna.tsinghua.edu.cn/simple numpy
pip install -i https://pypi.tuna.tsinghua.edu.cn/simple pandas
pip install -i https://pypi.tuna.tsinghua.edu.cn/simple sklearn
pip install -i https://pypi.tuna.tsinghua.edu.cn/simple matplotlib

如下图：
在这里插入图片描述

二、实验步骤

1、打开命令行**

在这里插入图片描述

此处点击open with python

2、取萼片的长宽作为特征进行分类

1.导入包
使用下列

import numpy as np
from sklearn.linear_model import LogisticRegression
import matplotlib.pyplot as plt
import matplotlib as mpl
from sklearn import datasets
from sklearn import preprocessing
import pandas as pd
from sklearn.preprocessing import StandardScaler
from sklearn.pipeline import Pipeline

在这里插入图片描述

2、获取数据集

# 获取所需数据集
iris=datasets.load_iris()
#每行的数据，一共四列，每一列映射为feature_names中对应的值
X=iris.data
print(X)
#每行数据对应的分类结果值（也就是每行数据的label值）,取值为[0,1,2]
Y=iris.target
print(Y)

3、对数据进行处理

#归一化处理
X = StandardScaler().fit_transform(X)
print(X)

4、训练模型

lr = LogisticRegression()   # Logistic回归模型
lr.fit(X, Y)        # 根据数据[x,y]，计算回归参数

5、绘制分类后的图像

N, M = 500, 500     # 横纵各采样多少个值
x1_min, x1_max = X[:, 0].min(), X[:, 0].max()   # 第0列的范围
x2_min, x2_max = X[:, 1].min(), X[:, 1].max()   # 第1列的范围
t1 = np.linspace(x1_min, x1_max, N)
t2 = np.linspace(x2_min, x2_max, M)
x1, x2 = np.meshgrid(t1, t2)                    # 生成网格采样点
x_test = np.stack((x1.flat, x2.flat), axis=1)   # 测试点

cm_light = mpl.colors.ListedColormap(['#77E0A0', '#FF8080', '#A0A0FF'])
cm_dark = mpl.colors.ListedColormap(['g', 'r', 'b'])
y_hat = lr.predict(x_test)       # 预测值
y_hat = y_hat.reshape(x1.shape)                 # 使之与输入的形状相同
plt.pcolormesh(x1, x2, y_hat, cmap=cm_light)     # 预测值的显示
plt.scatter(X[:, 0], X[:, 1], c=Y.ravel(), edgecolors='k', s=50, cmap=cm_dark)    
plt.xlabel('petal length')
plt.ylabel('petal width')
plt.xlim(x1_min, x1_max)
plt.ylim(x2_min, x2_max)
plt.grid()
plt.show()

在这里插入图片描述
6、预测模型

y_hat = lr.predict(X)
Y = Y.reshape(-1)
result = y_hat == Y
print(y_hat)
print(result)
acc = np.mean(result)
print('准确度: %.2f%%' % (100 * acc))

在这里插入图片描述

三、小结

通过线性多分类的实现，主要认识逻辑回归的使用过程。从最后预测结果来看，整个模型的准确性还是比较高，准确度是可以满足要求的。

_wensea_

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
复制链接

分享到 QQ

分享到新浪微博

扫一扫

专栏目录