Machine Learning (1): The Perceptron Model

bunschen

Category: Machine Learning. Tags: machine learning, algorithms, python

Article link: https://blog.csdn.net/chenrulong/article/details/53558883


Basic principles of the perceptron learning algorithm:

The perceptron model was proposed by Rosenblatt. It imitates how a single neuron in the brain works: the neuron either fires or stays inactive.
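The original perceptron therefore works in a very simple way. The main steps are:

Initialize the weights to 0 or to small random values
For each training sample $x^{(i)}$, do the following:
1. Compute the output value $\hat y$
2. Update the weights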
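
The steps above do not say how $\hat y$ is obtained. Judging from the predict method in the implementation later in this post, it is a unit step applied to the net input, which can be written as follows (with $w_0$ as the bias weight; this formula is reconstructed from the code, not stated in the text):

$\hat y^{(i)} = \begin{cases} 1 & \text{if } w_0 + \sum_{j \ge 1} w_j x_j^{(i)} \ge 0 \\ -1 & \text{otherwise} \end{cases}$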

Here the output value is the predicted class label. At each step, all weights $w_j$ in the weight vector $w$ are updated simultaneously, that is:

$w_j := w_j + \Delta w_j$

The value $\Delta w_j$ used to update the weight $w_j$ is computed with the perceptron learning rule:

$\Delta w_j = \eta (y^{(i)} - \hat y^{(i)}) x_j^{(i)}$

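Here $\eta$ is the learning rate (a constant, usually between 0.0 and 1.0), $y^{(i)}$ is the true label of the i-th training sample, $\hat y^{(i)}$ is the corresponding predicted label, and $x_j^{(i)}$ is the j-th feature of the i-th training sample. For example, for a sample with two features (the bias weight $w_0$ corresponds to a constant input $x_0^{(i)} = 1$):

$\Delta w_0 = \eta(y^{(i)} - \hat y^{(i)})$

$\Delta w_1 = \eta(y^{(i)} - \hat y^{(i)})x_1^{(i)}$

$\Delta w_2 = \eta(y^{(i)} - \hat y^{(i)})x_2^{(i)}$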
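
To make the rule concrete, here is a minimal sketch of a single update step for one sample with two features. The sample values, the label, and the learning rate are made up for illustration, and the unit-step prediction is assumed to match the predict method shown later in this post.

```python
import numpy as np

eta = 0.1                   # learning rate (assumed value)
w = np.zeros(3)             # w[0] is the bias weight, w[1:] are the feature weights
x = np.array([5.1, 1.4])    # one hypothetical sample with two features
y = -1                      # its (hypothetical) true label

# Predict with the current weights: unit step on the net input
y_hat = 1 if w[0] + np.dot(w[1:], x) >= 0.0 else -1

# Perceptron rule: each weight moves by eta * (y - y_hat) * x_j, with x_0 = 1
update = eta * (y - y_hat)
w[0] += update
w[1:] += update * x

print(y_hat, w)
```

With all weights at zero the net input is 0, so the sample is predicted as class 1. Since the true label is -1, the update is 0.1 * (-2) = -0.2, and the weights move to approximately (-0.2, -1.02, -0.28). Had the prediction been correct, the update would have been zero and the weights would not have changed.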

Implementation of the perceptron learning algorithm:

perceptron_classifier.py :

# -*- coding:utf-8 -*-

import numpy as np


class Perceptron(object):

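    """Perceptron classifier.

    Parameters
    ----------
    eta : float
        Learning rate.
    n_iter : int
        Number of passes (epochs) over the training set.

    Attributes
    ----------
    w_ : 1d-array
        Weights after fitting; w_[0] is the bias term.
    errors_ : list
        Number of misclassified samples in each epoch.
    """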
    def __init__(self, eta=0.01, n_iter=10):
        self.eta = eta
        self.n_iter = n_iter

    def fit(self, X, y):
        # One weight per feature plus the bias term w_[0], all starting at zero
        self.w_ = np.zeros(1 + X.shape[1])
        self.errors_ = []

        for _ in range(self.n_iter):
            errors = 0
            for xi, target in zip(X, y):
                # update is zero when the sample is already classified correctly
                update = self.eta * (target - self.predict(xi))
                self.w_[1:] += update * xi
                self.w_[0] += update
                errors += int(update != 0.0)
            self.errors_.append(errors)
        return self

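    def net_input(self, X):
        # Weighted sum of the features plus the bias
        return np.dot(X, self.w_[1:]) + self.w_[0]

    def predict(self, X):
        # Unit step: class 1 if the net input is non-negative, otherwise -1
        return np.where(self.net_input(X) >= 0.0, 1, -1)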
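
As a quick sanity check that needs no download, the sketch below repeats the same training loop as a standalone function and runs it on a tiny hand-made data set. The function name train_perceptron and the toy data are hypothetical and only serve this illustration; the loop itself mirrors the fit method above.

```python
import numpy as np


def train_perceptron(X, y, eta=0.1, n_iter=10):
    """Function form of the fit loop; returns (weights, errors per epoch)."""
    w = np.zeros(1 + X.shape[1])          # w[0] is the bias weight
    errors_per_epoch = []
    for _ in range(n_iter):
        errors = 0
        for xi, target in zip(X, y):
            y_hat = 1 if np.dot(xi, w[1:]) + w[0] >= 0.0 else -1
            update = eta * (target - y_hat)
            w[1:] += update * xi
            w[0] += update
            errors += int(update != 0.0)
        errors_per_epoch.append(errors)
    return w, errors_per_epoch


# Hypothetical toy data: two linearly separable points per class
X_toy = np.array([[1.0, 1.0], [2.0, 1.5], [4.0, 5.0], [5.0, 4.5]])
y_toy = np.array([-1, -1, 1, 1])

w, errors = train_perceptron(X_toy, y_toy)
pred = np.where(X_toy.dot(w[1:]) + w[0] >= 0.0, 1, -1)

print(errors)   # misclassifications per epoch
print(pred)     # predictions on the training points
```

On this toy data the error count drops to zero within the ten epochs, after which the weights stop changing and every training point is classified correctly. The perceptron only settles like this when the two classes are linearly separable.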

Testing the perceptron learning algorithm:

iris_perceptron_classifier.py :

import pandas as pd
import matplotlib.pyplot as plt
import numpy as np
import sys
sys.path.append('../')
from perceptron_classifier import Perceptron

# Download the iris data set from the web
df = pd.read_csv('https://archive.ics.uci.edu/ml/'
                 'machine-learning-databases/iris/iris.data', header=None)

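# y: the labels of the first 100 rows, stored in the 5th column
y = df.iloc[0:100, 4].values
# Encode the labels numerically: -1 for Iris-setosa, 1 otherwise
y = np.where(y == 'Iris-setosa', -1, 1)
# X: the first 100 samples, using the 1st and 3rd features to tell the species apart
X = df.iloc[0:100, [0, 2]].values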

# Scatter-plot the flowers by sepal length and petal length
plt.scatter(X[:50, 0], X[:50, 1], color='red', marker='o', label='setosa')
plt.scatter(X[50:100, 0], X[50:100, 1], color='blue', marker='x', label='versicolor')
plt.xlabel('sepal length')
plt.ylabel('petal length')
plt.legend(loc='upper left')
plt.show()

# Train the perceptron on the training samples
ppn = Perceptron(eta=0.1, n_iter=10)
ppn.fit(X, y)

# Plot the number of misclassified samples in each epoch
plt.plot(range(1, len(ppn.errors_) + 1), ppn.errors_, marker='o')
plt.xlabel('Epochs')
plt.ylabel('Number of misclassifications')
plt.show()

Test results of the perceptron learning algorithm:

[Figure: test result plots]

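Notes:

zip(X, y): pairs each sample with its label and yields tuples $(x^{(i)}, y^{(i)})$. In Python 3 it returns an iterator; in Python 2 it returned a list.
sys.path.append('../')
from perceptron_classifier import Perceptron
These two lines add the parent directory to the module search path for the duration of the program, so that the perceptron_classifier module can be imported. Note that '../' is resolved relative to the current working directory, not to the location of the script.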
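
The pairing done by zip in the training loop can be checked with a short sketch. The two samples and labels below are made-up values, and the names X_demo, y_demo and pairs are introduced only for this illustration.

```python
import numpy as np

X_demo = np.array([[5.1, 1.4], [7.0, 4.7]])   # two hypothetical samples
y_demo = np.array([-1, 1])                    # their hypothetical labels

# zip walks over the rows of X_demo and the entries of y_demo in lockstep;
# list() materialises the iterator so the pairs can be inspected
pairs = list(zip(X_demo, y_demo))

for xi, target in pairs:
    print(xi, target)
```

Each element of pairs is a tuple holding one row of X_demo and the label at the same position in y_demo, which is exactly what the inner loop of fit unpacks into xi and target.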
