LogisticRegression(逻辑回归)

香喷喷的小鸡翅

已于 2022-04-13 09:01:09 修改

阅读量954

点赞数 1

分类专栏：机器学习算法文章标签：机器学习

于 2022-04-12 22:07:49 首次发布

本文链接：https://blog.csdn.net/qq_37637415/article/details/124132842

版权

机器学习算法专栏收录该内容

2 篇文章 0 订阅

订阅专栏

一：概念

Logistic Regression 虽然被称为回归，但其实际上是分类模型，并常用于二分类。Logistic Regression 因其简单、可并行化、可解释强深受工业界喜爱。
Logistic 回归的本质是：假设数据服从这个分布，然后使用极大似然估计做参数的估计。

决策边界可以表示为 ：
在这里插入图片描述
1.对未知样本
${x} = (x_{1} + x_{2} ... x_{n})$ 类别的预测与求解：w
$w^Tx = w_0 + w_1x_1 + w_2x_2+...+w_nx_n + b\\ 矩阵格式为：\\ z = w^Tx = \begin{gathered} \begin{bmatrix} w_0 \\ w_1\\ w_2\\...\\w_n \end{bmatrix} \begin{bmatrix} 1 & x_1& x_2&...&x_n \end{bmatrix} \end{gathered}\\ 注意：x向量第一个元素为1，后面代码会有生成过程。$

2.将特征的现行组合函数通过sigmoid函数进行归一化
$\quad function ：\varphi(z) = \frac{1}{1+e^{-z}}$
那么 ~
$属于正类的概率\hat{p} = \left\{ \begin{array}{rcl} 1\quad(\hat{p}\leq 0.5)\\ 0\quad(\hat{p} <0.5) \end{array}\right.$
$s i g m o i d :$
在这里插入图片描述

sigmoid 函数输出概率的形式值域为（0，1），因为一般处理二分问题我们只需要两个预测值，当然我也见过（-1，1）不确定是否还是sigmoid函数，但是后面的数学公式全部需要重推而且概率也不是我们想要的了 ps：这个还有待考究，十分麻烦，所以还是采用（0，1）区间。
样本x各个维度的加权叠加z作为sigmoid的输入
$\in(-\infty,+\infty) ,\quad z=0\quad \varphi(z)=0.5$
2.推导进行分类公式
$\frac{p}{1-p}几率(odds)$
$log(\frac{p}{1-p})对数几率(logodds)或logit$
x为正例的概率越大那么对数几率取值就越大，相反取反例的概率越大对数的几率取值就越小
取得线性模型：
$log(\frac{p}{1-p}) = w^Tx + b$
那么正反例的推导就为：
$\varphi(x)=\frac{1}{1+e^{-{w^Tx+b}}}$
$\varphi(x)=\frac{e^{-{w^Tx+b}}}{1+e^{-{w^Tx+b}}}$
现在我们来讨论单个训练样本的代价函数
$例如： (x^{(i)},y^{(i)})$
$\left\{ \begin{array}{rcl} -log(\hat{p})\quad(y^{(i)}=1)\\ -log(1-\hat{p})\quad(y^{(i)}=0) \end{array}\right.$
$y^{(i)}=1 时，\hat{p}值越小，训练样本是正类的可能性越小，将其判断为正类的代价就越高\\ 当 y^{(i)}=0 时，\hat{p}值越大，训练样本是负类的可能性越小，将其判断为负类的代价就越高$
总代价函数：
$-[y^{(i)}log(\hat{p}) + (1-y^{(i)})log(1-\hat{p})]$
$cost对w求偏导得（此处使用梯度下降法求最优w）：\\ \frac{\partial cost}{\partial w} = x^T(\frac{1}{1+e^{-z}} - y^{(i)})$
使用梯度下降求最优w
1.初始化w （w为三行一列矩阵）
2.更新w：
$-alpha*\frac{\partial cost}{\partial w}$
3.迭代到一定次数或到一定阀值
最后得到w为一条超平面线将正利反例分割开

python代码：

import os, sys
import numpy as np
import matplotlib.pyplot as plt
from sklearn import datasets
from sklearn.model_selection import train_test_split
#从文本中读取
x = []
y = []
label = []
len1 = 0
index = 0
file_path = 'testSet.txt'
data_dir = '.'
file_path = os.path.join(data_dir, file_path)
data = [line.strip() for line in open(file_path)]
np.random.shuffle(data)
for data1 in data:
    part_x, part_y, part_label = data1.split(' ')
    x.append(float(part_x))
    y.append(float(part_y))
    label.append(int(part_label))
    len1 = len1 + 1
#print(label)
#concatenate函数主要指的是吧x按照0轴进行合并

"""
#从sklearn dataset中读取数据
iris = datasets.load_iris()
x = iris['data'][0]
y = iris['target']
x = x[y != 2]
y = y[y != 2]
"""

x = np.concatenate([x])
x = x.reshape(len1,1)
#print(x)
y = np.concatenate([y])
y = y.reshape(len1,1)

#X与一个len1行1列的矩阵合并
ones = np.ones((len1, 1))
x = np.hstack((x,ones))
x = np.hstack((x, y))
#print(x)
x_avr = np.mean(x)
x_std = np.std(x)

label = np.concatenate([label])
label = label.reshape(len1,1)

for label1 in label:
    if(label1 == 0):
        plt.scatter(x[index,0], y[index,0], c='red')
    elif(label1 == 1):
        plt.scatter(x[index,0], y[index,0], c='green')
    index = index + 1

def sigmoid(x, omega):
    diff = np.dot(x,omega)
    return 1 / (1 + np.exp(-diff))

def cost(label, sig):
    return (1./len1) * (np.dot(-label, np.log(sig)) - np.dot((1 - label), np.log(1 - sig)))

def gradient(x, label, sig):
    gradient = (1./len1) * (np.dot(np.transpose(x), (sig - label)))
    return gradient

def Logistic_regression(x, label):
    num = 100000  #迭代2000000轮
    omega = np.array([0, 0, 0]).reshape(3, 1)
    alpha = 0.01
    sig = sigmoid(x, omega)
    cost_gradient = gradient(x, label, sig)
    for i in range(num):
        omega = omega - alpha * cost_gradient
#        print(omega)
        sig = sigmoid(x, omega)
        cost_gradient = gradient(x, label, sig)
    return omega

omega = Logistic_regression(x, label)
x1 = np.linspace(-5, 5, 100)   #创建一个等差数列
y1 = (omega[0]*x1 + omega[1]) / -omega[2]
print(omega)
plt.plot(x1, y1)
plt.show()

数据格式：
数据集大家可以模拟一下类似
x1 x2 lable
1 3 0
2 5 1
这种也可以在sklearn.datasets 中使用预制的数据集。

之后会继续完善文档，此种还有细节和正则化问题需要补充！！！

香喷喷的小鸡翅

关注

1
点赞
踩
2

收藏

觉得还不错? 一键收藏
1
评论
LogisticRegression(逻辑回归)

一：概念Logistic Regression 虽然被称为回归，但其实际上是分类模型，并常用于二分类。Logistic Regression 因其简单、可并行化、可解释强深受工业界喜爱。Logistic 回归的本质是：假设数据服从这个分布，然后使用极大似然估计做参数的估计。决策边界可以表示为：1.对未知样本x=(x1+x2...xn){x} = (x_{1} + x_{2} ... x_{n})x=(x1+x2...xn)类别的预测与求解：wz=wTx=w0+w1x1+w2x
复制链接

扫一扫