文章目录
1.加载并处理数据
import numpy as np
import matplotlib.pyplot as plt
import matplotlib
from scipy.io import loadmat
import scipy.optimize as opt
from sklearn.metrics import classification_report
def load_data(path, transpose=True):
    """Load the hand-written-digit dataset from a MATLAB .mat file.

    Args:
        path: filesystem path of the .mat file containing 'X' and 'y'.
        transpose: when True, reshape each 400-element sample to a 20x20
            image, transpose it, and flatten it back — converting the
            MATLAB column-major pixel order to row-major.

    Returns:
        (X, y): feature matrix and flattened label vector.
    """
    mat = loadmat(path)
    X = mat['X']
    y = mat['y'].flatten()
    if transpose:
        fixed = [sample.reshape(20, 20).T.flatten() for sample in X]
        X = np.array(fixed)
    return X, y
def serialize(theta1, theta2):
    """Flatten both weight matrices and join them into a single 1-D vector.

    For 1-D inputs np.concatenate's axis argument is irrelevant, so the
    result is simply theta1's entries followed by theta2's.
    """
    return np.concatenate((theta1.ravel(), theta2.ravel()))
def deserialize(theta):
    """Split the flat parameter vector into the two layer weight matrices.

    Returns:
        theta1 of shape (25, 401) — input layer -> hidden layer weights,
        theta2 of shape (10, 26)  — hidden layer -> output layer weights.
    """
    split = 25 * 401
    first = theta[:split].reshape(25, 401)
    second = theta[split:].reshape(10, 26)
    return first, second
# NOTE(review): `path` must be defined earlier in the script (not visible in this chunk).
X,y=load_data(path,transpose=False)#the pretrained weights were fit on the raw (untransposed) data
y=np.array([(y==k) for k in range(1,11)]).T # one-hot encode labels 1..10 -> y.shape (5000, 10)
X=np.insert(X,0,np.ones(X.shape[0]),axis=1) # prepend a bias column of ones -> X.shape (5000, 401)
2.随机初始化
def random_init(size):
    """Draw `size` weights uniformly from [-0.12, 0.12].

    Random (rather than zero) initialization breaks the symmetry between
    hidden units so gradient descent can learn distinct features.
    """
    epsilon = 0.12
    return np.random.uniform(-epsilon, epsilon, size)

# Total parameter count: 25*401 + 10*26 = 10285.
theta_init = random_init(10285)
3.前向传播
def sigmoid(z):
    """Element-wise logistic function: 1 / (1 + e^(-z))."""
    return 1.0 / (1.0 + np.exp(-z))
def feed_forward(theta, X):
    """Propagate inputs through the 400-25-10 network.

    Args:
        theta: flat parameter vector (split by `deserialize`).
        X: design matrix with the bias column already inserted, (5000, 401).

    Returns:
        Every intermediate quantity backprop needs:
        a1 (inputs), z2, a2 (hidden activations with bias), z3, h3 (outputs).
    """
    theta1, theta2 = deserialize(theta)  # (25, 401) and (10, 26)
    a1 = X
    z2 = a1 @ theta1.T                   # (5000, 25)
    hidden = sigmoid(z2)
    a2 = np.insert(hidden, 0, np.ones(hidden.shape[0]), axis=1)  # bias -> (5000, 26)
    z3 = a2 @ theta2.T                   # (5000, 10)
    h3 = sigmoid(z3)
    return a1, z2, a2, z3, h3
4.代价函数
(1)无正则化代价函数
$J(\Theta)=\frac{1}{m}\sum_{i=1}^{m}\sum_{k=1}^{K}\left[-y_k^{(i)}\log\left(h_\Theta(x^{(i)})_k\right)-\left(1-y_k^{(i)}\right)\log\left(1-h_\Theta(x^{(i)})_k\right)\right]$，其中 $K$ 表示分类的个数。
def cost(theta, X, y):
    """Unregularized cross-entropy cost averaged over the m examples.

    Args:
        theta: flat parameter vector.
        X: design matrix with bias column, (m, 401).
        y: one-hot label matrix, (m, 10).
    """
    h = feed_forward(theta, X)[-1]  # network outputs, shape (m, 10)
    # '*' on arrays is element-wise (same as np.multiply).
    per_cell = -y * np.log(h) - (1 - y) * np.log(1 - h)
    return np.sum(per_cell) / len(X)
# Sanity check: evaluate the cost at the random initial weights.
cost(theta_init,X,y)
# observed value ~7.2375 — varies run to run since theta_init is random
(2)正则化代价函数
J ( Θ ) = 1 m [ ∑ i = 1 m ∑ k = 1 K − y k ( i ) l o g ( h Θ ( x ( i ) ) k ) − ( 1 − y k ( i ) ) l o g ( 1 − h Θ ( x ( i ) ) k ) ] + λ 2 m ∑ l = 1 L − 1 ∑ i = 1 s l ∑ j = 1 s l + 1 ( Θ j i ( i ) ) 2 J(\Theta)=\frac{1}{m}[\sum_{i=1}^{m}\sum_{k=1}^{K}-y_k^{(i)}log(h_{\Theta}(x^{(i)})_k)-(1-y_k^{(i)})log(1-h_{\Theta}(x^{(i)})_k)]+\frac{\lambda}{2m}\sum_{l=1}^{L-1}\sum_{i=1}^{s_l}\sum_{j=1}^{s_{l+1}}(\Theta_{ji}^{(i)})^2 J(Θ)=m1[i=1∑mk=1∑K−y