BP神经网络（python代码）

最新推荐文章于 2024-09-06 22:37:26 发布

FangHui5206

最新推荐文章于 2024-09-06 22:37:26 发布

阅读量1.4k

点赞数 1

分类专栏：神经网络机器学习 python

机器学习同时被 3 个专栏收录

10 篇文章 0 订阅

订阅专栏

python

8 篇文章 2 订阅

订阅专栏

神经网络

3 篇文章 0 订阅

订阅专栏

BP神经网络（python代码）

2016年02月23日 17:34:02

阅读数：7052

神经网络是深度学习的基础。个人理解神经网络就是可以拟合任何一种广义线性模型的结构，本文主要记录python代码的学习笔记。

BP神经网络原理（待补充，可以详见《数据挖掘概念与技术》P258页）

伪代码：

代码中使用的随机梯度下降，伪代码是使用整体数据做梯度下降。

 
     [python]  
     view plaincopy
# coding=utf-8  
import numpy as np  
  
def tanh(x):  
    return np.tanh(x)  
  
def tanh_deriv(x):  
    return 1.0- np.tanh(x)*np.tanh(x)  
  
def logistic(x):  
    return 1/(1+np.exp(-x))  
  
def logistic_derivative(x):  
    return logistic(x)*(1-logistic(x))  
  
class NeuralNetwork:  
    def __init__(self,layers,activation='tanh'):  
        """ 
 
        """  
        if activation == 'logistic':  
            self.activation = logistic  
            self.activation_deriv = logistic_derivative  
        elif activation=='tanh':  
            self.activation = tanh  
            self.activation_deriv=tanh_deriv  
  
        self.weights=[]  
        self.weights.append((2*np.random.random((layers[0]+1,layers[1]))-1)*0.25)  
        for i in range(2,len(layers)):  
            self.weights.append((2*np.random.random((layers[i-1],layers[i]))-1)*0.25)  
            #self.weights.append((2*np.random.random((layers[i]+1,layers[i+1]))-1)*0.25)  
  
    def fit(self,X,y,learning_rate=0.2,epochs=10000):  
        X = np.atleast_2d(X)  
            # atlest_2d函数:确认X至少二位的矩阵  
        temp = np.ones([X.shape[0],X.shape[1]+1])  
            #初始化矩阵全是1（行数，列数+1是为了有B这个偏向）  
        temp[:,0:-1]=X  
            #行全选，第一列到倒数第二列  
        X=temp  
        y=np.array(y)  
            #数据结构转换  
        for k in range(epochs):  
                # 抽样梯度下降epochs抽样  
            i = np.random.randint(X.shape[0])  
            a = [X[i]]  
  
            for l in range(len(self.weights)):  
                a.append(self.activation(np.dot(a[l],self.weights[l])))  
                # 向前传播，得到每个节点的输出结果  
            error = y[i]-a[-1]  
                # 最后一层错误率  
            deltas=[error*self.activation_deriv(a[-1])]  
  
            for l in range(len(a)-2,0,-1):  
                deltas.append(deltas[-1].dot(self.weights[l].T)*self.activation_deriv(a[l]))  
            deltas.reverse()  
            for i in range(len(self.weights)):  
                layer = np.atleast_2d(a[i])  
                delta = np.atleast_2d(deltas[i])  
                self.weights[i] +=learning_rate*layer.T.dot(delta)  
  
    def predict(self,x):  
        x=np.array(x)  
        temp= np.ones(x.shape[0]+1)  
        temp[0:-1]=x  
        a = temp  
        for l in range(0,len(self.weights)):  
            a=self.activation(np.dot(a,self.weights[l]))  
        return(a)  

下面为测试“异或”数据的代码：
因为输入是2个特征的所以输入层为2，隐藏层我随意定义了12，输出层为1（因为结果为0或1所以1位就够定义了。）

 
     [python]  
     view plaincopy

 
     [python]  
     view plaincopy
from BPNN import NeuralNetwork  
import numpy as np  
  
nn = NeuralNetwork([2,12,1],'tanh')  
x = np.array([[0,0],[0,1],[1,0],[1,1]])  
y = np.array([0,1,1,0])  
nn.fit(x,y)  
for i in [[0,0],[0,1],[1,0],[1,1]]:  
    print(i,nn.predict(i))  

代码结果：

 
     [python]  
     view plaincopy
([0, 0], array([-0.00917032]))  
([0, 1], array([ 0.99785397]))  
([1, 0], array([ 0.9979288]))  
([1, 1], array([ 0.01522156]))