Feed-Forward Neural Networks in Time Series: A Detailed Explanation with Examples

six.学长

Category: informer. Tags: neural networks, artificial intelligence, deep learning

Article link: https://blog.csdn.net/m0_51200050/article/details/139632324

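The feed-forward neural network (FFNN) is a key component of the Transformer model. It takes the output of the self-attention layer and applies a further non-linear transformation to it. The FFNN consists of two linear transformations with a non-linear activation function (usually ReLU) between them, applied to each time step independently, which increases the model's expressive power.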

How It Works
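
The figure that originally accompanied this section is not available. As a sketch of what the two linear transformations and the ReLU described above amount to, the feed-forward layer is commonly written as follows. The symbols W_1, b_1, W_2, b_2 correspond to the variables W1, b1, W2, b2 in the example further down; d_model is an assumed name for the input feature dimension and does not appear in the original text.

```latex
\mathrm{FFN}(x) = \max(0,\; x W_1 + b_1)\, W_2 + b_2,
\qquad
W_1 \in \mathbb{R}^{d_{\text{model}} \times d_{ff}},\quad
W_2 \in \mathbb{R}^{d_{ff} \times d_{\text{model}}}
```

Here x is the feature vector of a single time step, so the same weights are reused at every position in the sequence.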

Example

Suppose we have a time-series input X, and the output of the self-attention layer has shape (4, 3): a sequence length of 4, with 3 features per time step.

Input Data

import numpy as np

# Output of the self-attention mechanism (example)
X = np.array([[0.5, 0.6, 0.7], [0.8, 0.9, 1.0], [1.1, 1.2, 1.3], [1.4, 1.5, 1.6]])

First Linear Transformation

Assume the hidden dimension of the feed-forward network is d_ff = 6:

# Initialize weights and biases
W1 = np.random.rand(3, 6)
b1 = np.random.rand(6)

# First linear transformation
X_ff1 = np.dot(X, W1) + b1
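
To make the shapes in this step explicit, here is a minimal sketch. It assumes a seeded `np.random.default_rng` generator in place of `np.random.rand`, purely so the run is reproducible; the name `row0` is introduced here for illustration only.

```python
import numpy as np

# Same toy input as above: 4 time steps, 3 features each.
X = np.array([[0.5, 0.6, 0.7], [0.8, 0.9, 1.0], [1.1, 1.2, 1.3], [1.4, 1.5, 1.6]])

rng = np.random.default_rng(0)  # seeded generator (assumption, for reproducibility)
W1 = rng.random((3, 6))         # shape (input features, d_ff)
b1 = rng.random(6)              # shape (d_ff,)

# (4, 3) @ (3, 6) gives (4, 6); b1 is broadcast across the 4 rows.
X_ff1 = X @ W1 + b1
print(X.shape, W1.shape, b1.shape, X_ff1.shape)

# Each output row depends only on the matching input row (time step).
row0 = X[0] @ W1 + b1
print(np.allclose(X_ff1[0], row0))
```

The bias has one entry per hidden unit, so NumPy broadcasting adds the same bias vector to every time step.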

Non-linear Activation Function

Apply the ReLU activation function:

# ReLU activation function
def relu(x):
    return np.maximum(0, x)

X_relu = relu(X_ff1)
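
One point worth noting about this toy example: `np.random.rand` draws values in [0, 1), and every entry of X is positive, so all pre-activations are positive and ReLU leaves them unchanged. The sketch below checks this and contrasts it with zero-mean weights. Using `standard_normal` is an assumption made here for illustration, not something the original example does.

```python
import numpy as np

def relu(x):
    return np.maximum(0, x)

X = np.array([[0.5, 0.6, 0.7], [0.8, 0.9, 1.0], [1.1, 1.2, 1.3], [1.4, 1.5, 1.6]])
rng = np.random.default_rng(0)  # seeded generator (assumption, for reproducibility)

# Case 1: non-negative weights, as produced by np.random.rand.
W1_pos = rng.random((3, 6))
b1_pos = rng.random(6)
pre_pos = X @ W1_pos + b1_pos
relu_is_identity = np.array_equal(relu(pre_pos), pre_pos)
print("ReLU changes nothing with non-negative weights:", relu_is_identity)

# Case 2: zero-mean weights, so pre-activations can be negative.
W1_std = rng.standard_normal((3, 6))
b1_std = rng.standard_normal(6)
pre_std = X @ W1_std + b1_std
out_std = relu(pre_std)
print("Entries set to zero by ReLU:", int((pre_std < 0).sum()))
```

With weights that can be negative, ReLU zeroes out whichever hidden units have negative pre-activations, which is where the non-linearity comes from.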

Second Linear Transformation

Project the features back to the original dimension of 3, matching the input:

# Initialize weights and biases
W2 = np.random.rand(6, 3)
b2 = np.random.rand(3)

# Second linear transformation
X_ff2 = np.dot(X_relu, W2) + b2
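
The three steps can also be wrapped in a single function. The sketch below uses a hypothetical helper named `feed_forward` (not part of the original example) to check that running the whole sequence through the network gives the same result as running each time step separately.

```python
import numpy as np

def relu(x):
    return np.maximum(0, x)

def feed_forward(x, W1, b1, W2, b2):
    """Linear -> ReLU -> linear, applied along the last axis of x."""
    return relu(x @ W1 + b1) @ W2 + b2

rng = np.random.default_rng(0)  # seeded generator (assumption, for reproducibility)
W1, b1 = rng.standard_normal((3, 6)), rng.standard_normal(6)
W2, b2 = rng.standard_normal((6, 3)), rng.standard_normal(3)

X = np.array([[0.5, 0.6, 0.7], [0.8, 0.9, 1.0], [1.1, 1.2, 1.3], [1.4, 1.5, 1.6]])

# Whole sequence at once versus one time step at a time.
whole = feed_forward(X, W1, b1, W2, b2)
per_step = np.stack([feed_forward(x_t, W1, b1, W2, b2) for x_t in X])

print(whole.shape)
print(np.allclose(whole, per_step))
```

Because no information flows between rows, the feed-forward layer does not mix time steps. In a Transformer that mixing is left to the self-attention layer.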

Complete Code Example

import numpy as np

# Output of the self-attention mechanism (example)
X = np.array([[0.5, 0.6, 0.7], [0.8, 0.9, 1.0], [1.1, 1.2, 1.3], [1.4, 1.5, 1.6]])

# Initialize weights and biases
W1 = np.random.rand(3, 6)
b1 = np.random.rand(6)
W2 = np.random.rand(6, 3)
b2 = np.random.rand(3)

# First linear transformation
X_ff1 = np.dot(X, W1) + b1

# ReLU activation function
def relu(x):
    return np.maximum(0, x)

X_relu = relu(X_ff1)

# Second linear transformation
X_ff2 = np.dot(X_relu, W2) + b2

print("Input X:\n", X)
print("First Layer Output (X_ff1):\n", X_ff1)
print("ReLU Activation Output (X_relu):\n", X_relu)
print("Second Layer Output (X_ff2):\n", X_ff2)
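
Time-series models usually process several sequences at once. As a rough sketch, the same computation extends to a batched input of shape (batch, sequence length, features) without changes, because `@` applies the matrix product over the last axis. The batch size of 2 and the names `ffn` and `X_batch` are assumptions introduced here for illustration.

```python
import numpy as np

def relu(x):
    return np.maximum(0, x)

def ffn(x, W1, b1, W2, b2):
    """Two linear maps with ReLU in between, applied along the last axis."""
    return relu(x @ W1 + b1) @ W2 + b2

rng = np.random.default_rng(0)  # seeded generator (assumption, for reproducibility)
W1, b1 = rng.standard_normal((3, 6)), rng.standard_normal(6)
W2, b2 = rng.standard_normal((6, 3)), rng.standard_normal(3)

# Hypothetical batch: 2 sequences, 4 time steps, 3 features per step.
X_batch = rng.standard_normal((2, 4, 3))

out = ffn(X_batch, W1, b1, W2, b2)
print(out.shape)  # same shape as the input batch

# Each sequence in the batch is processed independently of the others.
single = ffn(X_batch[1], W1, b1, W2, b2)
print(np.allclose(out[1], single))
```

The output has the same shape as the input, which is what allows the layer to be stacked repeatedly inside a Transformer block.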