将注意力机制加入CNN网络中

最新推荐文章于 2024-10-12 12:26:23 发布

xunzhaoreai

最新推荐文章于 2024-10-12 12:26:23 发布

阅读量148

点赞数 3

文章标签： python 深度学习开发语言

本文链接：https://blog.csdn.net/xunzhaoreai/article/details/141421366

版权

import torch
import torch.nn as nn
import torchvision.models as models

# 1. 定义特征提取网络（ResNet）
class FeatureExtractor(nn.Module):
    def __init__(self):
        super(FeatureExtractor, self).__init__()
        self.resnet = models.resnet50(pretrained=True)
        self.resnet = nn.Sequential(*list(self.resnet.children())[:-2])  # 去掉最后的全连接层和池化层

    def forward(self, x):
        return self.resnet(x)  # 返回特征图

# 2. 定义自注意力机制
class SelfAttention(nn.Module):
    def __init__(self, in_channels):
        super(SelfAttention, self).__init__()
        self.query = nn.Conv2d(in_channels, in_channels // 8, kernel_size=1)
        self.key = nn.Conv2d(in_channels, in_channels // 8, kernel_size=1)
        self.value = nn.Conv2d(in_channels, in_channels, kernel_size=1)
        self.gamma = nn.Parameter(torch.zeros(1))

    def forward(self, x):
        batch, channels, height, width = x.size()
        proj_query = self.query(x).view(batch, -1, width * height).permute(0, 2, 1)
        proj_key = self.key(x).view(batch, -1, width * height)
        energy = torch.bmm(proj_query, proj_key)
        attention = nn.Softmax(dim=-1)(energy)
        proj_value = self.value(x).view(batch, -1, width * height)

        out = torch.bmm(proj_value, attention.permute(0, 2, 1))
        out = out.view(batch, channels, height, width)
        
        out = self.gamma * out + x
        return out

# 3. 将特征提取与注意力机制结合
class AttentionModel(nn.Module):
    def __init__(self):
        super(AttentionModel, self).__init__()
        self.feature_extractor = FeatureExtractor()
        self.attention = SelfAttention(in_channels=2048)  # 根据ResNet50的输出通道数

    def forward(self, x):
        features = self.feature_extractor(x)
        attention_features = self.attention(features)
        # 可以进一步接上全连接层或其他处理
        return attention_features

# 4. 实例化模型并进行前向传播
model = AttentionModel()
input_tensor = torch.randn(1, 3, 224, 224)
output = model(input_tensor)
print(output.shape)

在使用CNN网络进行特征提取之后再加入注意力机制是一个合理且常见的策略。这样做可以结合CNN的局部特征提取能力和注意力机制的全局依赖建模能力，进一步提升模型的性能。以下是这样做的优点和一个可能的实现步骤：

优点：

局部特征提取：CNN擅长捕捉图像的局部特征，如边缘、纹理等，能够生成具有空间结构的特征图。
全局依赖建模：注意力机制（如自注意力）擅长捕捉特征图中的全局关系，能够让模型在所有位置之间建立联系，关注更重要的区域或特征。
性能提升：将两者结合后，模型可以更精确地捕捉到重要的特征，同时过滤掉冗余信息，提高分类或检测等任务的准确性。

实现步骤：

CNN特征提取：
- 使用一个预训练的CNN（如ResNet、VGG等）或自定义的卷积层提取输入图像的特征图。
注意力机制：
- 在提取到的特征图上应用注意力机制。可以选择通道注意力、空间注意力或者自注意力等机制。
- 如果使用自注意力机制，可以将其应用在特征图的每个位置上，以计算全局依赖关系。
后续处理：
- 将经过注意力机制处理的特征图输入到后续的网络层，如全连接层或分类头，用于最终的任务（如分类、检测）。