Text Classification Papers and PyTorch Reproductions (3): VDCNN

Young Panda

Categories: pytorch, text classification

Article link: https://blog.csdn.net/qq_28969139/article/details/103646076

Very Deep Convolutional Networks for Text Classification

1. Model

2. Code

import torch
import torch.nn.functional as F
from torch import nn


# Character-level model. Settings noted from the paper:
# embedding_dim=16, SGD, mini-batch=128, init_lr=0.01, momentum=0.9
# Conv weights initialized as in He et al., 2015; temporal batch norm, no dropout.
# The 29-layer variant performs best.
# Max pooling downsamples better than k-max pooling or strided convolution.

class VDCNN(nn.Module):

    def __init__(self):
        super(VDCNN, self).__init__()
        num_embeddings = 5031 + 1  # presumably 5031 characters plus index 0 for padding
        num_classes = 10
        num_layers = 9
        # Conv layers per stage, ordered as the stages run (64, 128, 256, 512 channels),
        # following Table 2 of the paper.
        layers_types = {
            9: [2, 2, 2, 2],
            17: [4, 4, 4, 4],
            29: [10, 10, 4, 4],
            49: [16, 16, 10, 6]
        }
        # Each ConvBlock holds two conv layers, so a stage needs half as many blocks.
        blocks_dist = [n // 2 for n in layers_types[num_layers]]

        self.embed = nn.Embedding(num_embeddings, 16, padding_idx=0)
        self.conv = nn.Conv1d(16, 64, 3, 1, 1)
        self.conv_block1 = nn.Sequential(
            *([ConvBlock(64, 64, 3)] + [ConvBlock(64, 64, 3) for _ in range(blocks_dist[0] - 1)]))

        self.conv_block2 = nn.Sequential(
            *([ConvBlock(64, 128, 3)] + [ConvBlock(128, 128, 3) for _ in range(blocks_dist[1] - 1)]))

        self.conv_block3 = nn.Sequential(
            *([ConvBlock(128, 256, 3)] + [ConvBlock(256, 256, 3) for _ in range(blocks_dist[2] - 1)]))

        self.conv_block4 = nn.Sequential(
            *([ConvBlock(256, 512, 3)] + [ConvBlock(512, 512, 3) for _ in range(blocks_dist[3] - 1)]))

        self.fc = nn.Sequential(
            nn.Linear(4096, 2048),
            nn.LeakyReLU(inplace=True),
            nn.Linear(2048, 2048),
            nn.LeakyReLU(inplace=True),
            nn.Linear(2048, num_classes)
        )

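    # x: (batch, 1024) tensor of character indices (input_length=1024)
    def forward(self, x):
        x = self.embed(x)                      # (batch, 1024, 16)
        x = x.transpose(1, 2).contiguous()     # (batch, 16, 1024), channels first for Conv1d
        x = self.conv(x)                       # (batch, 64, 1024)
        x = self.conv_block1(x)                # (batch, 64, 1024)
        x = F.max_pool1d(x, 3, 2, 1)           # (batch, 64, 512)
        x = self.conv_block2(x)                # (batch, 128, 512)
        x = F.max_pool1d(x, 3, 2, 1)           # (batch, 128, 256)
        x = self.conv_block3(x)                # (batch, 256, 256)
        x = F.max_pool1d(x, 3, 2, 1)           # (batch, 256, 128)
        x = self.conv_block4(x)                # (batch, 512, 128)
        x, _ = x.topk(8, dim=2, sorted=False)  # k-max pooling, k=8: (batch, 512, 8)
        x = x.view(x.size(0), -1).contiguous() # (batch, 4096)
        x = self.fc(x)                         # (batch, num_classes) logits
        return x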


class ConvBlock(nn.Module):
    """Two conv-BN-ReLU layers plus a projected shortcut; sequence length is preserved."""

    def __init__(self, in_channels, out_channels, kernel_size):
        super(ConvBlock, self).__init__()
        # "Same" padding for odd kernel sizes (equals 1 for the kernel_size=3 used above).
        padding = kernel_size // 2
        self.conv1 = nn.Sequential(
            nn.Conv1d(in_channels, out_channels, kernel_size, 1, padding),
            nn.BatchNorm1d(out_channels),
            nn.ReLU(inplace=True)
        )
        self.conv2 = nn.Sequential(
            nn.Conv1d(out_channels, out_channels, kernel_size, 1, padding),
            nn.BatchNorm1d(out_channels),
            nn.ReLU(inplace=True)
        )
        # 1x1 conv so the shortcut matches out_channels before the addition.
        self.shortcut = nn.Sequential(
            nn.Conv1d(in_channels, out_channels, 1),
            nn.BatchNorm1d(out_channels),
            nn.ReLU(inplace=True)
        )

    def forward(self, x):
        y = self.conv1(x)
        y = self.conv2(y)
        x = self.shortcut(x)
        return y + x
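
As a quick sanity check on the `nn.Linear(4096, 2048)` input size, the sketch below traces the sequence length through the three `F.max_pool1d(x, 3, 2, 1)` calls and the final `topk(8)` step, assuming a 1024-character input as noted in the code comment. The helper names `pool_out_len` and `flattened_features` are illustrative and not part of the original code. The length formula is the usual one for 1-D pooling with floor rounding and no dilation.

```python
def pool_out_len(length, kernel_size=3, stride=2, padding=1):
    """Output length of 1-D max pooling (floor rounding, dilation 1)."""
    return (length + 2 * padding - kernel_size) // stride + 1


def flattened_features(input_length=1024, num_pools=3, channels=512, k=8):
    """Trace the length through the pooling steps, then apply k-max pooling.

    The convolutions use kernel 3, stride 1, padding 1, so they leave the
    length unchanged and only the pooling steps matter here.
    """
    length = input_length
    for _ in range(num_pools):
        length = pool_out_len(length)
    if length < k:
        raise ValueError("sequence too short for k-max pooling")
    return length, channels * k


final_len, fc_in = flattened_features()
print(final_len, fc_in)  # 128 4096
```

With the default 1024-character input, 128 positions remain before `topk`, and 512 channels times k=8 gives the 4096 features the first linear layer expects. A much shorter input that leaves fewer than 8 positions would make `topk(8)` fail, which the sketch reports as a `ValueError`.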
