利用LSTM实现NER

最新推荐文章于 2020-11-22 16:27:29 发布

砰！

最新推荐文章于 2020-11-22 16:27:29 发布

阅读量757

点赞数 2

文章标签： python 深度学习

本文链接：https://blog.csdn.net/Harder_14/article/details/109096309

版权

1.数据及库的准备

#!pip -q install trax==1.3.1

import trax 
from trax import layers as tl
import os 
import numpy as np
import pandas as pd


from utils import get_params, get_vocab
import random as rnd

# set random seeds to make this notebook easier to replicate
trax.supervised.trainer_lib.init_random_number_generators(33)

数据的表示和标记（其中B-表示token是实体的开始，I-表示token在实体内部）

数据规模如下：

数据生成器：

def data_generator(batch_size, x, y, pad, shuffle=False, verbose=False):
    '''
      Input: 
        batch_size - integer describing the batch size
        x - list containing sentences where words are represented as integers
        y - list containing tags associated with the sentences
        shuffle - Shuffle the data order
        pad - an integer representing a pad character
        verbose - Print information during runtime
      Output:
        a tuple containing 2 elements:
        X - np.ndarray of dim (batch_size, max_len) of padded sentences
        Y - np.ndarray of dim (batch_size, max_len) of tags associated with the sentences in X
    '''
    
    # count the number of lines in data_lines
    num_lines = len(x)
    
    # create an array with the indexes of data_lines that can be shuffled
    lines_index = [*range(num_lines)]
    
    # sh

最低0.47元/天解锁文章

砰！

关注

2
点赞
踩
4

收藏

觉得还不错? 一键收藏
0
评论
利用LSTM实现NER

1.数据及库的准备#!pip -q install trax==1.3.1import trax from trax import layers as tlimport os import numpy as npimport pandas as pdfrom utils import get_params, get_vocabimport random as rnd# set random seeds to make this notebook easier to replic
复制链接

扫一扫