【Tensorflow】Bi-LSTM文本分类

最新推荐文章于 2024-07-27 22:01:28 发布

theoreoeater

最新推荐文章于 2024-07-27 22:01:28 发布

阅读量4.8k

点赞数

分类专栏： Tensorflow 自然语言处理深度学习文章标签： tensorflow nlp 文本分类

本文链接：https://blog.csdn.net/weixin_39837402/article/details/80605670

版权

深度学习同时被 3 个专栏收录

12 篇文章 0 订阅

订阅专栏

自然语言处理

9 篇文章 0 订阅

订阅专栏

Tensorflow

5 篇文章 0 订阅

订阅专栏

用于训练的计算图

#训练图
train_graph = tf.Graph()
with train_graph.as_default():
    #输入文本维度为[time_step,batch_size,embedding_size]
    encoder_inputs = tf.placeholder(shape=[None,None,input_size], dtype=tf.float32, name='encoder_inputs')
    #文本标签
    text_label = tf.placeholder(shape=(None), dtype=tf.int32)
    #每个文本的序列长度
    text_length = tf.placeholder(shape=(None), dtype=tf.int32)
   
    #前向cell和后向cell，分别加了dropout
    encoder_fw_cell = tf.contrib.rnn.LSTMCell(256)
    encoder_fw_cell = tf.nn.rnn_cell.DropoutWrapper(encoder_fw_cell, output_keep_prob=0.7)
    encoder_bw_cell = tf.contrib.rnn.LSTMCell(256)
    encoder_bw_cell = tf.nn.rnn_cell.DropoutWrapper(encoder_bw_cell, output_keep_prob=0.7)
    #双向LSTM，输出outputs为两个cell的output
    encoder_outputs, encoder_final_state = tf.nn.bidirectional_dynamic_rnn(
        encoder_fw_cell, encoder_bw_cell, encoder_inputs,
        sequence_length=text_length,
        dtype=tf.float32, time_major=True,
    )
    #将两个cell的outputs进行拼接
    encoder_outputs = tf.concat(encoder_outputs,2)
    #全连接层
    fc1 = tf.contrib.layers.linear(encoder_outputs[-1], 256)
    logits_ = tf.contrib.layers.linear(fc1, class_num)
    
    prediction = tf.argmax(logits_, 1)
    cross_entropy = tf.nn.softmax_cross_entropy_with_logits(
        labels=tf.one_hot(text_label, depth=class_num, dtype=tf.float32),
        logits=logits_,
    )
    
    correct_prediction = tf.equal(prediction, tf.argmax(tf.one_hot(text_label, depth=class_num, dtype=tf.float32), 1))
    accuracy = tf.reduce_mean(tf.cast(correct_prediction, tf.float32))
    loss = tf.reduce_mean(cross_entropy)
    train_op = tf.train.AdamOptimizer().minimize(loss)
    
    saver = tf.train.Saver()