Tensorflow2.0基础-笔记-文本序列-全连接神经网络

最新推荐文章于 2023-08-15 10:34:05 发布

二流子学程序

最新推荐文章于 2023-08-15 10:34:05 发布

阅读量217

点赞数

分类专栏： tensorflow2.0 文章标签： tensorflow nlp

本文链接：https://blog.csdn.net/qq_39329902/article/details/119791132

版权

tensorflow2.0 专栏收录该内容

13 篇文章 3 订阅

订阅专栏

import tensorflow as tf
from tensorflow import keras
from tensorflow.keras import layers
import matplotlib.pyplot as plt
%matplotlib inline

data=keras.datasets.imdb #导入电影评分数据集

max_word=10000
(x_train,y_train),(x_test,y_test)=data.load_data(num_words=max_word)#考虑编码中前10000个单词
x_train.shape
x_train[0] #是一个整数的序列,一个整数代表一个单词.
[len(x)for x in x_train] #发现文本长度不相等
x_train=keras.preprocessing.sequence.pad_sequences(x_train,300) #将长度填充到300
x_test=keras.preprocessing.sequence.pad_sequences(x_test,300)
[len(x)for x in x_train] 

#建立模型
model=keras.models.Sequential()
model.add(layers.Embedding(input_dim=10000,output_dim=50,input_length=300)) #嵌入层将正整数（下标）转换为具有固定大小的向量
model.add(layers.Flatten())
model.add(layers.Dense(128,activation='relu'))
model.add(layers.Dense(64,activation='relu'))
model.add(layers.Dense(1,activation='sigmoid')) #标签为正面评价和负面评价
model.summary()

model.compile(optimizer=keras.optimizers.Adam(lr=0.001),
             loss='binary_crossentropy',
             metrics=['acc'])

history=model.fit(x_train,y_train,epochs=15,batch_size=256,validation_data=(x_test,y_test))

plt.plot( history.epoch, history.history.get('acc'),label='acc')
plt.plot( history.epoch, history.history.get('val_acc'),label='val_acc')
plt.legend()

plt.plot( history.epoch, history.history.get('loss'),label='loss')
plt.plot( history.epoch, history.history.get('val_loss'),label='val_loss')
plt.legend()

注意Embedding层只能用作模型中的第一层。

Keras中的Embedding层本质上是一个对输入数据降维过程，其中Embedding函数有三个容易混淆的参数，分别是：