08. Predicting movie review sentiment with Keras

Thames_h

Tags: neural networks, python

Article link: https://blog.csdn.net/Thames_h/article/details/110545942


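import tensorflow as tf
from tensorflow import keras
import numpy as np
from matplotlib import pyplot as plt
# Import the submodules from tensorflow.keras so they match the `keras` name imported above
from tensorflow.keras import layers
from tensorflow.keras import regularizers
# Jupyter notebook magic: render plots inline
%matplotlib inline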

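data = keras.datasets.imdb
max_word = 10000   # keep only the 10,000 most frequent words
(x_train, y_train), (x_test, y_test) = data.load_data(num_words=max_word)  # load the dataset
word_index = data.get_word_index()  # mapping from word to integer index
index_word = dict((value, key) for key, value in word_index.items())  # invert the mapping: index -> word
# Decode the first review back into words; encoded indices are offset by 3
[index_word.get(index - 3, '?') for index in x_train[0]]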
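The `index - 3` in the decoding step is easy to miss. With the default `load_data` settings, indices 0, 1 and 2 are reserved (padding, start of sequence, unknown word), so every real word index is shifted up by 3. The sketch below illustrates this on a made-up three-word vocabulary; `toy_word_index` and `decode_review` are hypothetical names used only for illustration and are not part of the original post.

```python
# Hypothetical miniature vocabulary in the same format as get_word_index():
# word -> rank, where rank 1 is the most frequent word.
toy_word_index = {"the": 1, "movie": 2, "great": 3}

# Invert the mapping so an integer rank can be looked up as a word.
toy_index_word = {rank: word for word, rank in toy_word_index.items()}


def decode_review(encoded, index_word, offset=3):
    """Turn a list of encoded indices back into a space-separated string."""
    # Encoded indices are shifted by `offset`; the reserved values 0, 1 and 2
    # have no entry in the vocabulary, so they decode to '?'.
    return " ".join(index_word.get(i - offset, "?") for i in encoded)


# 1 = start marker, 4 -> "the", 5 -> "movie", 2 = unknown word, 6 -> "great"
print(decode_review([1, 4, 5, 2, 6], toy_index_word))  # ? the movie ? great
```

On the real data the same helper could be called as `decode_review(x_train[0], index_word)`, provided it runs before `x_train` is vectorized.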

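Next, vectorize the text. There are two options. The first is one-hot encoding, which is not a good fit here because it would produce one 10000-dimensional vector per word rather than per review.
The second is k-hot (multi-hot) encoding: each review becomes a single 1×10000 vector of zeros, and the position of every word that appears in the review is set to 1.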

def k_hot(seqs, dim=10000):
    result = np.zeros((len(seqs), dim))   # all-zero 2D array, one row per review
    for i, seq in enumerate(seqs):        # set the position of every word in the review to 1
        result[i, seq] = 1
    return result

x_train = k_hot(x_train)
x_test = k_hot(x_test)
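To make the encoding concrete, here is a minimal sketch that applies the same function to two made-up "reviews" over a vocabulary of only 5 indices. The toy input and the small `dim` are assumptions chosen for readability. Note that a word occurring twice still yields a single 1, so word counts and word order are discarded.

```python
import numpy as np


def k_hot(seqs, dim=10000):
    """Same function as above, repeated so this example runs on its own."""
    result = np.zeros((len(seqs), dim))
    for i, seq in enumerate(seqs):
        result[i, seq] = 1  # fancy indexing sets all listed positions at once
    return result


# Two toy reviews; index 3 appears twice in the second one.
toy_reviews = [[1, 3], [0, 3, 3]]
encoded = k_hot(toy_reviews, dim=5)

print(encoded.shape)  # (2, 5)
print(encoded)
# [[0. 1. 0. 1. 0.]
#  [1. 0. 0. 1. 0.]]
```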

Build the neural network model

model = keras.Sequential()
model.add(layers.Dense(32, input_dim=10000, activation='relu'))
model.add(layers.Dense(32, activation='relu'))
model.add(layers.Dense(1, activation='sigmoid'))
model.compile(optimizer='adam',
              loss='binary_crossentropy',
              metrics=['acc'])

# Note: the test set doubles as the validation data here
history = model.fit(x_train, y_train, epochs=15, batch_size=256, validation_data=(x_test, y_test))

View the results

plt.plot(history.epoch, history.history.get('loss'), c='r', label='loss')
plt.plot(history.epoch, history.history.get('val_loss'), c='b', label='val_loss')
plt.legend()
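The curves can also be read numerically. The helper below is a small sketch that reports the epoch with the lowest validation loss from a dictionary shaped like `history.history`. The function name `best_epoch` is hypothetical, and the numbers in `toy_history` are invented purely to exercise the function; they are not results from this model.

```python
def best_epoch(history_dict, key="val_loss"):
    """Return the zero-based epoch index at which `key` is smallest."""
    values = history_dict[key]
    return min(range(len(values)), key=values.__getitem__)


# Made-up values for illustration only (NOT measured results).
toy_history = {
    "loss": [0.60, 0.40, 0.25, 0.15, 0.08],
    "val_loss": [0.55, 0.42, 0.38, 0.41, 0.47],
}

print(best_epoch(toy_history))  # 2
```

After training, the same call would be `best_epoch(history.history)`. Epochs beyond the returned index are the ones where validation loss is no longer improving.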

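[Figure: training loss (red) and validation loss (blue) per epoch]
The training loss keeps falling while the validation loss (computed on the test set) climbs, so the model is badly overfitting! Think about how you would adjust it.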
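As one possible direction, and not necessarily the adjustment the author has in mind, the sketch below combines three common remedies for overfitting: a smaller network, L2 weight regularization with dropout, and early stopping. The function name `build_regularized_model` and all hyperparameter values (16 units, L2 factor 0.001, dropout rate 0.5, patience 2) are illustrative assumptions that would need tuning.

```python
from tensorflow import keras
from tensorflow.keras import layers, regularizers


def build_regularized_model(input_dim=10000, units=16, l2=0.001, dropout=0.5):
    """Smaller dense network with L2 weight penalties and dropout (assumed values)."""
    model = keras.Sequential([
        keras.Input(shape=(input_dim,)),
        # The L2 penalty discourages large weights; dropout randomly zeroes activations.
        layers.Dense(units, activation='relu',
                     kernel_regularizer=regularizers.l2(l2)),
        layers.Dropout(dropout),
        layers.Dense(units, activation='relu',
                     kernel_regularizer=regularizers.l2(l2)),
        layers.Dropout(dropout),
        layers.Dense(1, activation='sigmoid'),
    ])
    model.compile(optimizer='adam',
                  loss='binary_crossentropy',
                  metrics=['acc'])
    return model


# Stop once val_loss has not improved for 2 epochs and restore the best weights.
early_stop = keras.callbacks.EarlyStopping(
    monitor='val_loss', patience=2, restore_best_weights=True)
```

The model could then be trained with `build_regularized_model().fit(x_train, y_train, epochs=15, batch_size=256, validation_data=(x_test, y_test), callbacks=[early_stop])` and the loss curves compared against the plot above.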
