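Walkthrough and Debugging of the Official Text Generation Tutorial
Official RNN tutorial: https://www.tensorflow.org/tutorials/text/text_generation
The dataset is taken from the collected works of Shakespeare. Download link: https://storage.googleapis.com/download.tensorflow.org/data/shakespeare.txt
Basic environment:
Python 3, TensorFlow, Keras.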
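Before running the tutorial code, it can help to confirm which versions are actually installed. The sketch below is not part of the official tutorial. The helper name `installed_version` is hypothetical, and the code assumes Python 3.8 or later because it relies on the standard-library `importlib.metadata` module. It reports a missing package instead of raising an error.

```python
import sys
from importlib import metadata


def installed_version(package_name):
    """Return the installed version string of a package, or None if it is absent."""
    try:
        return metadata.version(package_name)
    except metadata.PackageNotFoundError:
        return None


# Report the interpreter version and the packages this walkthrough relies on.
print("Python:", sys.version.split()[0])
for name in ("tensorflow", "numpy"):
    print(name + ":", installed_version(name) or "not installed")
```

If a package shows up as "not installed", install it before continuing with the imports that follow.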
Imports:
from __future__ import absolute_import, division, print_function, unicode_literals
import tensorflow as tf
import numpy as np
import os
import time
Download and read the dataset:
path_to_file = tf.keras.utils.get_file('shakespeare.txt', 'https://storage.googleapis.com/download.tensorflow.org/data/shakespeare.txt')
# Read, then decode for py2 compat.
text = open(path_to_file, 'rb').read().decode(encoding='utf-8')
# length of text is the number of characters in it
print('Length of text: {} characters'.format(len(text)))
# Take a look at the first 250 characters in text
print(text[:250])
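The read-then-decode step can also be tried without a network connection. The following is a minimal sketch under stated assumptions: the sample string and the temporary file are made up for illustration and stand in for shakespeare.txt. Only the pattern of opening the file in 'rb' mode, reading the bytes, and decoding them as UTF-8 comes from the code above.

```python
import os
import tempfile

# Hypothetical sample text standing in for the downloaded dataset.
sample = "First Citizen:\nBefore we proceed any further, hear me speak.\n"

# Write the sample to a temporary file as UTF-8 bytes.
with tempfile.NamedTemporaryFile(delete=False, suffix=".txt") as tmp:
    tmp.write(sample.encode("utf-8"))
    sample_path = tmp.name

# Same pattern as the tutorial code: read raw bytes, then decode them to str.
with open(sample_path, "rb") as f:
    sample_text = f.read().decode(encoding="utf-8")
os.remove(sample_path)

# After decoding, len() counts characters rather than bytes.
print("Length of text: {} characters".format(len(sample_text)))
print(sample_text[:14])
```

Reading in binary mode and decoding explicitly keeps the result independent of the platform's default text encoding, which is one reason to keep this pattern even on Python 3.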
# The unique characters in the file
vocab = sorted(set(text