Wikitext-103-数据集

最新推荐文章于 2024-10-28 11:22:33 发布

不务正业的猿

最新推荐文章于 2024-10-28 11:22:33 发布

阅读量3.7k

点赞数 1

分类专栏：下载数据集文章标签：数据集 WikiText 下载语料库

本文链接：https://blog.csdn.net/ispeasant/article/details/108140333

版权

下载同时被 2 个专栏收录

198 篇文章 ¥29.90 ¥99.00

订阅专栏

数据集

169 篇文章

订阅专栏

本数据集是超过 1 亿个语句的数据合集，全部从维基百科的 Good 与 Featured 文章中提炼出来。广泛用于语言建模，当中包括 fastai 库和 ULMFiT 算法中经常用到的预训练模型。

Recent neural network sequence models with softmax classifiers have achieved their best language modeling performance only with very large hidden states and large vocabularies. Even then they struggle to predict rare or unseen words even if the context makes the prediction unambiguous. We introduce the pointer sentinel mixture architecture for neural sequence models which has the ability to either reproduce a word from the recent context or produce a word from a standard softmax classifier. Our pointer sentinel-LSTM model achieves state of the art language modeling performance on the Penn Treebank (70.9 perplexity) while using far fewer parameters than a standard sof

了解本专栏