Text Summarization: CNN/DailyMail Raw Dataset
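The raw CNN/DailyMail dataset for text summarization.
The archive contains cnn_stories.tgz and dailymail_stories.tgz.
It can be used for both extractive summarization and abstractive summarization tasks. It is shared here to make the dataset easier to obtain for researchers in China.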
For technical details, see the blog post: https://blog.csdn.net/muyao987/article/details/104949367
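As a rough illustration of how the two archives might be read, here is a minimal Python sketch. It assumes the commonly distributed layout of this dataset, in which each archive holds one `.story` text file per article and the reference summary sentences follow `@highlight` marker lines at the end of the file. The function names `split_story` and `iter_stories` are hypothetical, and the layout should be checked against the actual download.

```python
import tarfile


def split_story(text):
    """Split raw .story text into (article, highlights).

    Assumption: every summary sentence is preceded by a line that
    contains only the marker "@highlight".
    """
    article_lines = []
    highlights = []
    next_is_highlight = False
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank separator lines
        if line == "@highlight":
            next_is_highlight = True
        elif next_is_highlight:
            highlights.append(line)
            next_is_highlight = False
        else:
            article_lines.append(line)
    return " ".join(article_lines), highlights


def iter_stories(tgz_path):
    """Yield (member_name, (article, highlights)) for each .story file.

    Assumption: tgz_path points at cnn_stories.tgz or
    dailymail_stories.tgz and the members are UTF-8 text files.
    """
    with tarfile.open(tgz_path, "r:gz") as archive:
        for member in archive:
            if member.isfile() and member.name.endswith(".story"):
                raw = archive.extractfile(member).read().decode("utf-8")
                yield member.name, split_story(raw)
```

For abstractive summarization the highlights are typically joined into one target summary. For extractive summarization they can serve as the reference against which selected article sentences are scored.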
[PDF] Neural Network Methods in Natural Language Processing, original English edition (Chinese title: 基于深度学习的自然语言处理)
Neural networks are a family of powerful machine learning models. This book focuses on the application of neural network models to natural language data. The first half of the book (Parts I and II) covers the basics of supervised machine learning and feed-forward neural networks, the basics of working with machine learning over language data, and the use of vector-based rather than symbolic representations for words. It also covers the computation-graph abstraction, which allows one to easily define and train arbitrary neural networks, and is the basis behind the design of contemporary neural network software libraries.
The second part of the book (Parts III and IV) introduces more specialized neural network architectures, including 1D convolutional neural networks, recurrent neural networks, conditioned-generation models, and attention-based models. These architectures and techniques are the driving force behind state-of-the-art algorithms for machine translation, syntactic parsing, and many other applications. Finally, we also discuss tree-shaped networks, structured prediction, and the prospects of multi-task learning.
Hillary Clinton's Emails (natural language processing corpus)
Hillary Clinton's emails: nearly 7,000 pages of Clinton's emails, compiled as a corpus for machine learning and natural language processing.
MFC类库详解.chm (MFC Class Library Explained)
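A detailed reference to the MFC class library. I used it often while working on an airplane shooter game project. It is quite good and helpful for MFC programming in Visual Studio (VS).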