- 博客(1)
- 资源 (4)
- 收藏
- 关注
原创 基于LDA模型的邮件主题分类
资源地址:希拉里邮件7000封左右,Emails.csv 运行环境:windows10(64bit) + python3.6 + pycharm Python源代码: import warnings warnings.filterwarnings(action='ignore', category=UserWarning, module='gensim') import pandas as...
2018-07-19 11:54:03 2391
文本摘要 CNN/DailyMail 原始数据集
文本摘要 CNN/DailyMail 原始数据集。
压缩包内含 cnn_stories.tgz 和 dailymail_stories.tgz 。
可用于抽取式摘要(Extractive Summarization)任务以及生成式摘要(Abstractive Summarization)旨在方便国内的研究者们获取该数据集。
技术细节可参考博文:https://blog.csdn.net/muyao987/article/details/104949367
2022-04-15
[PDF]Neural Network Methods in Natural Language Processing 基于深度学习的自然语言处理英文原版
Neural networks are a family of powerful machine learning models. This book focuses on the application of neural network models to natural language data. The first half of the book (Parts I and II) covers the basics of supervised machine learning and feed-forward neural networks, the basics of working with machine learning over language data, and the use of vector-based rather than symbolic representations for words. It also covers the computation-graph abstraction, which allows to easily define and train arbitrary neural networks, and is the basis behind the design of contemporary neural network software libraries.
The second part of the book (Parts III and IV) introduces more specialized neural network architectures, including 1D convolutional neural networks, recurrent neural networks, conditioned-generation models, and attention-based models. These architectures and techniques are the driving force behind state-of-the-art algorithms for machine translation, syntactic parsing, and many other applications. Finally, we also discuss tree-shaped networks, structured prediction, and the prospects of multi-task learning.
2018-11-23
希拉里 克林顿 邮件 自然语言处理 Hillary Clinton's Emails
希拉里克林顿的电子邮件,整理了近7,000页克林顿的电子邮件,用作机器学习自然语言处理的语料。
2018-07-19
空空如也
TA创建的收藏夹 TA关注的收藏夹
TA关注的人