- 博客(1)
- 资源 (7)
- 问答 (4)
- 收藏
- 关注
维基百科中文语料word2vec训练后结果
中文维基百科语料库,将其转换为文本文件后,进行繁体字转换为简体字,字符集转换,分词,然后训练得到模型以及向量。由于文件上传的大小限制是60MB,而训练后的所有文件大小有1G以上,所以这里只提供了下载链接,地址在网盘中。使用python中的gensim包进行训练得到的,运行时间较长,纯粹的维基百科中文语料训练后的结果,拿去可以直接使用。
2017-06-03
Stanford typed dependencies manual
Revised for the Stanford Parser v. 3.7.0 in September 2016
Stanford parser的类型依赖说明
2017-02-27
Natural Language Processing with Python
This book offers a highly accessible introduction to Natural Language Processing, the field that underpins a variety of language technologies, ranging from predictive text and email filtering to automatic summarization and translation. With Natural Language Processing with Python, you'll learn how to write Python programs to work with large collections of unstructured text. You'll access richly-annotated datasets using a comprehensive range of linguistic data structures. And you'll understand the main algorithms for analyzing the content and structure of written communication., Packed with examples and exercises, Natural Language Processing with Python will help you:, * Extract information from unstructured text, to guess the topic or identify 'named entities', * Analyze linguistic structure in text, including parsing and semantic analysis, * Access popular linguistic databases, including WordNet and treebanks, * Integrate techniques drawn from fields as diverse as linguistics and artificial intelligence, Perfect for individual study, or as a classroom and workshop textbook, this book will help you gain practical skills in Natural Language Processing using the Python programming language and the Natural Language Toolkit (NLTK) open source library., If you're interested in developing Web applications, analyzing multilingual news sources, documenting endangered languages, or if you are simply curious to have a programmer's perspective on how human language works, you will find Natural Language Processing with Python both fascinating and immensely useful.
2017-02-26
wiki.zh.text.model
中文维基百科语料库,将其转换为文本文件后,进行繁体字转换为简体字,字符集转换,分词,然后训练得到模型以及向量。由于文件上传的大小限制是60MB,我这里的压缩包中有model,然后对向量提供了下载链接。使用python中的gensim包进行训练得到的,运行时间较长,希望对你们有帮助。
2017-02-23
APP技术解决方案,安卓高手与IOS高手看过来
2016-03-24
web service访问数量控制
2015-07-20
在java web工程中·,利用ireport生成的jasper文件,导出pdf文件
2014-01-14
http://www.cheersmug.com/网站上文字的特效是怎样制作的?
2013-10-23
TA创建的收藏夹 TA关注的收藏夹
TA关注的人