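word2vec results trained on the Chinese Wikipedia corpus
The Chinese Wikipedia corpus was converted to plain text, converted from Traditional to Simplified Chinese, normalized to a single character encoding, and segmented into words; a model and word vectors were then trained on it. Because uploads are limited to 60 MB and the trained files total more than 1 GB, only a download link is provided here; it points to a network drive. Training was done with the gensim package for Python and took a long time to run. The result is trained purely on the Chinese Wikipedia corpus and can be used as is.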
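The steps described above (Traditional-to-Simplified conversion, word segmentation, training with gensim) might look roughly like the sketch below. It is a minimal illustration under stated assumptions, not the uploader's actual script: the three-entry character map is a hypothetical stand-in for a real converter such as OpenCC, the input lines are assumed to be already segmented with spaces (a segmenter such as jieba would do that in practice), the hyperparameters are placeholders, and the training function assumes gensim 4.x parameter names.

```python
# Hypothetical stand-in for a real Traditional-to-Simplified converter (e.g. OpenCC).
T2S_SAMPLE = str.maketrans({"語": "语", "學": "学", "數": "数"})


def to_simplified(text):
    """Map the few Traditional characters listed in T2S_SAMPLE to Simplified ones."""
    return text.translate(T2S_SAMPLE)


def prepare_sentences(lines):
    """Turn pre-segmented lines (tokens separated by spaces) into lists of tokens."""
    sentences = []
    for line in lines:
        tokens = to_simplified(line).split()
        if tokens:  # skip blank lines
            sentences.append(tokens)
    return sentences


def train_word2vec(sentences, model_path, vector_path):
    """Train and save a model. Requires gensim; 4.x parameter names are assumed."""
    # Imported lazily so the helpers above work without gensim installed.
    from gensim.models import Word2Vec

    # Placeholder hyperparameters, not the values used for the uploaded model.
    model = Word2Vec(sentences, vector_size=100, window=5, min_count=1, workers=2)
    model.save(model_path)  # full model
    model.wv.save_word2vec_format(vector_path, binary=False)  # plain-text vectors
    return model


sample = ["數學 是 語言", "", "自然 語言 學"]
print(prepare_sentences(sample))
```

On a real corpus, `prepare_sentences` would be replaced by a streaming reader (gensim provides `LineSentence` for one-sentence-per-line files) so that the whole dump does not have to fit in memory.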
Stanford typed dependencies manual
Revised for the Stanford Parser v. 3.7.0 in September 2016
Manual describing the typed dependencies of the Stanford Parser
Natural Language Processing with Python
This book offers a highly accessible introduction to Natural Language Processing, the field that underpins a variety of language technologies, ranging from predictive text and email filtering to automatic summarization and translation. With Natural Language Processing with Python, you'll learn how to write Python programs to work with large collections of unstructured text. You'll access richly-annotated datasets using a comprehensive range of linguistic data structures. And you'll understand the main algorithms for analyzing the content and structure of written communication.
Packed with examples and exercises, Natural Language Processing with Python will help you:
* Extract information from unstructured text, to guess the topic or identify 'named entities'
* Analyze linguistic structure in text, including parsing and semantic analysis
* Access popular linguistic databases, including WordNet and treebanks
* Integrate techniques drawn from fields as diverse as linguistics and artificial intelligence
Perfect for individual study, or as a classroom and workshop textbook, this book will help you gain practical skills in Natural Language Processing using the Python programming language and the Natural Language Toolkit (NLTK) open source library.
If you're interested in developing Web applications, analyzing multilingual news sources, documenting endangered languages, or if you are simply curious to have a programmer's perspective on how human language works, you will find Natural Language Processing with Python both fascinating and immensely useful.
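Python natural language processing
Natural Language Processing with Python, Chinese edition, as a text-based PDF. The book is provided for study and reference only; please delete it soon after downloading, and buy the original book to support the legitimate edition.
Natural language processing with Java (English)
An e-book on natural language processing with Java. Text-based PDF, not a scanned copy.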
wiki.zh.text.model
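The Chinese Wikipedia corpus was converted to plain text, converted from Traditional to Simplified Chinese, normalized to a single character encoding, and segmented into words; a model and word vectors were then trained on it. Because uploads are limited to 60 MB, the archive here contains the model, and a download link is provided for the vectors. Training was done with the gensim package for Python and took a long time to run. I hope you find it helpful.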
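If the downloadable vector file is in the plain-text word2vec format (an assumption; the listing does not say which format was used), it can be inspected without gensim: the first line holds the vocabulary size and the dimension, and each following line holds a word followed by its components. The sketch below parses such content and compares two words by cosine similarity. The three tiny vectors are made-up sample data, not values from the actual model.

```python
import io
import math


def load_text_vectors(stream):
    """Read word2vec text format: a 'count dim' header, then 'word v1 ... vdim' lines."""
    _count, dim = (int(x) for x in stream.readline().split())
    vectors = {}
    for line in stream:
        parts = line.rstrip().split(" ")
        if len(parts) != dim + 1:
            continue  # skip blank or malformed lines
        vectors[parts[0]] = [float(x) for x in parts[1:]]
    return vectors


def cosine(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


# Made-up sample data in the assumed format (3 words, 2 dimensions).
SAMPLE = "3 2\n语言 1.0 0.0\n文字 0.8 0.6\n数学 0.0 1.0\n"
vectors = load_text_vectors(io.StringIO(SAMPLE))
print(round(cosine(vectors["语言"], vectors["文字"]), 3))
```

For a real file, the `io.StringIO` object would be replaced by `open(path, encoding="utf-8")`, where `path` is wherever the vectors were saved. For a model of this size, gensim's `KeyedVectors.load_word2vec_format` is likely the more practical route, since a pure-Python dictionary of lists uses far more memory.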
phantomjs-2.1.1-linux-x86_64.tar.bz2
For installing PhantomJS on Ubuntu (or another Linux distribution). This archive was downloaded from the official website.