分享一本文本挖掘的书

好不容易从国外找到的,有需要的可以下来看看。

The information age has made it easy to store large amounts of data. The proliferation
of documents available on the Web, on corporate intranets, on news wires, and
elsewhere is overwhelming. However, although the amount of data available to us
is constantly increasing, our ability to absorb and process this information remains
constant. Search engines only exacerbate the problem by making more and more
documents available in a matter of a few key strokes.
Text mining is a new and exciting research area that tries to solve the information
overload problem by using techniques from data mining, machine learning, natural
language processing (NLP), information retrieval (IR), and knowledge management.
Text mining involves the preprocessing of document collections (text categorization,
information extraction, term extraction), the storage of the intermediate representations,
the techniques to analyze these intermediate representations (such as distribution
analysis, clustering, trend analysis, and association rules), and visualization of
the results.
This book presents a general theory of text mining along with the main techniques
behind it.We offer a generalized architecture for text mining and outline the
algorithms and data structures typically used by text mining systems.
The book is aimed at the advanced undergraduate students, graduate students,
academic researchers, and professional practitioners interested in complete coverage
of the text mining field. We have included all the topics critical to people
who plan to develop text mining systems or to use them. In particular, we have
covered preprocessing techniques such as text categorization, text clustering, and
information extraction and analysis techniques such as association rules and link
analysis.
The book tries to blend together theory and practice; we have attempted to
provide many real-life scenarios that show how the different techniques are used in
practice.When writing the book we tried to make it as self-contained as possible and
have compiled a comprehensive bibliography for each topic so that the reader can
expand his or her knowledge accordingly.
x

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值