Taming Text

下载地址
数据清洗 大数据
It is no secret that the world is drowning in text and data. This causes real problems for everyday users who need to make sense of all the information available, and software engineers who want to make their text-based applications more useful and user-friendly. Whether you’re building a search engine for a corporate website, automatically organizing email, or extracting important nuggets of information from the news, dealing with unstructured text can be a daunting task.

Taming Text is a hands-on, example-driven guide to working with unstructured text in the context of real-world applications. This book explores how to automatically organize text using approaches such as full-text search, proper name recognition, clustering, tagging, information extraction, and summarization. The book guides you through examples illustrating each of these topics, as well as the foundations upon which they are bulit.

世界淹没在文字和数据中已不是秘密。这给需要理解所有可用信息的日常用户和希望使其基于文本的应用程序更加有用和用户友好的软件工程师带来了真正的问题。无论你是为公司网站建立搜索引擎,自动组织电子邮件,还是从新闻中提取重要信息,处理非结构化文本都是一项艰巨的任务。

Taming Text是一个实际操作的示例驱动的指南,用于在实际应用程序的上下文中处理非结构化文本。这本书探索如何使用诸如全文搜索、专有名称识别、聚类、标记、信息提取和摘要等方法自动组织文本。这本书指导你举例说明每一个主题,以及它们的基础。

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值