《Hands-On Machine Learning》学习笔记-1.2 为什么要使用机器学习

第一章 机器学习概述

为什么要使用机器学习

思考一下,如果使用传统的软件编程技术,该如何编写一个垃圾邮件过滤器?:

  1. 首先你需要观察典型的垃圾邮件都有哪些特征。也许你会注意到,一些特定的单词或者短语在标题中出现的较多。或者你还会在发件人,邮件内容中发现的一些特定的模式。

  2. 你要写一个检测算法,对你所发现的所有特征一一检验,如果在邮件中检验到了一定数量的特性,则会把它标记为垃圾邮件。

  3. 你还需要测试你的代码,重复步骤1和步骤2,直到它表现的足够好。

传统的解决方案图1-1 传统的解决方案

尽管问题并不复杂,你的程序还是会变成一长串复杂的规则,极难维护。

与之对应的,一个基于机器学习的垃圾邮件分类器程序能够通过探测垃圾邮件相比较于正常邮件频繁出现的特定特征,从而能够自动学习哪些单词和短语往往预示着垃圾邮件。这样的程序要简短,而且易于维护,而且一般说来会更加准确。

  • 0
    点赞
  • 1
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
Hands-On Data Science and Python Machine Learning by Frank Kane English | 31 July 2017 | ISBN: 1787280748 | ASIN: B072QBVXGH | 420 Pages | AZW3 | 7.21 MB Key Features Take your first steps in the world of data science by understanding the tools and techniques of data analysis Train efficient Machine Learning models in Python using the supervised and unsupervised learning methods Learn how to use Apache Spark for processing Big Data efficiently Book Description Join Frank Kane, who worked on Amazon and IMDb's machine learning algorithms, as he guides you on your first steps into the world of data science. Hands-On Data Science and Python Machine Learning gives you the tools that you need to understand and explore the core topics in the field, and the confidence and practice to build and analyze your own machine learning models. With the help of interesting and easy-to-follow practical examples, Frank Kane explains potentially complex topics such as Bayesian methods and K-means clustering in a way that anybody can understand them. Based on Frank's successful data science course, Hands-On Data Science and Python Machine Learning empowers you to conduct data analysis and perform efficient machine learning using Python. Let Frank help you unearth the value in your data using the various data mining and data analysis techniques available in Python, and to develop efficient predictive models to predict future results. You will also learn how to perform large-scale machine learning on Big Data using Apache Spark. The book covers preparing your data for analysis, training machine learning models, and visualizing the final data analysis. What you will learn Learn how to clean your data and ready it for analysis Implement the popular clustering and regression methods in Python Train efficient machine learning models using decision trees and random forests Visualize the results of your analysis using Python's Matplotlib library Use Apache Spark's MLlib package to perform

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值