【笔记】三张图读懂机器学习:基本概念、五大流派与九种常见算法

【笔记】三张图读懂机器学习:基本概念、五大流派与九种常见算法

原文链接(附有完整翻译):https://zhuanlan.zhihu.com/p/26512893

机器学习的常见应用:语音识别、图像检测、机器翻译、风格迁移等技术。

Chapter 1: A look at Machine learning

1.What is it?

Machines can “learn” by analyzing large amounts of data.

2.How does machine learning relate to artificial intelligence?

Machine learning is a category of research and algorithms focused on finding patterns in data and using those patterns to make predictions. Machine learning falls within the artificial intelligence (AI) umbrella, which in turn intersects with the broader field of knowledge discovery and data mining.


intersect: 贯穿;横贯

3.How machine learning works?

  1. Select data

    Split the data you have into three groups: training data, validation data, and test data.

  2. Model data

    Use the training data to build the model using the relevant features.

  3. Validate model

    Assess the model with your validation data.

  4. Test model

    Check performance of the validated model with your test data.

  5. Use the model

    Deploy the fully trained model to make predictions on new data.

  6. Tune model(调优模型)

    Improve performance of the algorithm with more data, different features, or adjusted parameters.


validate: 确证;验证

deploy: 部署;利用

tune: 曲调;调整

4.How machine learning fits in?

  1. Traditional programming

    The software engineer writes a program that solves a problem.

    Data => Software engineer writes a procedure that tells the machine what to do to solve the problem. => Computer follows the procedure and generates a result.

  2. Statistics

    An analyst compares the relationships of variables.

  3. Machine learning

    A data scientist uses a training data set to teach the computer what to do, and the system carries out the tasks.

    Big data => The machine learns to classify with the help of a training data set and tunes a specific alorithm to the desired classification. => The computer learns to identify relationships, trends, and patterns in the data.

  4. Intelligent apps

    Intelligent apps leverage the outputs of AI, as in this precision farming example that uses drone-based data collection.


carry out: 执行;履行;进行

leverage: use (something) to maximum advantage. 最大限度地利用,最优化使用

drone: 无人机

5.Machine learning in practice

For example:

  1. Rapid 3D mapping and modeling
  2. Enhanced profiling to mitigate risks
  3. Predicting the top performers

profiling: (对个人心理、行为特征的)剖析研究(以评定或预测其在某领域潜力或认识某一种人)

mitigate: 减轻;使缓和

Chapter 2: A look at Machine learning evolution

For decades, individual “tribes” of artificial intelligence researchers have vied with one another for dominance. Is the time ripe now for tribes to collaborate? They may forced to, as collaboration and algorithm blending are the only ways to reach true artificial general intelligence (AGI). Here’s a look back at how machine learning methods have evolved and what the future may look like.


tribe: 部落;流派

vie: 竞争;相争

ripe: 成熟的;适合……的

blend: 使混合;使交融

What are the five tribes?

  • Symbolists

    • Use symbols, rules, and logic to represent knowledge and draw logical inference
    • Favored algorithm: Rules and decision trees, inverse deduction
  • Bayesians

    • Assess the likelihood of occurrence for probabilisitic inference
    • Favored algorithm: Naive Bayes (朴素贝叶斯) or Markov (马尔可夫)
  • Connectionists

    • Recognize and generalize patterns dynamically with matrices of probabilistic, weighted neurons.
    • Favored algorithm: Neural networks, backpropagation
  • Evolutionaries

    • Generate variations and then assess the fitness of each for a given purpose
    • Favored algorithm: Genetic programs (遗传算法)
  • Analogizers

    • Optimize a functionin light of constraints (“going as high as you can while staying on the road”)
    • Favored algorithm: Support vectors

inference: 推理

likelihood: 可能性

occurrence: 发生的事;事件;发生频率;存在

probabilisitic: 盖然性的;可能性的; 概率的

matrices: matrix的复数

neuron: 神经元

generalize: 归纳

variation: 变化

(补充)

Pedro Domingos总结了五大流派目前存在的问题和解决方案,但他也重点强调,我们真正需要的是可以一次性解决这些所有问题的统一算法。

⭐️各学派的综合

表示

  • 概率逻辑(例如马尔可夫逻辑模型)
  • 带权公式,状态分布

评估

  • 后验概率
  • 用户定义的目标函数

最优化

  • 公式发现:基因编程
  • 权值学习:反向传播

Chapter 3: A look at Machine learning methods

Which machine learning algorithm should you use? A lot depends on the characteristics and the amount of the available data, as well as your training goals, in each particular use case. Avoid using the most complicated algorithms unless the end justifies more expensive means and resources. Here are some of the more common algorithms ranked by ease of use.

1.Decision trees

Decision tree analysis typically uses a hierarchy of variables or decision nodes that, when answered step by step, can classify a given customer as creditworthy or not, for example.

  • Advantages

    Decision trees are useful when evaluating lists of distinct features, qualities, or characteristics of people, places, or things.

  • Use cases

    Rule-based credit risk assessment, horse race performance prediction


distinct: 可辩别的;有区别的;不同的;明显的;清楚无误的;明确的

2.Support vector machines

Support vector machines classify groups of data with the help of hyperplanes

  • Advantages

    Support vector machines are good for the binary classification of X versus other variables and are useful whether or not the relationship between variables is linear.

  • Use cases

    News categorization, handwriting recognition


hyperplane: 超平面

3.Regression

Regression maps the behavior of a dependent variable relative to one or more dependent variables. In this example, logistic regression separates spam from non-spam text.

  • Advantages

    Regression is useful for identifying continuous (not necessarily distinct) relationships between variables.

  • Use cases

    Traffic flow analysis, email filtering


map: v.勾画;绘制

dependent variable: 因变量

spam: 垃圾邮件

4.Naive Bayes classification

Naive Bayes classifiers compute probabilities, given tree branches of possible conditions. Each individual feature is “naive” or conditionally independent of, and therefore does not influence, the others. For example, what’s the probability you would draw two yellow marbles in a row, given a jar of five yellow and red marbles total? The probability, following the topmost branch of two yellow in a row, is one in ten. Naive Bayes classifiers compute the combined, conditional probabilities of multiple attributes.

  • Advantages

    Naive Bayes methods allow the quick classification of relevant items in small data sets that have distinct features.

  • Use cases

    Sentiment analysis, consumer segmentation


classification: 分类器

marble: 弹子游戏

in a row: 连续地

segmentation: 分割;划分

5.Hidden Markow models

Observable Markov processes are purely deterministic–one given state always follows another given state. Traffic light patterns are an example.

Hidden Markov models, by contrast, compute the probability of hidden states occurring by analyzing observable data, and then estimating the likely pattern of future observation with the help of the hidden state analysis. In this example, the probability of high or low pressure (the hidden state) is used to predict the likelihood of sunny, rainy, or cloudy weather.

  • Advantages

    Tolerates data variability and effective for recognition and prediction.

  • Use cases

    Facial expression analysis, weather prediction


observable: 显著的;显式的;可观察的

deterministic: 确定性

6.Random forest

Random forest algorithms improve the accuracy of decision trees by using multiple trees with randomly selected subsets of data. This example reviews the expression levels of various genes associated with breast cancer relapse and computes a relapse risk.

  • Advantages

    Random forest methods prove useful with large data sets and items that have numerous and sometimes irrelevant features.

  • Use cases

    Customer churn analysis, risk assessment


subset: 子集

relapse: 重新恶化;复发

numerous: 许多的

churn: 搅;翻腾;流失

7.Recurrent neural networks

Each neuron in any neural network converts many inputs into single outputs via one or more hidden layers. Recurrent neural networks [RNNs] additionally pass values from step to step, making step-by-step learning possible. In other words, RNNs have a form of memory, allowing previous outputs to affect subsequent inputs.

  • Advantages

    Recurrent neural networks have predictive power when used with large amounts of sequenced information.

  • Use cases

    Image classification and captioning, political sentiment analysis


caption: 给(插图)加标题(或说明)

8.Long short-term memory & gated recurrent unit neural networks (门控循环单元神经网络)

Older forms of RNNs can be lossy. While these older recurrent neural networks only allow small amounts of older information to persist, newer long short-term memory (LSTM) and gated recurrent unit (GRU) neural networks have both long- and short-term memory. In other words, these newer RNNs have greater memory control, allowing previous values to persist or to be reset as necessary for many sequences of steps, avoiding “gradient decay” or eventual degradation of the values passed from step to step. LSTM and GRU networks make this memory control possible with memory blocks and structures called gates that pass or reset values as appropriate.

  • Advantages

    Long short-term memory and gated recurrent unit neural networks have the same advantages as other recurrent neural networks and are more frequently used than other recurrent neural networks because of their greater memory capablilities.

  • Use cases

    Natural language processing, translation


lossy: (压缩)有损的

persist: 坚持;持续;延续

gradient: (数学)梯度;斜率

decay: 衰减

degradation: 降级;退化

9.Convolutional neural networks (卷积神经网络)

Convolutions are blends of weights from a subsequent layer that are used to label the output layer.

  • Advantages

    Convolutional neural networks are most useful with very large data sets, large numbers of features, and complex classification tasks.

  • Use cases

    Image recognition, text to speech, drug discovery

相关阅读

  1. 一文读懂机器学习、数据科学、人工智能、深度学习和统计学之间的区别

    https://mp.weixin.qq.com/s?__biz=MzA3MzI4MjgzMw==&mid=2650723678&idx=1&sn=5cb049e37427dd2b2a4e30e42bcc2fff&chksm=871b1120b06c983651bf92526cd8554225304200364d39cd18592fd8a6848d1f84cf80aeea22&scene=21#wechat_redirect

  2. 深度 | 深度学习与神经网络全局概览:核心技术的发展历程

    https://mp.weixin.qq.com/s?__biz=MzA3MzI4MjgzMw==&mid=2650717969&idx=1&sn=712e4880e63db42bcb4db5ba06c9856d&scene=21#wechat_redirect

  3. 机器学习算法集锦:从贝叶斯到深度学习及各自优缺点

    https://mp.weixin.qq.com/s?__biz=MzA3MzI4MjgzMw==&mid=2650723438&idx=1&sn=a778051186c0e1fb3cdb4076868fd54a&chksm=871b1010b06c99063ec5599dcecbed5ce3065e7c2f0ab1cc11a8251f2472838302f89cf51d52&scene=21#wechat_redirect

  4. 经验之谈:如何为你的机器学习问题选择合适的算法?

    https://mp.weixin.qq.com/s?__biz=MzA3MzI4MjgzMw==&mid=2650723704&idx=1&sn=5e791710b46502661e25ff6f7528003b&chksm=871b1106b06c98107174c81401c1f7017b35939ab20bc83b305ecae8b503690518fd32d75bbd&scene=21#wechat_redirect

  5. 教程 | 从头开始:用Python实现决策树算法

    https://mp.weixin.qq.com/s?__biz=MzA3MzI4MjgzMw==&mid=2650723438&idx=4&sn=cf3902a9933afe08ac3c38452044cddd&chksm=871b1010b06c99062809133f3ad6279bccd64768a761a2aa6495367048069bc13788929b276a&scene=21#wechat_redirect

  6. 想了解概率图模型?你要先理解图论的基本定义与形式

    https://mp.weixin.qq.com/s?__biz=MzA3MzI4MjgzMw==&mid=2650725041&idx=1&sn=0c57ba70e2613e6af80c4ab61c996d44&chksm=871b1ecfb06c97d9547e50705d3e74a2b8c41254f0efc2dd88d2e89eec3bfac5da089f28c398&scene=21#wechat_redirect

  7. 干货 | 详解支持向量机(附学习资源)

    https://mp.weixin.qq.com/s?__biz=MzA3MzI4MjgzMw==&mid=2650722941&idx=2&sn=328ba8aa2657217c1d90304018ba3bc6&chksm=871b1603b06c9f155faf0f1e6d6a62f9d014bcaa85f57abc9f0f9ff0ab0ac608b1749f12c170&scene=21#wechat_redirect

  8. 教程 | 从头开始:用Python实现随机森林算法

    https://mp.weixin.qq.com/s?__biz=MzA3MzI4MjgzMw==&mid=2650722795&idx=3&sn=53a95b9ecc269dc0ec8926253b2d5412&chksm=871b1595b06c9c83d205dcab550d14f2709bbae290219b2ce0ad913a0677dc617f138d1ebb6f&scene=21#wechat_redirect

  9. 深度 | LSTM 和递归网络基础教程

    https://mp.weixin.qq.com/s?__biz=MzA3MzI4MjgzMw==&mid=2650717709&idx=2&sn=2bff1e56bc75d65e178476ea9a93b2c5&scene=21#wechat_redirect

  10. 干货 | 图解LSTM神经网络架构及其11种变体(附论文)

    https://mp.weixin.qq.com/s?__biz=MzA3MzI4MjgzMw==&mid=2650719562&idx=1&sn=ad6693cdeaa18034ed1c53271f642ef7&chksm=871b0134b06c8822bf89781a81081c161eb82b06d0c20b655bd7b991202d363b6c233ef137ff&scene=21#wechat_redirect

  11. 专栏 | 卷积神经网络简介

    https://mp.weixin.qq.com/s?__biz=MzA3MzI4MjgzMw==&mid=2650723520&idx=4&sn=8ee14dd052766ca3e0afa60dcbb65b2d&chksm=871b10beb06c99a81ef547319637a177142d33a40da5a85024fc6a3b623d60d3a7ac22e3efc3&scene=21#wechat_redirect

  12. 深度 | 从入门到精通:卷积神经网络初学者指南(附论文)

    https://mp.weixin.qq.com/s?__biz=MzA3MzI4MjgzMw==&mid=2650717691&idx=2&sn=3f0b66aa9706aae1a30b01309aa0214c&scene=21#wechat_redirect

  13. 重磅论文 | 解析深度卷积神经网络的14种设计模式(附下载)

    https://mp.weixin.qq.com/s?__biz=MzA3MzI4MjgzMw==&mid=2650720302&idx=1&sn=c88634da158f36db23b9dc7d0dc550ad&chksm=871b0c50b06c854694984e193f289deb51a5efe71f53223dc37feb70509fd957c8af5bb61ab3&scene=21#wechat_redirect

  • 0
    点赞
  • 1
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值