51.选择管道组件之任务简单化 翻译自 吴恩达新书-Machine Learning Yearning

更多好玩的机器学习教程: 进入袋马学院领一份 ,全部免费?.

除了数据可用性以外,考虑流程中组件时,你还需要考虑一个因素:单个组件解决问题的简单程度。你应该试着选择那些容易构建和学习的组件。那什么叫做组件易于学习呢?
曝光过度的猫的图片
看看下面几个机器学习任务,按难度递增的顺序排列出来:

  1. 判断图片是否曝光过度
  2. 判断图片是在室内拍摄的还是在室外拍摄的
  3. 判断图片中是否含有猫
  4. 判断图片中的猫是否有黑色和白色的毛
  5. 判断图像是否包含暹罗猫(一种特殊的猫)

上面这些都是图形二分类任务:输入一幅图像,输出0或1。但是列表中前几个,对于神经网络来说相对容易学习。你可以通过很少的样本集就可以完成到前几个任务。机器学习中,还没有一个很好的正式定义,什么样的任务难,什么样的任务简单。随着机器学习和神经网络的兴起,我们说,如果训练的步数很少(或神经网络很浅),任务就可以完成,那么我们称任务简单。如果训练需要更多的步数(或更深的网络),则我们说任务很难。但是这些都是非正式的定义。

如果你能将一个复杂的任务,拆分成几个简单的子任务,然依据简单的子任务进行编码,通过给你的算法提供先验知识,可以帮助你的算法更有效的学习这个任务。
两只猫的图片
假如你正在构建一个暹罗猫检测器,下面是一个端到端的结构:
端到端结构
你也可以使用两步来完成这个任务:
非端到端流程
第一步先检测图片中的所有猫。
图片中的猫
第二步将每只猫裁剪出来,然后通过特殊猫咪检测器,检测每一只猫,如果有暹罗猫,则输出1.
逐只猫检测

相对于纯端到端系统,只是用0/1标签,使用两个组件:猫咪检测器和特殊猫咪分类器,似乎更容易学习且需要的数据量更少。

最后一个例子,让我们回到自动驾驶流程中:
自动驾驶流程

通过使用这个流程,你告诉算法执行三个关键步骤来实现自动驾驶:

  1. 检测其他汽车
  2. 检测路上行人
  3. 行车路线规划

这三个步骤中,每一步都相对简单,相对纯端到端系统,所需的数据量也更少。

总的来说,当你决定流程采用哪些组件时,尽量选择相对简单的组件,这样可以使用更少的数据,就可以学习了。

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
Table of Contents (draft) Why Machine Learning Strategy 4 ........................................................................................... How to use this book to help your team 6 ................................................................................ Prerequisites and Notation 7 .................................................................................................... Scale drives machine learning progress 8 ................................................................................ Your development and test sets 11 ............................................................................................ Your dev and test sets should come from the same distribution 13 ........................................ How large do the dev/test sets need to be? 15 .......................................................................... Establish a single-number evaluation metric for your team to optimize 16 ........................... Optimizing and satisficing metrics 18 ..................................................................................... Having a dev set and metric speeds up iterations 20 ............................................................... When to change dev/test sets and metrics 21 .......................................................................... Takeaways: Setting up development and test sets 23 .............................................................. Build your first system quickly, then iterate 25 ........................................................................ Error analysis: Look at dev set examples to evaluate ideas 26 ................................................ Evaluate multiple ideas in parallel during error analysis 28 ................................................... If you have a large dev set, split it into two subsets, only one of which you look at 30 ........... How big should the Eyeball and Blackbox dev sets be? 32 ...................................................... Takeaways: Basic error analysis 34 .......................................................................................... Bias and Variance: The two big sources of error 36 ................................................................. Examples of Bias and Variance 38 ............................................................................................ Comparing to the optimal error rate 39 ................................................................................... Addressing Bias and Variance 41 .............................................................................................. Bias vs. Variance tradeoff 42 ..................................................................................................... Techniques for reducing avoidable bias 43 .............................................................................. Techniques for reducing Variance 44 ....................................................................................... Error analysis on the training set 46 ........................................................................................ Diagnosing bias and variance: Learning curves 48 ................................................................. Plotting training error 50 .......................................................................................................... Interpreting learning curves: High bias 51 ............................................................................... Interpreting learning curves: Other cases 53 .......................................................................... Plotting learning curves 55 ....................................................................................................... Why we compare to human-level performance 58 .................................................................. How to define human-level performance 60 ........................................................................... Surpassing human-level performance 61 ................................................................................ Why train and test on different distributions 63 ...................................................................... Page!2 Machine Learning Yearning-Draft V0.5 Andrew NgWhether to use all your data 65 ................................................................................................ Whether to include inconsistent data 67 .................................................................................. Weighting data 68 .................................................................................................................... Generalizing from the training set to the dev set 69 ................................................................ Addressing Bias and Variance 71 ............................................................................................. Addressing data mismatch 72 ................................................................................................... Artificial data synthesis 73 ........................................................................................................ The Optimization Verification test 76 ...................................................................................... General form of Optimization Verification test 78 ................................................................... Reinforcement learning example 79 ......................................................................................... The rise of end-to-end learning 82 ........................................................................................... More end-to-end learning examples 84 .................................................................................. Pros and cons of end-to-end learning 86 ................................................................................ Learned sub-components 88 .................................................................................................... Directly learning rich outputs 89 .............................................................................................. Error Analysis by Parts 93 ....................................................................................................... Beyond supervised learning: What’s next? 94 ......................................................................... Building a superhero team - Get your teammates to read this 96 ........................................... Big picture 98 ............................................................................................................................ Credits 99
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值