Machine Learning's Triangle of Error

By David Weinberger


AI Outside In is a column by PAIR’s writer-in-residence, David Weinberger, who offers his outsider perspective on key ideas in machine learning. His opinions are his own and do not necessarily reflect those of Google.


Machine learning's superpower

When we humans argue over what’s fair, sometimes it’s about principles, sometimes about consequences, and sometimes about trade-offs. But machine learning systems can bring us to think about fairness — and many other things — in terms of three interrelated factors: two ways the machine learning (ML) can go wrong, and the most basic way of adjusting the balance between these potential errors. The types of error you’ll prefer to live with depend entirely on the sort of fairness — defined mathematically — you’re aiming your ML system at. But one way or another, you have to decide.


At their heart, many ML systems are classifiers. They ask: Should this photo go into the bucket of beach photos or not? Should this dark spot on a medical scan be classified as a fibrous growth or something else? Should this book go on the “Recommended for You” or “You’re Gonna Hate It” list? ML’s superpower is that it lets computers make these sorts of “decisions” based on what they’ve inferred from looking at thousands or even millions of examples that have already been reliably classified. From these examples they notice patterns that indicate which categories new inputs should be put into.

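The idea of learning a classifier from reliably labeled examples can be sketched in a few lines. This is a toy nearest-centroid rule over made-up 2-D "features" (say, the average blueness and brightness of a photo) — an illustration of the general pattern, not the algorithm any particular ML system uses.

```python
# A minimal sketch of a binary classifier, assuming toy 2-D feature vectors.
# The features, labels, and nearest-centroid rule are all illustrative.

def train(examples):
    """Average the feature vectors of each labeled class (each 'bucket')."""
    centroids = {}
    for features, label in examples:
        sums, count = centroids.setdefault(label, ([0.0, 0.0], 0))
        centroids[label] = ([s + f for s, f in zip(sums, features)], count + 1)
    return {label: [s / count for s in sums]
            for label, (sums, count) in centroids.items()}

def classify(centroids, features):
    """Put a new input into the bucket whose centroid is closest."""
    def dist(centroid):
        return sum((a - b) ** 2 for a, b in zip(centroid, features))
    return min(centroids, key=lambda label: dist(centroids[label]))

# Examples that have already been reliably classified: (features, label)
labeled = [([0.9, 0.8], "beach"), ([0.8, 0.9], "beach"),
           ([0.2, 0.7], "desert"), ([0.3, 0.6], "desert")]
model = train(labeled)
print(classify(model, [0.85, 0.85]))  # a blue, bright photo: prints "beach"
```

Real systems learn far richer patterns from thousands or millions of examples, but the shape is the same: infer from labeled data, then assign new inputs to categories.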

While this works better than almost anyone would expect — and a tremendous amount of research is devoted to fundamental improvements in classification algorithms — virtually every ML system that classifies inputs mis-classifies some of them. An image classifier might think that the photo of a desert is a photo of a beach. The cellphone you’re dictating into might insist that you said “Wreck a nice beach” instead of “Recognize speech.”


So, researchers and developers typically test and tune their ML systems by having them classify data that’s already been reliably classified.
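The two ways a classifier can go wrong, and the basic knob for trading one off against the other, can be sketched concretely. This assumes a classifier that outputs a score between 0 and 1 and a tunable decision threshold; the scores and ground-truth labels below are made up for illustration.

```python
# A sketch of the two error types and the threshold that balances them.
# Scores and labels are hypothetical test data, not from any real system.

def count_errors(scores_and_truth, threshold):
    """Count false positives (wrongly flagged) and false negatives (missed)."""
    fp = sum(1 for score, is_positive in scores_and_truth
             if score >= threshold and not is_positive)
    fn = sum(1 for score, is_positive in scores_and_truth
             if score < threshold and is_positive)
    return fp, fn

# (classifier's score, whether the example truly belongs in the bucket)
data = [(0.95, True), (0.80, True), (0.60, False),
        (0.55, True), (0.30, False), (0.10, False)]

for threshold in (0.2, 0.5, 0.9):
    fp, fn = count_errors(data, threshold)
    print(f"threshold={threshold}: {fp} false positives, {fn} false negatives")
```

A low threshold over-flags (more false positives); a high one misses real cases (more false negatives). Which mix is acceptable is exactly the kind of decision the column says you can’t avoid making.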
