吴恩达教授《AI for everyone》课程第一周——机器学习

最新推荐文章于 2024-06-20 07:30:56 发布

港版国产机

最新推荐文章于 2024-06-20 07:30:56 发布

阅读量764

点赞数 1

分类专栏： AI

本文链接：https://blog.csdn.net/u012897401/article/details/88431972

版权

AI 专栏收录该内容

2 篇文章 0 订阅

订阅专栏

视频地址：https://www.coursera.org/learn/ai-for-everyone/lecture/5TPFo/machine-learning

英文字幕：

The rise of AI has been largely driven by one tool in AI called machine learning. In this video, you'll learn what is machine learning, so that by the end, you hope we will start thinking how machine learning might be applied to your company or to your industry. The most commonly used type of machine learning is a type of AI that learns A to B, or input to output mappings. This is called supervised learning. Let's see some examples. If the input A is an email and the output B one is email spam or not, zero one. Then this is the core piece of AI used to build a spam filter. Or if the input is an audio clip, and the AI's job is to output the text transcript, then this is speech recognition. More examples, if you want to input English and have it output a different language, Chinese, Spanish, something else, then this is machine translation. Or the most lucrative form of supervised learning, of this type of machine learning maybe be online advertising, where all the large online ad platforms have a piece of AI that inputs some information about an ad, and some information about you, and tries to figure out, will you click on this ad or not? By showing you the ads you're most likely to click on, this turns out to be very lucrative. Maybe not the most inspiring application, but certainly having a huge economic impact today. Or if you want to build a self-driving car, one of the key pieces of AI is in the AI that takes as input an image, and some information from their radar, or from other sensors, and output the position of other cars, so your self-driving car can avoid the other cars. Or in manufacturing. I've actually done a lot of work in manufacturing where you take as input a picture of something you've just manufactured, such as a picture of a cell phone coming off the assembly line. This is a picture of a phone, not a picture taken by a phone, and you want to output, is there a scratch, or is there a dent, or some other defects on this thing you've just manufactured? And this is visual inspection which is helping manufacturers to reduce or prevent defects in the things that they're making. This set of AI called supervised learning, just learns input to output, or A to B mappings. On one hand, input to output, A to B it seems quite limiting. But when you find a right application scenario, this can be incredibly valuable. Now, the idea of supervised learning has been around for many decades. But it's really taken off in the last few years. Why is this? Well, my friends asked me, "Hey Andrew, why is supervised learning taking off now?" There's a picture I draw for them. I want to show you this picture now, and you may be able to draw this picture for others that ask you the same question as well. Let's say on the horizontal axis you plot the amount of data you have for a task. So, for speech recognition, this might be the amount of audio data and transcripts you have. In lot of industries, the amount of data you have access to has really grown over the last couple of decades. Thanks to the rise of the Internet, the rise of computers. A lot of what used to be say pieces of paper, are now instead recorded on a digital computer. So, we've just been getting more and more data. Now, let's say on the vertical axis you plot the performance of an AI system. It turns out that if you use a traditional AI system, then the performance would grow like this, that as you feed in more data is performance gets a bit better. But beyond a certain point it did not get that much better. So it's as if your speech recognition system did not get that much more accurate, or your online advertising system didn't get that much more accurate that's showing the most relevant ads, even as you show the more data. AI has really taken off recently due to the rise of neural networks and deep learning. I'll define these terms more precise in later video, so don't worry too much about what it means for now. But with modern AI, with neural networks and deep learning, what we saw was that, if you train a small neural network, then the performance looks like this, where as you feed them more data, performance keeps getting better for much longer. If you train a even slightly larger neural network, say medium-sized neural net, then the performance may look like that. If you train a very large neural network, then the performance just keeps on getting better and better. For applications like speech recognition, online advertising, building self-driving car, where having a high-performance, highly accurate, say speech recognition system is important, enable these AI systems get much better, and make speech recognition products much more acceptable to users, much more valuable to companies and to users. Now, a few couple of implications of this figure. If you want the best possible levels of performance, your performance to be up here, to hit this level of performance, then you need two things: One is, it really helps to have a lot of data. So that's why sometimes you hear about big data. Having more data almost always helps. The second thing is, you want to be able to train a very large neural network. So, the rise of fast computers, including Moore's law, but also the rise of specialized processors such as graphics processing units or GPUs, which you'll hear more about in a later video, has enabled many companies, not just a giant tech companies, but many many other companies to be able to train large neural nets on a large enough amount of data in order to get very good performance and drive business value. The most important idea in AI has been machine learning, has basically supervised learning, which means A to B, or input to output mappings. What enables it to work really well is data. In the next video, let's take a look at what is the data and what data you might already have? And how to think about feeding this into AI systems. Let's go on to the next video.

中文字幕：

人工智能的兴起主要是由人工智能中的一种称为机器学习的工具驱动的。在本视频中，您将了解什么是机器学习，因此最终，您希望我们将开始考虑如何将机器学习应用于您的公司或您的行业。最常用的机器学习类型是一种学习A到B或输入到输出映射的AI。这称为监督学习。我们来看一些例子。如果输入A是电子邮件而输出B是电子邮件垃圾邮件，则为零。然后，这是用于构建垃圾邮件过滤器的核心AI。或者，如果输入是音频剪辑，并且AI的工作是输出文本记录，则这是语音识别。更多的例子，如果你想输入英语并输出不同的语言，中文，西班牙语等等，那么这就是机器翻译。或者最有利可图的监督学习形式，这种类型的机器学习可能是在线广告，其中所有大型在线广告平台都有一块人工智能输入一些广告信息，以及一些关于你的信息，并试图想象你会点击这个广告吗？通过向您展示您最有可能点击的广告，结果证明这是非常有利可图的。也许不是最鼓舞人心的应用程序，但今天肯定会产生巨大的经济影响。或者，如果你想要制造一辆自动驾驶汽车，人工智能的一个关键部分就是在人工智能中输入图像，从雷达或其他传感器获取一些信息，并输出其他车辆的位置，所以你的自动驾驶汽车可以避开其他车。或者在制造业。我实际上已经在制造方面做了很多工作，你可以把你刚刚制造的东西的图片作为输入，例如手机下线的照片。这是一张手机的照片，而不是手机拍的照片，你想要输出，是否有划痕，或者是否有凹痕，或者你刚制造的这个东西还有其它缺陷？这是视觉检查，帮助制造商减少或防止他们正在制造的东西的缺陷。这组AI称为监督学习，只是学习输出到输出，或A到B映射。一方面，输入到输出，A到B似乎非常有限。但是当您找到合适的应用场景时，这可能非常有价值。现在，监督学习的想法已经存在了几十年。但它在过去几年里真的起飞了。为什么是这样？好吧，我的朋友问我，“嘿安德鲁，为什么现在有人监督学习起飞？”有一张我为他们画的照片。我现在想给你看这张照片，你也可以把这张照片画给那些问你同样问题的人。假设您在水平轴上绘制了任务的数据量。因此，对于语音识别，这可能是您拥有的音频数据和成绩单的数量。在许多行业中，您可以访问的数据量在过去几十年中确实增长了。由于互联网的兴起，计算机的兴起。过去常被说成纸的很多东西现在被记录在数字计算机上。所以，我们刚刚获得越来越多的数据。现在，让我们说在纵轴上绘制AI系统的性能。事实证明，如果你使用传统的AI系统，那么性能会像这样增长，因为当你输入更多数据时，性能会变得更好。但是超过某一点它没有那么好。因此，就好像您的语音识别系统没有那么准确，或者您的在线广告系统没有那么准确，显示最相关的广告，即使您显示的数据更多。由于神经网络和深度学习的兴起，人工智能最近真正起飞。我将在以后的视频中更准确地定义这些术语，所以不要过于担心它现在意味着什么。但是对于现代AI，神经网络和深度学习，我们看到的是，如果你训练一个小型神经网络，那么性能就像这样，在你为它们提供更多数据时，性能会持续变得更好。如果你训练一个更大的神经网络，比如说中型神经网络，那么性能可能就像那样。如果你训练一个非常大的神经网络，那么性能就会越来越好。对于语音识别，在线广告，构建自动驾驶汽车等应用，具有高性能，高精度的语音识别系统非常重要，使这些AI系统变得更好，并使语音识别产品更易被用户接受，对公司和用户更有价值。现在，这个数字有几个含义。如果你想要最好的性能水平，你的性能在这里，达到这个性能水平，那么你需要两件事：一个是，它确实有助于拥有大量的数据。这就是为什么有时你会听到大数据的原因。拥有更多数据几乎总是有帮助。第二件事是，你希望能够训练一个非常大的神经网络。因此，包括摩尔定律在内的快速计算机的兴起，以及图形处理单元或GPU等专用处理器的兴起，在后来的视频中你会听到更多，这使得许多公司，而不仅仅是一家大型科技公司但是，许多其他公司能够在足够大量的数据上训练大型神经网络，以获得非常好的性能并提高业务价值。 AI中最重要的想法是机器学习，基本上是监督学习，即A到B，或输入到输出映射。使它能够很好地工作的是数据。在下一个视频中，让我们看一下数据是什么以及您可能拥有哪些数据？以及如何考虑将其投入AI系统。让我们继续下一个视频。