图像分类分类器
Image classification is an amazing application of deep learning. We can train a powerful algorithm to model a large image dataset. This model can then be used to classify a similar but unknown set of images.
图像分类是深度学习的惊人应用。 我们可以训练一个强大的算法来对大型图像数据集建模。 然后可以使用该模型对相似但未知的图像集进行分类。
There is no limit to the applications of image classification. You can use it in your next app or you can use it to solve some real world problem. That's all up to you. But to someone who is fairly new to this realm, it might seem very challenging at first. How should I get my data? How should I build my model? What tools should I use?
图像分类的应用没有限制。 您可以在下一个应用程序中使用它,也可以使用它来解决一些现实世界中的问题。 这完全取决于您。 但是对于这个领域的新手来说,乍一看似乎很有挑战性。 我应该如何获取数据? 我应该如何建立我的模型? 我应该使用什么工具?
In this article we will discuss all of that - from finding a dataset to training your model. I will try to make things as simple as possible by avoiding some technical details (PS: Please note that this doesn't mean those details are not important. I will mention some great resources which you can refer to learn more about those topics). The purpose of this article is to explain the basic process of building an image classifier and that's what we will focus more on here.
在本文中,我们将讨论所有这些-从查找数据集到训练模型。 我将通过避免一些技术细节来使事情变得尽可能简单( PS:请注意,这并不意味着这些细节并不重要。我将提到一些很棒的资源,您可以参考这些资源以了解有关这些主题的更多信息 )。 本文的目的是解释构建图像分类器的基本过程,这就是我们将在此处重点关注的内容。
We will build an Image classifier for the Fashion-MNIST Dataset. The Fashion-MNIST dataset is a collection of Zalando's article images. It contains 60,000 images for the training set and 10,000 images for the test set data (we will discuss the test and training datasets along with the validation dataset later). These images belong to the labels of 10 different classes.
我们将为Fashion-MNIST数据集构建一个图像分类器。 Fashion-MNIST数据集是Zalando的文章图像的集合。 它包含用于训练集的60,000张图像和用于测试集数据的10,000张图像( 我们稍后将讨论测试和训练数据集以及验证数据集 )。 这些图像属于10个不同类别的标签。
导入库 (Importing Libraries)
Our goal is to train a deep learning model that can classify a given set of images into one of these 10 classes. Now that we have our dataset, we should move on to the tools we need. There are many libraries and tools out there that you can choose based on your own project requirements. For this one I will stick to the following:
我们的目标是训练一种深度学习模型,该模型可以将一组给定的图像分类为这10个类中的一个。 现在我们有了数据集,我们应该继续使用所需的工具。 您可以根据自己的项目要求选择许多库和工具。 为此,我将坚持以下几点:
Numpy - Python library for numerical computation
Numpy-用于数值计算的Python库
Pandas - Python library data manipulation
Pandas -Python库数据操作
Matplotlib - Python library data visualisation
Matplotlib -Python库数据可视化
Keras - Python library based on tensorflow for creating deep learning models
Keras - Python库基于tensorflow创建深度学习模型
Jupyter - I will run all my code on Jupyter Notebooks. You can install it via the link. You can use Google Colabs also if you need better computational power.
Jupyter-我将在Jupyter Notebooks上运行所有代码。 您可以通过链接进行安装。 如果您