In the last decade, with the discovery of deep learning, the field of image classification has experienced a renaissance. Traditional machine learning methods have been replaced by newer and more powerful deep learning algorithms, such as the convolutional neural network. However, to truly understand and appreciate deep learning, we must know why does it succeed where the other methods fail. In this article, we try to answer some of those questions, by applying various classification algorithms on the Fashion MNIST dataset.
在过去的十年中,随着深度学习的发现,图像分类领域经历了复兴。 传统的机器学习方法已被更新和更强大的深度学习算法(例如卷积神经网络)取代。 但是,要真正理解和欣赏深度学习,我们必须知道为什么其他方法失败时它会成功。 在本文中,我们尝试通过对Fashion MNIST数据集应用各种分类算法来回答其中的一些问题。
Dataset information Fashion MNIST was introduced in August 2017, by research lab at Zalando Fashion. Its goal is to serve as a new benchmark for testing machine learning algorithms, as MNIST became too easy and overused. While MNIST consists of handwritten digits, Fashion MNISTis made of images of 10 different clothing objects. Each image has the following properties:
数据集信息Fashion MNIST由Zalando Fashion的研究实验室于2017年8月推出。 随着MNIST变得过于简单和过度使用,其目标是成为测试机器学习算法的新基准。 MNIST由手写数字组成,而Fashion MNIST由10种不同服装对象的图像组成。 每个图像具有以下属性:
- Its size is 28 × 28 pixels. 尺寸为28×28像素。
- Rotated accordingly and represented in grayscale, with integer values ranging from 0 to 255. 进行相应旋转并以灰度表示,整数值的范围为0到255。
- Blank space represented by black color and having value 0. 黑色表示的空白,值为0。
In the dataset, we distinguish between the following clothing objects:
在数据集中,我们区分以下服装对象:
- T-shirt/Top T恤/上衣
- Trousers长裤
- Pullover拉过来
- Dress连衣裙
- Coat涂层
- Sandal凉鞋
- Shirt衬衫
- Sneaker运动鞋
- Bag袋
- Ankle Boot脚踝靴
Exploratory data analysis As the dataset is available as the part of the Keras library, and the images are already processed, there is no need for much preprocessing on our part. The only changes we made was converting images from a 2D array into a 1D array, as that makes them easier to work with.
探索性数据分析由于数据集可以作为Keras库的一部分使用,并且图像已经过处理,因此我们不需要太多预处理。 我们所做的唯一更改是将图像从2D数组转换为1D数组,因为这使它们更易于使用。
The dataset consists of 70000 images, of which the 60000 make the training set, and 10000 the test set. Like in the original MNIST dataset, the items are distributed evenly (6000 of each of training set and 1000 in the test set).
数据集包含70000张图像,其中60000张为训练集,10000张为测试集。 像在原始MNIST数据集中一样,项目也平均分配(每个训练集6000个,测试集中1000个)。