十种方法实现图像数据集降维

最新推荐文章于 2024-11-26 09:06:50 发布

白杨qq_44597856

最新推荐文章于 2024-11-26 09:06:50 发布

阅读量901

点赞数

分类专栏：人工智能文章标签： python 开发语言

原文链接：http://www.mark-to-win.com/tutorial/175888.html

版权

人工智能专栏收录该内容

6 篇文章

订阅专栏

本文详细介绍了如何使用多种降维方法对手写数字图像数据集MNIST进行降维，包括RandomProjection、PCA、SVD、LDA、MDS、Isomap、LLE、t-SNE等。通过降维，可以将高维图像数据在二维或三维空间中展示，揭示数据的内在结构。此外，还展示了数据集的可视化技巧，帮助理解降维效果。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

1、获取数据集

2、数据集可视化

3、降维及可视化

3.1、Random projection降维

3.10、Spectral embedding降维

4、总结

降维是通过单幅图像数据的高维化，对单幅图像转化为高维空间中的数据集合进行的一种操作。机器学习领域中所谓的降维就是指采用某种映射方法，将原高维空间中的数据点映射到低维度的空间中。降维的本质是学习一个映射函数 f : x->y，其中x是原始数据点的表达，目前最多使用向量表达形式。 y是数据点映射后的低维向量表达，通常y的维度小于x的维度（当然提高维度也是可以的）。f可能是显式的或隐式的、线性的或非线性的。

本项目将依托于MNIST数据集，手把手实现图像数据集降维。

MNIST数据集来自美国国家标准与技术研究所，是入门级的计算机视觉数据集。它是由6万张训练图片和1万张测试图片构成的，这些图片是手写的从0到9的数字，50%采集美国中学生，50%来自人口普查局(the Census Bureau)的工作人员。这些数字图片进行过预处理和格式化，均为黑白色构成，做了大小调整（28×28像素）并居中处理。MNIST数据集效果如下图所示：

1、获取数据集

在本案例中，选择直接从sklearn.datasets模块中通过load_digits导入手写数字图片数据集，该数据集是UCI datasets的Optical Recognition of Handwritten Digits Data Set中的测试集，并且只是MNIST的很小的子集，一共有1797张分辨率为8××8的手写数字图片。同时，该图片有从0到9共十类数字。

先导入load_digits模块及本案例所需的相关的包，实现代码如下所示：

from time import time # 用于计算运行时间
import matplotlib.pyplot as plt
import numpy as np
from matplotlib import offsetbox # 定义图形box的格式
from sklearn import (manifold, datasets, decomposition, ensemble,
discriminant_analysis, random_projection)

load_digits中有n_class参数，可以指定选择提取多少类的图片（从数字0开始），缺省值为10；还有一个return_X_y参数（sklearn 0.18版本的新参数），若该参数值为True，则返回图片数据data和标签target，默认为False。return_X_y为False的情况下，将会返回一个Bunch对象，该对象是一个类似字典的对象，其中包括了数据data、images和数据集的完整描述信息DESCR。

下面就这两种读取方式分别展示：

方法一：返回Bunch对象，实现代码如下所示：

digits = datasets.load_digits(n_class=6)
print(digits)
# 获取bunch中的data,target
print(digits.data)
print(digits.target)

输出结果如下所示：

[[ 0. 0. 5. ..., 0. 0. 0.]
[ 0. 0. 0. ..., 10. 0. 0.]
[ 0. 0. 0. ..., 16. 9. 0.]
...,
[ 0. 0. 0. ..., 9. 0. 0.]
[ 0. 0. 0. ..., 4. 0. 0.]
[ 0. 0. 6. ..., 6. 0. 0.]]
[0 1 2 ..., 4 4 0]

方法二：只返回data和target，实现代码如下所示：

data = datasets.load_digits(n_class=6)
print(data)

输出结果如下所示：

{'images': array([[[ 0., 0., 5., ..., 1., 0., 0.],
[ 0., 0., 13., ..., 15., 5., 0.],
[ 0., 3., 15., ..., 11., 8., 0.],
...,
[ 0., 4., 11., ..., 12., 7., 0.],
[ 0., 2., 14., ..., 12., 0., 0.],
[ 0., 0., 6., ..., 0., 0., 0.]],
[[ 0., 0., 0., ..., 5., 0., 0.],
[ 0., 0., 0., ..., 9., 0., 0.],
[ 0., 0., 3., ..., 6., 0., 0.],
...,
[ 0., 0., 1., ..., 6., 0., 0.],
[ 0., 0., 1., ..., 6., 0., 0.],
[ 0., 0., 0., ..., 10., 0., 0.]],
[[ 0., 0., 0., ..., 12., 0., 0.],
[ 0., 0., 3., ..., 14., 0., 0.],
[ 0., 0., 8., ..., 16., 0., 0.],
...,
[ 0., 9., 16., ..., 0., 0., 0.],
[ 0., 3., 13., ..., 11., 5., 0.],
[ 0., 0., 0., ..., 16., 9., 0.]],
...,
[[ 0., 0., 0., ..., 6., 0., 0.],
[ 0., 0., 0., ..., 2., 0., 0.],
[ 0., 0., 8., ..., 1., 2., 0.],
...,
[ 0., 12., 16., ..., 16., 1., 0.],
[ 0., 1., 7., ..., 13., 0., 0.],
[ 0., 0., 0., ..., 9., 0., 0.]],
[[ 0., 0., 0., ..., 4., 0., 0.],
[ 0., 0., 4., ..., 0., 0., 0.],
[ 0., 0., 12., ..., 4., 3., 0.],
...,
[ 0., 12., 16., ..., 13., 0., 0.],
[ 0., 0., 4., ..., 8., 0., 0.],
[ 0., 0., 0., ..., 4., 0., 0.]],
[[ 0., 0., 6., ..., 11., 1., 0.],
[ 0., 0., 16., ..., 16., 1., 0.],
[ 0., 3., 16., ..., 13., 6., 0.],
...,
[ 0., 5., 16., ..., 16., 5., 0.],
[ 0., 1., 15., ..., 16., 1., 0.],
[ 0., 0., 6., ..., 6., 0., 0.]]]), 'data': array
([[ 0., 0., 5., ..., 0., 0., 0.],
[ 0., 0., 0., ..., 10., 0., 0.],
[ 0., 0., 0., ..., 16., 9., 0.],
...,
[ 0., 0., 0., ..., 9., 0., 0.],
[ 0., 0., 0., ..., 4., 0., 0.],
[ 0., 0., 6., ..., 6., 0., 0.]]), 'target_names':
array([0, 1, 2, 3, 4, 5, 6, 7, 8, 9]), 'DESCR': "Optical Recognition
of Handwritten Digits Data Set\n===================================================
\n\nNotes\n-----\nData Set Characteristics:\n :Number of Instances:
5620\n :Number of Attributes: 64\n :Attribute Information:
8x8 image of integer pixels in the range 0..16.\n :Missing Attribute Values:
None\n :Creator: E. Alpaydin (alpaydin '@' boun.edu.tr)\n :Date: July;
1998\n\nThis is a copy of the test set of the UCI ML hand-written digits
datasets\nhttp://archive.ics.uci.edu/ml/datasets/Optical+Recognition+of+Handwritten+
Digits\n\nThe data set contains images of hand-written digits: 10 classes where\neach
class refers to a digit.\n\nPreprocessing programs made available by NIST were used
to extract\nnormalized bitmaps of handwritten digits from a preprinted form. From
a\ntotal of 43 people, 30 contributed to the training set and different 13\nto the
test set. 32x32 bitmaps are divided into nonoverlapping blocks of\n4x4 and the
number of on pixels are counted in each block. This generates\nan input matrix of
8x8 where each element is an integer in the range\n0..16. This reduces dimensionality
and gives invariance to small\ndistortions.\n\nFor info on NIST preprocessing routines,
see M. D. Garris, J. L. Blue, G.\nT. Candela, D. L. Dimmick, J. Geist, P. J. Grother,
S. A. Janet, and C.\nL. Wilson, NIST Form-Based Handprint Recognition System, NISTIR
5469,\n1994.\n\nReferences\n----------\n - C. Kaynak (1995) Methods of Combining
Multiple Classifiers and Their\n Applications to Handwritten Digit Recognition,
MSc Thesis, Institute of\n Graduate Studies in Science and Engineering, Bogazici
University.\n - E. Alpaydin, C. Kaynak (1998) Cascading Classifiers, Kybernetika.\n
- Ken Tang and Ponnuthurai N. Suganthan and Xi Yao and A. Kai Qin.\n Linear
dimensionalityreduction using relevance weighted LDA. School of\n Electrical
and Electronic Engineering Nanyang Technological University.\n 2005.\n - Claudio
Gentile. A New Approximate Maximal Margin Classification\n Algorithm. NIPS.
2000.\n", 'target': array([0, 1, 2, ..., 4, 4, 0])}