cs231n assignment(1.1):kNN分类器

最新推荐文章于 2021-04-05 20:36:44 发布

小码奔腾

最新推荐文章于 2021-04-05 20:36:44 发布

阅读量323

点赞数

分类专栏：人工智能 cs231n 文章标签： cs231n 图片分类

本文链接：https://blog.csdn.net/EvansYang_112/article/details/78394595

版权

人工智能同时被 2 个专栏收录

4 篇文章 0 订阅

订阅专栏

cs231n

4 篇文章 0 订阅

订阅专栏

cs231n assignment(1.1):kNN分类器

kNN分类器练习的宗旨在于：了解基本图像分类流程，交叉验证，熟悉书写高效的向量代码

understand the basic Image Classification pipeline, cross-validation, and gain proficiency in writing efficient, vectorized code.

1. 数据准备

数据来源于cifar10图片数据库，该数据库有10种不同的图片类，分别是’plane’, ‘car’, ‘bird’, ‘cat’, ‘deer’, ‘dog’, ‘frog’, ‘horse’, ‘ship’, ‘truck.每张图片大小为32*32*3

以下是数据集的基本信息，均为numpy.ndarray

Training data shape:  (50000, 32, 32, 3)
Training labels shape:  (50000,)
Test data shape:  (10000, 32, 32, 3)
Test labels shape:  (10000,)

部分展示如下：
cifar-10数据集部分展示

为了简化流程，训练集取了5000幅图，测试集取了500幅图，并且把训练集和测试集的维数都拉成了一维于是

    X_Train.shape:(5000,3072)
    y_train.shape:(5000,)
    X_test.shape:(500,3072)
    y_test.shape(500,)

2. kNN算法的实现

2.1 kNN算法的第一步在于计算两张图的L2距离，也就是欧式距离
其公式如下：

d 2 (I 1, I 2) = \sum p (I p 1 - I p 2) 2 ‾ ‾ ‾ ‾ ‾ ‾ ‾ ‾ ‾ ‾ ‾ ‾ ‾ \sqrt

$d_2(I_1,I_2) = \sqrt{\sum_p{({I_1^p-I_2^p})^2}}$
其中

I1 $I_1$ ,

I2 $I_2$ 分别代表了两幅不同的图，p是指其中某个点的值。
assignment中分别实现了用两层循环，一层循环，和不用循环的向量化运算，这里只写不用循环的向量化计算
L2的代码：

  def compute_distances_no_loops(self, X):
    """
    Compute the distance between each test point in X and each training point
    in self.X_train using no explicit loops.
    Input / Output: Same as compute_distances_two_loops
    """
    num_test = X.shape[0]
    num_train = self.X_train.shape[0]
    dists = np.zeros((num_test, num_train)) 
    M1=np.sum(X**2,axis=1,keepdims = True)
    M2=np.sum(self.X_train ** 2, axis=1)
    M3=np.multiply(np.dot(X,self.X_train.T),-2)
    M=M3+M1+M2
    dists=np.sqrt(M)
    return dists

理解这段代码的方法在于将L2公式展开并且用矩阵的想法带入，公式可类比展开如下：

d 2 (I 1, I 2) = \sum p (I p 1) 2 + \sum p (I p 2) 2 - 2 \sum p I p 1 I p 2 ‾ ‾ ‾ ‾ ‾ ‾ ‾ ‾ ‾ ‾ ‾ ‾ ‾ ‾ ‾ ‾ ‾ ‾ ‾ ‾ ‾ ‾ ‾ ‾ ‾ ‾ ‾ ‾ ‾ \sqrt

$d_2(I_1,I_2) = \sqrt{\sum_p(I_1^p)^2+\sum_p(I_2^p)^2-2\sum_pI_1^pI_2^p}$
理解矩阵的维数和整体操作就不难写出上面的代码了。

2.2 预测函数

  def predict_labels(self, dists, k=1):
    """
    Given a matrix of distances between test points and training points,
    predict a label for each test point.

    Inputs:
    - dists: A numpy array of shape (num_test, num_train) where dists[i, j]
      gives the distance betwen the ith test point and the jth training point.

    Returns:
    - y: A numpy array of shape (num_test,) containing predicted labels for the
      test data, where y[i] is the predicted label for the test point X[i].  
    """
    num_test = dists.shape[0]
    num_train = dists.shape[1]
    y_pred = np.zeros(num_test)
    for i in xrange(num_test):
      closest_y = []
      idx = np.argsort(dists[i,:])
      closest_y = self.y_train[idx[:k]]
      count = 0
      label = 0
      for j in closest_y:
         tmp = 0
         for kk in closest_y:
            tmp += (kk == j)
         if tmp > count:
            count = tmp
            label = j
      y_pred[i] = label

    return y_pred

3.交叉验证

交叉验证的原因在于不能使用测试集来做验证，因为测试集是非常宝贵的，它反映了模型整体的适配能力，所以不应该使用。然而，又需要调试超参数（hyperparameters),所以我们使用训练集来调试超参数。

对不同的K值，操作是一样的：把训练集合分成五份，然后选其中一份为验证数据（ValidationData），其余四份为训练数据（TrainingData)，循环五次，得到不同的分类准确度（accuracy)。也就是说，每个k值对应的五个准确度。

num_folds = 5
k_choices = [1, 3, 5, 8, 10, 12, 15, 20, 50, 100]

X_train_folds = []
y_train_folds = []
X_train_folds = np.array_split(X_train,num_folds)
y_train_folds = np.array_split(y_train,num_folds)
k_to_accuracies = {}

for k in k_choices:
    k_to_accuracies[k] = np.zeros(num_folds)
    for i in xrange(num_folds):
        Training_Data = np.array(X_train_folds[:i]+X_train_folds[i+1:])
        Training_Label = np.array(y_train_folds[:i]+y_train_folds[i+1:])
        Validation_Data = X_train_folds[i]
        Validation_Label = y_train_folds[i]

        Training_Data = np.reshape(Training_Data,(4000,3072))
        Training_Label = np.reshape(Training_Label,4000)

        classifier.train(Training_Data, Training_Label)
        yte_Pre = classifier.predict(Validation_Data,k)
        num_correct = np.sum(yte_Pre == Validation_Label)
        accuracy = (float)(num_correct)/len(Validation_Label)
        k_to_accuracies[k][i] = accuracy

得到一张图表
交叉验证结果图

大约在 $k\approx8$ 的地方表现最好，于是把可设置成8，进行最后的测试，得到

Got 147 / 500 correct => accuracy: 0.294000

小码奔腾

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
复制链接

分享到 QQ

分享到新浪微博

扫一扫

专栏目录