Deep Learning Series: cs231n assignment1 softmax (Part 4)

明曦君

Category: Deep Learning. Tags: python, deep learning

Article link: https://blog.csdn.net/qq_35149632/article/details/104897652

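A note up front: in the assignment, the softmax exercise differs from the SVM exercise only in the loss function and its gradient. Almost everything else is the same; for example, prediction still picks the class with the highest score. So today's post walks through the softmax part of the assignment.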
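To make the "prediction is unchanged" point concrete, here is a minimal sketch of highest-score prediction. The standalone function `predict` and the toy `W` and `X` are made up for illustration; the assignment itself uses the prediction method in linear_classifier.py, which is not reproduced here.

```python
import numpy as np

def predict(W, X):
    """Hypothetical helper: pick the highest-scoring class for each row of X.

    W: weights of shape (D, C); X: data of shape (N, D).
    The same rule applies whether W was trained with the SVM or softmax loss.
    """
    scores = X.dot(W)                 # shape (N, C), one score per class
    return np.argmax(scores, axis=1)  # index of the largest score in each row

# Toy data (assumed, not from the assignment): 2 samples, 3 features, 2 classes
W = np.array([[1.0, 0.0],
              [0.0, 1.0],
              [0.0, 0.0]])
X = np.array([[2.0, 1.0, 1.0],
              [0.5, 3.0, 1.0]])
y_pred = predict(W, X)
```

Because softmax only rescales the scores monotonically into probabilities, taking the argmax of the raw scores gives the same class as taking the argmax of the probabilities.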

Outline

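Today's post covers the softmax loss function and its gradient, then implements the softmax loss twice: once with explicit loops and once in vectorized form. Training and prediction reuse the functions in linear_classifier.py; the SGD and prediction code there is the same as in the previous post, so it is not shown again here.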
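As a rough preview of the two implementations, the sketch below computes the per-sample loss L_i = -log(p_{y_i}), where p = softmax(x_i W), and the score gradient p_j - 1[j = y_i], first with loops and then vectorized. This is not the author's code (the original implementation is not visible in this post). The function names follow the usual assignment naming, and the regularization form `reg * np.sum(W * W)` is an assumption.

```python
import numpy as np

def softmax_loss_naive(W, X, y, reg):
    """Loop version. W: (D, C), X: (N, D), y: (N,) integer labels."""
    loss = 0.0
    dW = np.zeros_like(W)
    num_train, num_classes = X.shape[0], W.shape[1]
    for i in range(num_train):
        scores = X[i].dot(W)
        scores -= np.max(scores)            # shift for numerical stability
        probs = np.exp(scores) / np.sum(np.exp(scores))
        loss += -np.log(probs[y[i]])        # cross-entropy for sample i
        for j in range(num_classes):
            # d L_i / d s_j = p_j - 1[j == y_i]
            dW[:, j] += (probs[j] - float(j == y[i])) * X[i]
    loss = loss / num_train + reg * np.sum(W * W)   # assumed L2 penalty
    dW = dW / num_train + 2 * reg * W
    return loss, dW

def softmax_loss_vectorized(W, X, y, reg):
    """Same computation without explicit Python loops."""
    num_train = X.shape[0]
    scores = X.dot(W)
    scores -= np.max(scores, axis=1, keepdims=True)
    probs = np.exp(scores)
    probs /= np.sum(probs, axis=1, keepdims=True)
    rows = np.arange(num_train)
    loss = -np.sum(np.log(probs[rows, y])) / num_train
    loss += reg * np.sum(W * W)
    dscores = probs.copy()
    dscores[rows, y] -= 1                   # subtract 1 at the correct class
    dW = X.T.dot(dscores) / num_train + 2 * reg * W
    return loss, dW

# Toy check on random data (assumed shapes, not CIFAR-10)
rng = np.random.default_rng(0)
W = rng.standard_normal((5, 3)) * 0.01
X = rng.standard_normal((4, 5))
y = np.array([0, 2, 1, 1])
loss_naive, grad_naive = softmax_loss_naive(W, X, y, 0.1)
loss_vec, grad_vec = softmax_loss_vectorized(W, X, y, 0.1)
```

With small random weights the loss should sit near log(C) for C classes, which is a handy sanity check; the two versions should also agree with each other up to floating-point error.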

Working through the assignment

1. Load the packages and data
As before, the dataset is split into four parts: train, val, test, and dev.

import random
import numpy as np
from cs231n.data_utils import load_CIFAR10
import matplotlib.pyplot as plt


%matplotlib inline
plt.rcParams['figure.figsize'] = (10.0, 8.0) # set default size of plots
plt.rcParams['image.interpolation'] = 'nearest'
plt.rcParams['image.cmap'] = 'gray'

# for auto-reloading external modules
# see http://stackoverflow.com/questions/1907993/autoreload-of-modules-in-ipython
%load_ext autoreload
%autoreload 2

def get_CIFAR10_data(num_training=49000, num_validation=1000, num_test=1000, num_dev=500):
    """
    Load the CIFAR-10 dataset from disk and perform preprocessing to prepare
    it for the linear classifier. These are the same steps as we used for the
    SVM, but condensed to a single function.  
    """
    # Load the raw CIFAR-10 data
    cifar10_dir = 'cs231n/datasets/cifar-10-batches-py'
    X_train, y_train, X_test, y_test = load_CIFAR10(cifar10_dir)
    
    # subsample the data
    mask = list(range(num_training, num_training + num_validation))
    X_val = X_train[mask]
    y_val = y_train[mask]
    mask = list(range(num_training))
    X_train = X_train[mask]
    y_train = y_train[mask]
    mask = list(range(num_test))
    X_test = X_test[mask]
    y_test = y_test[mask]
    mask = np.random.choice(num_training, num_dev, replace=False)
    X_dev = X_train[mask]
    y_dev = y_train[mask]
    
    # Preprocessing: reshape the image data into rows
    X_train = np.reshape(X_train, (X_train.shape[0], -1))
    X_val = np.reshape(X_val, (X_val.shape[0], -1))
    X_test = np.reshape(X_test, (X_test.shape[0], -1))
    X_dev = np.reshape(X_dev, (X_dev.shape[0], -1))
    
    # Normalize the data: subtract the mean image
    mean_image = np.mean(X_train, axis = 0)
    X_train -= mean_image
    X_val -= mean_image
    X_test -= mean_image
    X_dev -= mean_image
    
    # add bias dimension and transform into columns
    X_train = np.hstack([X_train, np.ones((X_train.shape[0], 1))])
    X_val = np.hstack([X_val, np.ones((X_val.shape[0], 1))])
    X_test = np.hstack([X_test, np.ones(
