不同的数据预处理对L1距离性能的影响_使用l1距离的最近邻分类器的性能由哪些因素决定-CSDN博客

本文链接：https://blog.csdn.net/dawningblue/article/details/103927092

本文探讨了不同数据预处理步骤如何影响使用L1距离的最近邻分类器的性能。通过分析减去均值、减去像素均值、除以标准差、像素级标准化和坐标轴旋转等操作，得出结论：平移和归一化操作不会显著改变性能，而坐标旋转会导致性能变化。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

问题由来

这个问题来自于CS231n Assignment1 Q1 inlineQuestion 2 ，原问题描述如下
We can also use other distance metrics such as L1 distance.
For pixel values $p_{ij}^{(k)}$ at location $(i, j)$ of some image $I_k$ ,

the mean $\mu$ across all pixels over all images is $\mu=\frac{1}{nhw}\sum_{k=1}^n\sum_{i=1}^{h}\sum_{j=1}^{w}p_{ij}^{(k)}$
And the pixel-wise mean $\mu_{ij}$ across all images is
$\mu_{ij}=\frac{1}{n}\sum_{k=1}^np_{ij}^{(k)}.$
The general standard deviation $\sigma$ and pixel-wise standard deviation $\sigma_{ij}$ is defined similarly.

Which of the following preprocessing steps will not change the performance of a Nearest Neighbor classifier that uses L1 distance? Select all that apply.

Subtracting the mean $\mu$ ( $\tilde{p}_{ij}^{(k)}=p_{ij}^{(k)}-\mu$ .)
Subtracting the per pixel mean $\mu_{ij}$ ( $\tilde{p}_{ij}^{(k)}=p_{ij}^{(k)}-\mu_{ij}$ .)
Subtracting the mean $\mu$ and dividing by the standard deviation $\sigma$ .
Subtracting the pixel-wise mean $\mu_{ij}$ and dividing by the pixel-wise standard deviation $\sigma_{ij}$ .
Rotating the coordinate axes of the data.