Normalisation adjusts the data; regularisation adjusts the prediction function.
It is well known that normalising the input data makes training faster. If your features sit on very different scales (say, one column ranging over single digits and another over hundreds of thousands), you likely want to normalise the data: transform each column so that all columns share the same (or comparable) basic statistics, such as mean and standard deviation. This keeps your fitting parameters on a scale the computer can handle without a damaging loss of accuracy.
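As a minimal sketch of one common normalisation scheme, standardisation, here is how you might rescale each column to mean 0 and standard deviation 1 with NumPy (the feature values below are invented for illustration):

```python
import numpy as np

# Invented feature matrix: one column in metres (~1-2),
# one in dollars (tens of thousands to hundreds of thousands).
X = np.array([[1.80, 250_000.0],
              [1.65,  90_000.0],
              [1.92, 480_000.0],
              [1.70,  35_000.0]])

# Standardise each column: subtract its mean, divide by its
# standard deviation, so every feature ends up on the same scale.
mu = X.mean(axis=0)
sigma = X.std(axis=0)
X_scaled = (X - mu) / sigma

print(X_scaled.mean(axis=0))  # approximately [0, 0]
print(X_scaled.std(axis=0))   # approximately [1, 1]
```

Keep the `mu` and `sigma` computed on the training data and reuse them to transform any later data, so that training and prediction see features on the same scale.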
One goal of model training is to identify the signal (the important features) and ignore the noise (random variation unrelated to the quantity being predicted). If you give your model free rein to minimize the error on the given data, you risk overfitting: the model insists on reproducing the data set exactly, including those random variations.
Regularisation imposes some control on this by rewarding simpler fitting functions over complex ones. For instance, it can encode the preference that a simple logarithmic function with an RMS error of x is better than a 15th-degree polynomial with an error of x/2. Tuning the trade-off is up to the model developer: if you know that your data are reasonably smooth in reality, you can inspect the output functions and their fitting errors, and choose your own balance.
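A minimal sketch of this trade-off, using ridge (L2) regularisation on invented data; the underlying function, noise level, polynomial degree, and penalty weight `lam` are all illustrative assumptions, not a prescription:

```python
import numpy as np

rng = np.random.default_rng(0)

# Invented smooth data: a log curve plus a little noise.
x = np.linspace(0.1, 1.0, 30)
y = np.log(x) + rng.normal(scale=0.1, size=x.size)

# Design matrix for a 15th-degree polynomial fit.
A = np.vander(x, 16)

# Unregularised least squares: minimise ||A w - y||^2.
# Free to chase the noise, this tends to overfit.
w_free, *_ = np.linalg.lstsq(A, y, rcond=None)

# Ridge regularisation: minimise ||A w - y||^2 + lam * ||w||^2.
# Larger lam rewards smaller coefficients, i.e. simpler functions.
lam = 1e-3
w_ridge = np.linalg.solve(A.T @ A + lam * np.eye(A.shape[1]), A.T @ y)

for name, w in [("free", w_free), ("ridge", w_ridge)]:
    rms = np.sqrt(np.mean((A @ w - y) ** 2))
    print(f"{name}: RMS error {rms:.3f}, "
          f"coefficient norm {np.linalg.norm(w):.1f}")
```

The regularised fit typically shows a slightly larger RMS error on the training points but a much smaller coefficient norm, which is exactly the balance described above: a modest loss in training accuracy bought in exchange for a simpler, smoother function.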