Classification and Representation

Latest recommended article published on 2024-10-14 11:42:11

山清水秀iOS

Tags: data structures and algorithms

Original link: http://www.cnblogs.com/ne-zha/p/7356666.html

Classification

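One way to attempt classification is to use linear regression and map every prediction greater than 0.5 to 1 and every prediction less than 0.5 to 0. However, this method does not work well, because classification is not actually a linear function.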
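To make the failure concrete, here is a minimal sketch in plain Python. The toy data, the helper names `fit_line` and `classify`, and the choice to send a prediction of exactly 0.5 to class 1 are all assumptions made for illustration; they are not from the original notes. It fits a least-squares line, thresholds it at 0.5, and then repeats the fit after adding a single far-away positive example.

```python
def fit_line(xs, ys):
    """Ordinary least squares for one feature: returns (slope, intercept)."""
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxx = sum((x - x_mean) ** 2 for x in xs)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = y_mean - slope * x_mean
    return slope, intercept


def classify(xs, slope, intercept, threshold=0.5):
    """Map each linear prediction to 1 if it reaches the threshold, else 0."""
    return [1 if slope * x + intercept >= threshold else 0 for x in xs]


# Hypothetical toy data: small x is the negative class, large x the positive class.
xs = [1, 2, 3, 4, 5, 6, 7, 8]
ys = [0, 0, 0, 0, 1, 1, 1, 1]

slope, intercept = fit_line(xs, ys)
clean_preds = classify(xs, slope, intercept)

# Add one extra positive example far to the right and refit.
xs_out = xs + [50]
ys_out = ys + [1]
slope_out, intercept_out = fit_line(xs_out, ys_out)
outlier_preds = classify(xs_out, slope_out, intercept_out)

# Raw (unthresholded) linear prediction for the far-away point.
raw_outlier_value = slope_out * 50 + intercept_out

print("labels       :", ys_out)
print("without x=50 :", clean_preds)
print("with x=50    :", outlier_preds)
print("raw value at x=50:", raw_outlier_value)
```

On this toy data the first fit classifies every point correctly. After the extra point is added, the fitted line flattens, the 0.5 crossing shifts to the right, and the example at x = 5 (labeled 1) is predicted as 0, even though the new point was itself an unambiguous positive. The raw linear output at x = 50 also exceeds 1, which is hard to interpret when y can only be 0 or 1.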

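The classification problem is just like the regression problem, except that the values we now want to predict take on only a small number of discrete values. For now, we will focus on the binary classification problem, in which y can take on only two values, 0 and 1. (Most of what we say here will also generalize to the multi-class case.) For instance, if we are trying to build a spam classifier for email, then x(i) may be some features of a piece of email, and y may be 1 if it is a piece of spam mail, and 0 otherwise. Hence, y∈{0,1}. 0 is also called the negative class and 1 the positive class, and they are sometimes also denoted by the symbols "-" and "+". Given x(i), the corresponding y(i) is also called the label for the training example.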

Hypothesis Representation

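We could approach the classification problem ignoring the fact that y is discrete-valued, and use our old linear regression algorithm to try to predict y given x. However, it is easy to construct examples where this method performs very poorly. Intuitively, it also doesn’t make sense for the hypothesis hθ(x) to take values larger than 1 or smaller than 0 when we know that y∈{0,1}. To fix this, we change the form of the hypothesis so that it satisfies 0 ≤ hθ(x) ≤ 1.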

Our new form uses the "Sigmoid Function," also called the "Logistic Function":

$$h_\theta(x) = g(\theta^T x), \qquad z = \theta^T x, \qquad g(z) = \frac{1}{1 + e^{-z}}$$

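The following image shows us what the sigmoid function looks like:

[Figure: plot of the sigmoid function g(z), an S-shaped curve that rises from 0 toward 1 and crosses 0.5 at z = 0]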

The function g(z), shown here, maps any real number to the (0, 1) interval, making it useful for transforming an arbitrary-valued function into a function better suited for classification.
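The mapping into (0, 1) can be checked numerically. The sketch below assumes the standard logistic form g(z) = 1 / (1 + e^(-z)); the function names `sigmoid` and `hypothesis`, the parameter list `theta`, and the sample inputs are hypothetical and chosen only for illustration. The two-branch form is one common way to avoid overflow in `math.exp` for large negative z, not something the notes prescribe.

```python
import math


def sigmoid(z):
    """Logistic function g(z) = 1 / (1 + e^(-z)), written to avoid overflow."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    # For negative z, use the equivalent form e^z / (1 + e^z),
    # so that exp is only ever called on a non-positive argument.
    e = math.exp(z)
    return e / (1.0 + e)


def hypothesis(theta, x):
    """Assumed logistic hypothesis: sigmoid of the linear score theta . x."""
    z = sum(t * xi for t, xi in zip(theta, x))
    return sigmoid(z)


# Sample inputs (hypothetical): the outputs stay between 0 and 1.
for z in [-10, -2, 0, 2, 10]:
    print(z, sigmoid(z))

# A linear score of zero sits exactly on the midpoint of the curve.
midpoint = hypothesis([0.0, 0.0], [1.0, 3.5])
print("hypothesis with zero parameters:", midpoint)
```

Large negative scores come out close to 0, large positive scores close to 1, and a score of 0 gives exactly 0.5, which is why the output is better suited to a 0/1 target than a raw linear prediction.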