Logistic Regression


K5niper


Category: Machine Learning Notes

Article link: https://blog.csdn.net/zhaoyin214/article/details/102698341


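This article works through the theory of logistic regression: the definition of the objective function, the computation of its gradient, and its convexity. The loss function is derived by maximum likelihood estimation, and the parameters are then optimized with gradient descent. The article also shows that the Hessian (the matrix of second derivatives) of the objective is positive semidefinite, so the objective is convex and any local optimum is also a global optimum.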


Logistic regression

Deriving gradient descent for logistic regression
The logistic regression objective function is convex

The training data are $\{ (\mathbf{x}_{1}, y_{1}), \cdots, (\mathbf{x}_{n}, y_{n}) \}$, where $(\mathbf{x}_{i}, y_{i})$ is one sample, $\mathbf{x}_{i} \in \mathbb{R}^{D}$ is its $D$-dimensional feature vector, and $y_{i} \in \{ 0, 1\}$ is its label.

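The parameters of the logistic regression model are $(\mathbf{w}, b)$. To simplify the derivation, the bias $b$ is usually absorbed into $\mathbf{w}$ as $w_{0} = b$, with a constant 1 prepended to every feature vector. Both vectors then have $D + 1$ components, and $\mathbf{w}$ and $\mathbf{x}_{i}$ are rewritten as

$\mathbf{w} = [w_{0}, w_{1}, \cdots, w_{D}]^{\top}, \ \mathbf{x}_{i} = [1, x_{i1}, \cdots, x_{iD}]^{\top}$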
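To make the bias trick concrete, here is a minimal sketch in plain Python. The helper names (`augment`, `dot`) and all numeric values are made up for illustration and are not from the original article. It only checks that the augmented inner product matches the original "weights times features plus bias" form.

```python
def augment(x):
    """Prepend a constant 1 so the bias can be stored as w[0]."""
    return [1.0] + list(x)

def dot(u, v):
    """Inner product of two equal-length sequences."""
    return sum(a * b for a, b in zip(u, v))

# Hypothetical values, chosen only for illustration (D = 2 features).
w_orig = [0.5, -2.0]   # original weights w_1, ..., w_D
b = 0.25               # original bias
x = [3.0, 1.0]         # one feature vector

w_aug = [b] + w_orig   # [w_0, w_1, ..., w_D] with w_0 = b
x_aug = augment(x)     # [1, x_1, ..., x_D]

lhs = dot(w_orig, x) + b   # original form: w^T x + b
rhs = dot(w_aug, x_aug)    # augmented form: a single inner product
```

Here `lhs` and `rhs` agree up to floating-point rounding, which is why the rest of the derivation can work with a single parameter vector.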

1 The objective function of logistic regression

The objective function, also called the loss function, is denoted $\mathcal{L} (\mathbf{w})$.

Binary classification model

$p(y | \mathbf{x}; \mathbf{w} ) = p(y = 1 | \mathbf{x}; \mathbf{w})^{y} [1 - p(y = 1 | \mathbf{x}; \mathbf{w})]^{1 - y} \tag {1}$
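Equation (1) is a compact way of writing both class probabilities at once: setting $y = 1$ leaves only $p(y = 1 | \mathbf{x}; \mathbf{w})$, and setting $y = 0$ leaves only $1 - p(y = 1 | \mathbf{x}; \mathbf{w})$. A small sketch of this, where the function name `bernoulli_pmf` and the value `p1 = 0.8` are assumptions made for illustration:

```python
def bernoulli_pmf(y, p1):
    """Evaluate Eq. (1): p(y | x; w) for y in {0, 1}, given p1 = p(y = 1 | x; w)."""
    return (p1 ** y) * ((1.0 - p1) ** (1 - y))

p1 = 0.8  # hypothetical predicted probability of the positive class

prob_pos = bernoulli_pmf(1, p1)  # exponent of the second factor is 0, so only p1 remains
prob_neg = bernoulli_pmf(0, p1)  # exponent of the first factor is 0, so only 1 - p1 remains
```

The two values sum to 1 (up to rounding), as expected for a distribution over two labels.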

Maximum likelihood estimation (MLE)

$\begin{aligned} \mathbf{w}^{\ast} & = \arg \max_{\mathbf{w}} p(\mathbf{y} | \mathbf{x}; \mathbf{w} ) \\ & = \arg \max_{\mathbf{w}} \prod_{i = 1}^{n} p(y_{i} | \mathbf{x}_{i}; \mathbf{w} ) \\ & = \arg \max_{\mathbf{w}} \log \left[ \prod_{i = 1}^{n} p(y_{i} | \mathbf{x}_{i}; \mathbf{w} ) \right] \\ & = \arg \max_{\mathbf{w}} \sum_{i = 1}^{n} \log \left[ p(y_{i} | \mathbf{x}_{i}; \mathbf{w} ) \right] \\ & = \arg \max_{\mathbf{w}} \sum_{i = 1}^{n} \log \left[ p(y_{i} = 1 | \mathbf{x}_{i}; \mathbf{w})^{y_{i}} [1 - p(y_{i} = 1 | \mathbf{x}_{i}; \mathbf{w})]^{1 - y_{i}} \right] \\ & = \arg \max_{\mathbf{w}} \sum_{i = 1}^{n} \left[ y_{i} \log p(y_{i} = 1 | \mathbf{x}_{i}; \mathbf{w}) + (1 - y_{i}) \log [1 - p(y_{i} = 1 | \mathbf{x}_{i}; \mathbf{w})] \right] \\ \end{aligned} \tag {2}$
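The chain of equalities in (2) relies on two facts: the samples are treated as independent, so the joint likelihood factorizes into a product, and the logarithm is monotonically increasing, so it does not change the maximizer. The sketch below evaluates the last line of (2) numerically. It assumes the standard logistic (sigmoid) form $p(y = 1 | \mathbf{x}; \mathbf{w}) = 1 / (1 + e^{-\mathbf{w}^{\top} \mathbf{x}})$, which this part of the article does not spell out. The function names and the toy data are hypothetical.

```python
import math

def sigmoid(z):
    """Logistic function, written to avoid overflow for large |z|."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def prob_positive(w, x):
    """Assumed model: p(y = 1 | x; w) = sigmoid(w^T x), with x already augmented by a leading 1."""
    return sigmoid(sum(w_j * x_j for w_j, x_j in zip(w, x)))

def log_likelihood(w, X, y):
    """Last line of Eq. (2): sum of y_i*log(p_i) + (1 - y_i)*log(1 - p_i)."""
    total = 0.0
    for x_i, y_i in zip(X, y):
        p1 = prob_positive(w, x_i)
        # Note: math.log raises an error if p1 rounds to exactly 0 or 1;
        # the toy values below stay well away from that.
        total += y_i * math.log(p1) + (1 - y_i) * math.log(1.0 - p1)
    return total

def likelihood(w, X, y):
    """Second line of Eq. (2): product of the per-sample probabilities from Eq. (1)."""
    prod = 1.0
    for x_i, y_i in zip(X, y):
        p1 = prob_positive(w, x_i)
        prod *= (p1 ** y_i) * ((1.0 - p1) ** (1 - y_i))
    return prod

# Hypothetical toy data: each row is [1, x_1] (bias feature first).
X = [[1.0, 2.0], [1.0, -1.0], [1.0, 0.5], [1.0, -2.0]]
y = [1, 0, 1, 0]

w_zero = [0.0, 0.0]   # predicts 0.5 for every sample
w_good = [0.0, 1.0]   # scores positive samples higher than negative ones

ll_zero = log_likelihood(w_zero, X, y)
ll_good = log_likelihood(w_good, X, y)
```

On this toy data `ll_good` is larger than `ll_zero`, so maximum likelihood prefers the weights that separate the labels better. Working with the sum of logs rather than the raw product also avoids numerical underflow when $n$ is large.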
