LR的代价函数 逻辑回归定义:
P
(
Y
=
1
∣
x
)
=
1
1
+
exp
(
−
w
x
)
=
σ
(
x
)
P
(
Y
=
0
∣
x
)
=
1
−
σ
(
x
)
\begin{array}{l}{P(Y=1 | x)=\frac{1}{1+\exp (-w x)}=\sigma(x)} \\ {P(Y=0 | x)=1-\sigma(x)}\end{array}
P(Y=1∣x)=1+exp(−wx)1=σ(x)P(Y=0∣x)=1−σ(x) 负对数函数作为损失函数:
L
(
w
)
=
−
log
(
∏
i
=
1
N
[
σ
(
x
i
)
]
y
[
[
1
−
σ
(
x
i
)
]
1
−
y
i
)
=
−
∑
i
=
1
N
[
y
i
log
σ
(
x
i
)
+
(
1
−
y
i
)
log
(
1
−
σ
)
]
=
−
∑
i
=
1
N
[
y
i
log
σ
(
x
i
)
1
−
σ
(
x
i
)
+
log
(
1
−
σ
(
x
i
)
)
]
L
(
w
)
=
−
∑
N
[
y
i
(
w
x
i
)
−
log
(
1
+
exp
(
w
x
i
)
)
]
\begin{aligned} L(w) &=-\log \left(\prod_{i=1}^{N}\left[\sigma\left(x_{i}\right)\right]^{y_{[ }}\left[1-\sigma\left(x_{i}\right)\right]^{1-y_{i}}\right) \\ &=-\sum_{i=1}^{N}\left[y_{i} \log \sigma\left(x_{i}\right)+\left(1-y_{i}\right) \log (1-\sigma)\right.]\\ &=-\sum_{i=1}^{N}\left[y_{i} \log \frac{\sigma\left(x_{i}\right)}{1-\sigma\left(x_{i}\right)}+\log \left(1-\sigma\left(x_{i}\right)\right)\right] \\L(w) &=-\sum^{N}\left[y_{i}\left(w x_{i}\right)-\log \left(1+\exp \left(w x_{i}\right)\right)\right] \end{aligned}
L(w)L(w)=−log(i=1∏N[σ(xi)]y[[1−σ(xi)]1−yi)=−i=1∑N[yilogσ(xi)+(1−yi)log(1−σ)]=−i=1∑N[yilog1−σ(xi)σ(xi)+log(1−σ(xi))]=−∑N[yi(wxi)−log(1+exp(wxi))] 求梯度:
∂
L
(
w
)
∂
w
=
−
∑
i
=
1
N
[
y
i
x
i
−
exp
(
w
x
i
)
1
+
exp
(
w
x
i
)
x
i
]
=
∑
i
=
1
N
[
σ
(
x
i
)
−
y
i
]
x
i
\begin{aligned} \frac{\partial L(w)}{\partial w}=&-\sum_{i=1}^{N}\left[y_{i} x_{i}-\frac{\exp \left(w x_{i}\right)}{1+\exp \left(w x_{i}\right)} x_{i}\right] \\=& \sum_{i=1}^{N}\left[\sigma\left(x_{i}\right)-y_{i}\right] x_{i} \end{aligned}
∂w∂L(w)==−i=1∑N[yixi−1+exp(wxi)exp(wxi)xi]i=1∑N[σ(xi)−yi]xi 最后在使用梯度下降求解参数:
P
(
Y
=
k
∣
x
)
=
exp
(
w
k
x
)
1
+
∑
k
=
1
K
−
1
exp
(
w
k
x
)
k
=
1
,
2
,
…
,
K
−
1
P
(
Y
=
K
∣
x
)
=
1
1
+
∑
k
=
1
K
−
1
exp
(
w
k
x
)
\begin{array}{l}{P(Y=k | x)=\frac{\exp \left(w_{k} x\right)}{1+\sum_{k=1}^{K-1} \exp \left(w_{k} x\right)} \quad k=1,2, \ldots, K-1} \\ {P(Y=K | x)=\frac{1}{1+\sum_{k=1}^{K-1} \exp \left(w_{k} x\right)}}\end{array}
P(Y=k∣x)=1+∑k=1K−1exp(wkx)exp(wkx)k=1,2,…,K−1P(Y=K∣x)=1+∑k=1K−1exp(wkx)1