(凸优化理论学习笔记2017/3/17）Theory of Convex Optimeization for Machine Learning(Sebatien Bubeck)

最新推荐文章于 2024-06-19 09:10:53 发布

focus_clam

最新推荐文章于 2024-06-19 09:10:53 发布

阅读量739

点赞数 2

分类专栏：机器学习之凸优化理论文章标签：机器学习凸优化 blackmodel Bubeck

本文链接：https://blog.csdn.net/david8766/article/details/62883955

版权

1 篇文章 0 订阅

订阅专栏

Chapter 5: Beyond the black-box model

Section5.1: propose a first order method with a 1/t^2 convergence rate, despite the non-smoothness
Section5.2: the function is the maximum of smooth functions
Section5.3: a concise description of Interior Point Methods

the problem:
f is convex and beta-smooth, and g is convex.
ISTA(Iterative Shrinkage-Thresholding Algorthm)
1. 根据Gradient Descent on the smooth function f
  xt+1的表达式
2. 结合这个问题minimize f+g得到xt+1的表达式
  这个就是ISTA算法的迭代式。
3. 查阅论文可得到证明此算法的收敛率（函数convex并且smooth，收敛率为1/t, 函数只convex时，收敛率为1/根号t)
4. 这个算法需要假设g is simple, 因为计算xt+1本身是一个凸优化问题，当假设g is simple时，计算xt+1可以通过解决n个在一维空间上的凸优化问题。
FISTA（Fast ISTA）
1. 结合Nesterov’s Accelerated Gradient Descent得到对应的
2. 收敛率（证明查询相关论文）
CMD and RDA
1. ISTA and FISTA assume smoothness in the Euclidean metric.
2. CMD and RDA use these ideas in a non-Euclidean metric
总结
- 当函数可以分解为sum of f and g（f is convex and smooth, g is convex)时，这个凸优化问题的收敛率比只知道函数为凸时的收敛率小。
未完待续