SPCAvRP: Paper Reading Notes and Thoughts

Shian150629

Article link: https://blog.csdn.net/weixin_43759518/article/details/113455174

1. Skimming the paper

Sparse PCA via random matrix projections

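The theme is balancing computational cost against statistical accuracy.
To reach that goal, what matters is the interplay between the effective sample size and the random projections.
minimax: minimizing the possible loss for a worst case (maximum loss) scenario.
When dealing with gains, it is referred to as "maximin": to maximize the minimum gain.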
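
A tiny illustration of the two definitions. The loss table is hypothetical (rows are candidate procedures, columns are scenarios, numbers made up for this example) and the function names are not from the paper.

```python
# Hypothetical loss table: rows = candidate procedures, columns = scenarios.
losses = {
    "procedure_a": [1.0, 4.0, 2.0],
    "procedure_b": [2.5, 2.0, 3.0],
    "procedure_c": [0.5, 6.0, 1.0],
}


def minimax_choice(table):
    """Pick the row whose worst-case (maximum) loss is smallest."""
    return min(table, key=lambda name: max(table[name]))


def maximin_choice(table):
    """Pick the row whose worst-case (minimum) gain is largest."""
    return max(table, key=lambda name: min(table[name]))


# Gains are just negated losses, so both rules should agree here.
gains = {name: [-x for x in row] for name, row in losses.items()}

best_by_loss = minimax_choice(losses)
best_by_gain = maximin_choice(gains)
print(best_by_loss, best_by_gain)
```

Because the gains are the negated losses, the two rules select the same row; with an unrelated gain table they generally would not.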

[Images SPCA-1 and SPCA-2 are not preserved]

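Note that the $\Sigma$ with a superscript mark in equation (2) is not a summation sign.
To tackle the non-convex optimization problem in (2), earlier work proposed adding an L1 penalty. That improves speed, but it has no theoretical support (the authors will get to laugh about this later).
Others attack (2) with semidefinite relaxation, but that is slow (the authors will laugh about this later too).
Key point! "it is now understood that, conditional on a Planted Clique hypothesis from theoretical computer science, there is an asymptotic regime in which no randomized polynomial time algorithm can attain the minimax optimal rate"
(ref: Wang, T., Berthet, Q. and Samworth, R. J. (2016a) Statistical and computational trade-offs in estimation of sparse principal components. Ann. Statist., 44, 1896–1930.)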
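
To get a feel for why (2) is hard, here is a minimal brute-force sketch. It assumes (2) is the usual k-sparse formulation, maximizing $v^\top \hat\Sigma v$ over unit vectors with at most k nonzero entries. The function name and the toy matrix are illustrative, not from the paper. The search enumerates every support of size k, so its cost grows combinatorially in p.

```python
from itertools import combinations

import numpy as np


def sparse_pca_brute_force(sigma_hat, k):
    """Exhaustive search over size-k supports (illustrative only)."""
    p = sigma_hat.shape[0]
    best_value, best_vec = -np.inf, None
    for support in combinations(range(p), k):
        idx = np.array(support)
        sub = sigma_hat[np.ix_(idx, idx)]       # k x k principal submatrix
        eigvals, eigvecs = np.linalg.eigh(sub)  # eigenvalues in ascending order
        if eigvals[-1] > best_value:
            best_value = eigvals[-1]
            best_vec = np.zeros(p)
            best_vec[idx] = eigvecs[:, -1]
    return best_value, best_vec


# Toy covariance: identity plus a spike on coordinates 0 and 1.
v_true = np.zeros(6)
v_true[:2] = 1 / np.sqrt(2)
sigma = np.eye(6) + 3.0 * np.outer(v_true, v_true)

value, v_hat = sparse_pca_brute_force(sigma, k=2)
print(value, np.nonzero(v_hat)[0])
```

Already at p = 1000 and k = 10 the number of supports is astronomically large, which is why the penalized, relaxed, and iterative alternatives mentioned in these notes exist.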

Iterative algorithms do well under certain conditions, namely when the initial value is well aligned with the truth:
"Various fast, iterative algorithms were introduced by Johnstone and Lu (2009), Paul and Johnstone (2012), and Ma (2013); these have been shown to attain the minimax rate under certain conditions, provided that the initial starting point is reasonably well-aligned with the true signal."
The loss function:

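The figure reports averages over 100 repetitions.
Iterative algorithms: with a bad initial value, everything falls apart. "Remarkably, each of the previously proposed algorithms we tested produces estimates that are almost orthogonal to the true principal component!" Tsk, that exclamation mark; I suspect the authors are laughing. Laughing at what, though? You ran those programs with their default initialization... wait, actually that does seem fair game to laugh at, lol.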
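
The formula that followed "the loss function" did not survive in these notes, so what follows is only an assumption. A common loss for comparing unit vectors in this literature is the sine of the angle between them, $L(u, v) = \sqrt{1 - (u^\top v)^2}$. Under that assumed loss, "almost orthogonal" means a value close to 1.

```python
import math


def sin_angle_loss(u, v):
    """Sine of the angle between u and v; invariant to sign flips (assumed loss)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    cos = dot / (norm_u * norm_v)
    # Clamp at zero to guard against tiny negative values from rounding.
    return math.sqrt(max(0.0, 1.0 - cos * cos))


v_true = [1.0, 0.0, 0.0]
same_up_to_sign = sin_angle_loss(v_true, [-1.0, 0.0, 0.0])
nearly_orthogonal = sin_angle_loss(v_true, [0.05, 1.0, 0.0])
print(same_up_to_sign, nearly_orthogonal)
```

The first value is zero because an eigenvector is only defined up to sign; the second is close to one, which is the situation the quoted sentence describes.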

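The authors are clearly pleased: "our algorithm, which we refer to as SPCAvRP and implement in a publicly available R package, is also attractive for both theoretical and computational reasons". Compared with earlier methods, which either lack theoretical support or are slow, this is indeed a nice trade-off.
When the effective sample size is large, reaching the target result only requires the number of RPs to grow slightly faster as the dimension p increases.
This does not contradict the 2016a paper, though, "which applies to an intermediate effective sample size regime where the SPCAvRP algorithm would require an exponential number of projections to attain the optimal rate."
Awkwardly (?), the authors' algorithm is parallelizable (Lin Yuan: laughs, pounding the table), and it does not need to compute the estimate of $\Sigma$, because using the RPs to extract the relevant submatrices is enough (2016a: ?). When the dimension p is very large, this saves a significant amount of computation.
Section 4 mentions that the algorithms are compared through numerical experiments and finite-sample estimation on real data.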
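
A rough sketch of the idea described above: draw random subsets of d coordinates, look only at the corresponding d x d covariance blocks, keep the best projection within each group, aggregate per-coordinate scores, and keep the highest-scoring coordinates. The grouping into `A` groups of `B` projections, the scoring rule, the parameter names, and the toy data are assumptions made for illustration. This is not the authors' R package, and its details may differ from the algorithm in the paper.

```python
import numpy as np


def spcavrp_sketch(X, d, ell, A=100, B=20, seed=0):
    """Simplified sparse PCA via random coordinate-subset projections."""
    rng = np.random.default_rng(seed)
    n, p = X.shape
    Xc = X - X.mean(axis=0)
    scores = np.zeros(p)
    for _ in range(A):  # groups are independent, so they could run in parallel
        best_val, best_idx, best_vec = -np.inf, None, None
        for _ in range(B):
            idx = rng.choice(p, size=d, replace=False)  # random d coordinates
            sub = Xc[:, idx].T @ Xc[:, idx] / n         # only a d x d block
            vals, vecs = np.linalg.eigh(sub)
            if vals[-1] > best_val:
                best_val, best_idx, best_vec = vals[-1], idx, vecs[:, -1]
        # Assumed scoring rule: eigenvalue-weighted squared loadings.
        scores[best_idx] += best_val * best_vec**2 / A
    support = np.sort(np.argsort(scores)[-ell:])        # top-ell coordinates
    sub = Xc[:, support].T @ Xc[:, support] / n
    v_hat = np.zeros(p)
    v_hat[support] = np.linalg.eigh(sub)[1][:, -1]
    return v_hat, support


# Toy data: a strong spike on the first 3 of 30 coordinates.
rng = np.random.default_rng(1)
p, n, k = 30, 400, 3
v_true = np.zeros(p)
v_true[:k] = 1 / np.sqrt(k)
cov = np.eye(p) + 10.0 * np.outer(v_true, v_true)
X = rng.multivariate_normal(np.zeros(p), cov, size=n)

v_hat, support = spcavrp_sketch(X, d=3, ell=3)
print(support, abs(v_hat @ v_true))
```

Note that the full p x p sample covariance is never formed, and there is no starting point to initialize, which matches the two advantages the notes highlight.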
The paper also refers to greedy algorithms: "We also mention the computationally efficient combinatorial approaches proposed by Moghaddam, Weiss and Avidan (2006) and d'Aspremont, Bach and El Ghaoui (2008) that aim to find solutions to the optimization problem in (2) using greedy methods."
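
For intuition, a generic forward-selection sketch of what "greedy" can mean here: grow the support one coordinate at a time, each time adding the coordinate that most increases the top eigenvalue of the selected block. This is a plain illustration under the assumption that (2) is the k-sparse variance maximization; it is not the exact procedure of either cited paper, and the function name is made up.

```python
import numpy as np


def greedy_forward_support(sigma_hat, k):
    """Forward selection: add the coordinate giving the largest top eigenvalue."""
    p = sigma_hat.shape[0]
    support = []
    best_val = -np.inf
    for _ in range(k):
        best_j, best_val = None, -np.inf
        for j in range(p):
            if j in support:
                continue
            idx = support + [j]
            # Top eigenvalue of the candidate block (eigvalsh sorts ascending).
            val = np.linalg.eigvalsh(sigma_hat[np.ix_(idx, idx)])[-1]
            if val > best_val:
                best_j, best_val = j, val
        support.append(best_j)
    return sorted(support), best_val


# Toy covariance with a spike on coordinates 1, 4 and 6.
v_true = np.zeros(8)
v_true[[1, 4, 6]] = 1 / np.sqrt(3)
sigma = np.eye(8) + 5.0 * np.outer(v_true, v_true)

support, value = greedy_forward_support(sigma, k=3)
print(support, value)
```

This costs on the order of k times p small eigenvalue problems instead of enumerating all supports, at the price of no guarantee of finding the global maximizer in general.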
RPs are used quite widely, and there are examples:
Notation: