
A summary of sklearn.covariance


sklearn.covariance provides three families of estimators: EmpiricalCovariance and related estimators, shrinkage estimators, and GraphLasso.

EmpiricalCovariance: the maximum likelihood covariance estimator.
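
As a minimal sketch of the maximum likelihood estimator, the snippet below fits EmpiricalCovariance on synthetic Gaussian data and compares it with NumPy's biased covariance (division by N). The data, the seed, and the variable names are assumptions made for illustration.

```python
import numpy as np
from sklearn.covariance import EmpiricalCovariance

# Synthetic data (illustrative assumption): 500 samples, 2 features.
rng = np.random.RandomState(0)
true_cov = np.array([[2.0, 0.6],
                     [0.6, 1.0]])
X = rng.multivariate_normal(mean=[0.0, 0.0], cov=true_cov, size=500)

# Fit the maximum likelihood estimator.
emp = EmpiricalCovariance().fit(X)

# The ML estimate divides by N (not N - 1), so it should agree with
# NumPy's biased covariance.
ml_cov = np.cov(X, rowvar=False, bias=True)

print(emp.covariance_)                       # estimated covariance matrix
print(np.allclose(emp.covariance_, ml_cov))  # agreement with np.cov(bias=True)
```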

If the sample dataset contains noisy data (outliers), we use MinCovDet to get a robust covariance estimate.

MCD: https://tr8dr.wordpress.com/2010/09/24/minimum-covariance-determination/

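MCD looks for the subset of the samples (of a given size) whose covariance matrix has the smallest determinant. Equivalently, it chooses the tolerance ellipse with minimum volume, since that volume is proportional to the square root of the determinant of the subset's covariance matrix.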

EllipticEnvelope, which uses the MCD estimator as its covariance estimator, can then be used to detect outliers.
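
The robust estimator and the outlier detector can be sketched together as follows. The contaminated dataset, the contamination level of 0.1, and the seeds are assumptions chosen for illustration, not values from the original notes.

```python
import numpy as np
from sklearn.covariance import EmpiricalCovariance, MinCovDet, EllipticEnvelope

# Synthetic data (illustrative assumption): 200 Gaussian points,
# of which the first 20 are overwritten with gross outliers.
rng = np.random.RandomState(42)
X = rng.multivariate_normal([0.0, 0.0], [[1.0, 0.3], [0.3, 1.0]], size=200)
X[:20] = rng.uniform(low=8.0, high=10.0, size=(20, 2))

# The empirical estimate is pulled towards the outliers,
# the MCD estimate is fitted on the "clean" part of the data.
emp = EmpiricalCovariance().fit(X)
mcd = MinCovDet(random_state=0).fit(X)
print("empirical location:", emp.location_)
print("MCD location:      ", mcd.location_)

# EllipticEnvelope relies on MCD internally; predict() returns
# +1 for inliers and -1 for outliers.
env = EllipticEnvelope(contamination=0.1, random_state=0).fit(X)
labels = env.predict(X)
print("points flagged as outliers:", int((labels == -1).sum()))
```

The contamination parameter is the expected fraction of outliers; here it is set to match the 20 planted points, which in practice is rarely known exactly.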


Besides noisy data, we may also encounter the situation where the number of data points N is small and the number of features P is large. Shrinkage methods can handle this problem.


ShrunkCovariance: covariance estimator with a fixed, user-chosen shrinkage coefficient.
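
For reference, the shrinkage transformation as described in the scikit-learn user guide can be written as below. The symbols are notation assumed here: \hat{\Sigma} is the empirical covariance, \alpha the shrinkage coefficient in [0, 1], P the number of features, and I the identity matrix.

```latex
\Sigma_{\text{shrunk}} = (1 - \alpha)\,\hat{\Sigma} + \alpha\,\frac{\operatorname{Tr}\hat{\Sigma}}{P}\,I
```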

LedoitWolf and OAS compute the shrinkage coefficient from a closed-form formula instead of requiring it to be set by hand, so they are usually the better choices.
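
A small sketch of the N < P situation: the empirical covariance is rank-deficient, while the shrunk estimates are not. The sample sizes, the fixed shrinkage value of 0.1, and the random data are illustrative assumptions.

```python
import numpy as np
from sklearn.covariance import EmpiricalCovariance, ShrunkCovariance, LedoitWolf, OAS

# Fewer samples than features (illustrative assumption): N = 20, P = 40.
rng = np.random.RandomState(0)
n_samples, n_features = 20, 40
X = rng.normal(size=(n_samples, n_features))

emp = EmpiricalCovariance().fit(X)
shrunk = ShrunkCovariance(shrinkage=0.1).fit(X)  # coefficient fixed by hand
lw = LedoitWolf().fit(X)                         # coefficient computed from data
oas = OAS().fit(X)                               # coefficient computed from data

# With N < P the empirical covariance cannot have full rank.
rank_emp = np.linalg.matrix_rank(emp.covariance_)
rank_lw = np.linalg.matrix_rank(lw.covariance_)
print("rank of empirical covariance:", rank_emp, "of", n_features)
print("rank of Ledoit-Wolf covariance:", rank_lw, "of", n_features)

# Shrinkage coefficients chosen by the two data-driven estimators.
print("Ledoit-Wolf shrinkage:", lw.shrinkage_)
print("OAS shrinkage:", oas.shrinkage_)

# ShrunkCovariance is a convex combination of the empirical covariance
# and a scaled identity matrix.
mu = np.trace(emp.covariance_) / n_features
manual = 0.9 * emp.covariance_ + 0.1 * mu * np.eye(n_features)
print(np.allclose(shrunk.covariance_, manual))
```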


GraphLasso: sparse inverse covariance estimation with an l1-penalized estimator.
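
The l1-penalized objective, as stated in the scikit-learn user guide, is reproduced here for reference. The notation is assumed: S is the sample covariance matrix, \alpha is the penalty weight, and \lVert K \rVert_1 is the sum of the absolute values of the off-diagonal coefficients of K.

```latex
\hat{K} = \arg\min_{K} \left( \operatorname{tr}(S K) - \log\det K + \alpha \lVert K \rVert_1 \right)
```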


where K is the precision matrix, i.e. the inverse of the covariance matrix.

GraphLasso (and GraphLassoCV, which selects the penalty by cross-validation) is another way to estimate the precision matrix when N is small and P is large. In particular, GraphLasso tends to work better than shrinkage methods when N<P.

By using GraphLasso, we obtain a sparse precision matrix with a useful property: if two features are independent conditionally on the others, the corresponding coefficient in the precision matrix is zero.
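
The conditional-independence property can be illustrated with a three-feature chain in which features 0 and 2 are linked only through feature 1. Note that in current scikit-learn releases the class is named GraphicalLasso (GraphLasso is the older name), so the sketch uses that. The ground-truth precision matrix, alpha=0.05, and the sample size are assumptions made for illustration.

```python
import numpy as np
from sklearn.covariance import GraphicalLasso

# Ground-truth precision matrix (illustrative assumption): a chain 0 - 1 - 2.
# The zero at position (0, 2) means features 0 and 2 are independent
# conditionally on feature 1.
true_prec = np.array([[1.0, 0.4, 0.0],
                      [0.4, 1.0, 0.4],
                      [0.0, 0.4, 1.0]])
true_cov = np.linalg.inv(true_prec)

rng = np.random.RandomState(0)
X = rng.multivariate_normal(np.zeros(3), true_cov, size=2000)

# alpha is the weight of the l1 penalty; larger values give sparser results.
model = GraphicalLasso(alpha=0.05).fit(X)
K = model.precision_

np.set_printoptions(precision=3, suppress=True)
print(K)
# The (0, 2) coefficient should be much smaller in magnitude than the
# (0, 1) coefficient, reflecting the missing direct link.
print(abs(K[0, 2]), abs(K[0, 1]))
```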
