the summary of sklearn.covariance

原创 2017年01月03日 10:00:48

sklearn.covariance has three categories:EmpiricalCovariance and so on,Shrunkage,GraphLasso.

EmpiricalCovariance:Maximum likelihood covariance estimator.

If sample dataset has noisy data,we use MinCovDet to get a robust covariance estimator.

MCD:https://tr8dr.wordpress.com/2010/09/24/minimum-covariance-determination/

MCD want to choose a  tolerance ellipse have minimum volume which is equaled to the determinant of splitted sample matrix by given accuracy.

And we can use EllipticEnvelope which use MCD estimator as covariance estimator to detect outlier.


Except for noisy data,we also encounter a situation where the number of data point N is small and the number of feature P is large.We can use Shrunkage method to handle this problem.


ShrunkageCovariance:

And LediotWolf and OAS can use certain formula to compute shrinkage  .They are the better choices.


GraphLasso:Sparse inverse covariance estimation with an l1-penalized estimator.


where K is precision matrix.

GraphLasso(and GraphLasso) is another method to estimate precison matrix when N is samll and P is large,especically GraphLasso is always better than Shrunkage method when N<P.

By using GraphLasso,we can get a sparse precision matrix which has good-condition: if two features are independent conditionally on the others, the corresponding coefficient in the precision matrix will be zero.

版权声明:本文为博主原创文章,未经博主允许不得转载。

相关文章推荐

Summary_of_Changes_OCP_chinese

  • 2012-10-09 15:25
  • 115KB
  • 下载

A summary of Change in systemverilog 2012

ref:http://www.sutherland-hdl.com/papers/2012-DAC_What-is-new-in-SystemVerilog-2012.pdf (只提取一些目前自己感...

summary of design patterns

  • 2009-10-31 07:56
  • 1.09MB
  • 下载

pci_dss_v2_summary_of_changes.pdf

  • 2011-01-09 17:19
  • 397KB
  • 下载

Introduction to Algorithm - Summary of Chapter 8 - Sorting in Linear Time

算法导论课后习题解析 第八章 算法导论课后总结 第八章

Summary of random signal analysis

  • 2012-08-29 21:10
  • 760KB
  • 下载

操作系统概述操——Summary of the operating system

一、操作系统的基本概念     1.计算机系统的资源     在计算机系统中,资源包括两类,即硬件资源和软件资源。软件资源是计算机的灵魂,主要包括各种程序和数据以及操作系统本身等。操作系统的重要...

Summary of High-Speed Tracking with Kernelized Correlation Filters

With the theory of circulant matrices, this paper present a classification trained with all patches ...
内容举报
返回顶部
收藏助手
不良信息举报
您举报文章:深度学习:神经网络中的前向传播和反向传播算法推导
举报原因:
原因补充:

(最多只允许输入30个字)