python中var是什么_python中的numpy var()和statisticsvariant()有什么区别?

1586010002-jmsa.png

I was trying one Dataquest exercise and I figured out that the variance I am getting is different for the two packages..

e.g for [1,2,3,4]

from statistics import variance

import numpy as np

print(np.var([1,2,3,4]))

print(variance([1,2,3,4]))

//1.25

//1.6666666666666667

The expected answer of the exercise is calculated with np.var()

Edit

I guess it has to do that the later one is sample variance and not variance.. Anyone could explain the difference?

解决方案

Use this

print(np.var([1,2,3,4],ddof=1))

1.66666666667

Delta Degrees of Freedom: the divisor used in the calculation is N - ddof, where N represents the number of elements. By default, ddof is zero.

The mean is normally calculated as x.sum() / N, where N = len(x). If, however, ddof is specified, the divisor N - ddof is used instead.

In standard statistical practice, ddof=1 provides an unbiased estimator of the variance of a hypothetical infinite population. ddof=0 provides a maximum likelihood estimate of the variance for normally distributed variables.

Statistical libraries like numpy use the variance n for what they call var or variance and the standard deviation

For more information refer this documentation : numpy doc

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值