I need to calculate the column means of an array with more than 1000 rows.
np.mean(some_array) gives me
inf as output
but I am pretty sure the values are OK. I am loading a CSV from here into my Data variable, and the column 'Cement' looks "healthy" from my point of view.
In[254]:np.mean(Data[:230]['Cement'])
Out[254]:275.75
but if I increase the number of rows, the problem starts:
In [259]:np.mean(Data[:237]['Cement'])
Out[259]:inf
But when I look at the data:
In [261]:Data[230:237]['Cement']
Out[261]:
array([[ 425. ],
[ 333. ],
[ 250.25],
[ 491. ],
[ 160. ],
[ 229.75],
[ 338. ]], dtype=float16)
I cannot find a reason for this behaviour.
P.S. This happens in Python 3.x using Wakari (cloud-based IPython), NumPy version 1.8.1.
I am loading the Data with:
No_Col = 9
conv = lambda valstr: float(valstr.replace(',', '.'))
c = {}
for i in range(No_Col):
    c[i] = conv
Data = np.genfromtxt(get_data, dtype=np.float16, delimiter='\t',
                     skip_header=0, names=True, converters=c)
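For reference, here is a minimal self-contained version of this loading step. The inline sample and its column names are hypothetical stand-ins for the real file (the original reads from a get_data handle that is not shown); the encoding keyword requires a newer NumPy than 1.8.1:

```python
import io
import numpy as np

# Hypothetical two-column, tab-separated sample with comma decimal
# separators, standing in for the real CSV.
sample = "Cement\tWater\n425,0\t160,5\n333,0\t150,0\n"

conv = lambda valstr: float(valstr.replace(',', '.'))
c = {i: conv for i in range(2)}

# encoding='utf-8' makes genfromtxt pass str (not bytes) to the converters
# on recent NumPy versions; dtype=np.float64 keeps full precision.
Data = np.genfromtxt(io.StringIO(sample), dtype=np.float64, delimiter='\t',
                     names=True, converters=c, encoding='utf-8')

print(Data['Cement'])  # [425. 333.]
```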
Solution
I will guess that the problem is precision (as others have commented). Quoting directly from the documentation for mean():
Notes
The arithmetic mean is the sum of the elements along the axis divided
by the number of elements.
Note that for floating-point input, the mean is computed using the
same precision the input has. Depending on the input data, this can
cause the results to be inaccurate, especially for float32 (see
example below). Specifying a higher-precision accumulator using the
dtype keyword can alleviate this issue.
Since your array is of type float16, you have very limited precision: the largest value float16 can represent is 65504, so the running sum overflows to inf once it exceeds that. Using dtype=np.float64 will probably avoid the overflow. Also see the examples in the mean() documentation.