Datawhale Task05 大作业打卡

导入植物数据集,保持文本不变

import numpy as np
outfile = r'D:\Ajupyter\Dadawhale numpy\iris.csv'
iris_csv = np.loadtxt(outfile,dtype = object,delimiter = ',',skiprows = 1)
print(iris_csv[0:10])
[['5.1' '3.5' '1.4' '0.2' 'Iris-setosa']
 ['4.9' '3' '1.4' '0.2' 'Iris-setosa']
 ['4.7' '3.2' '1.3' '0.2' 'Iris-setosa']
 ['4.6' '3.1' '1.5' '0.2' 'Iris-setosa']
 ['5' '3.6' '1.4' '0.2' 'Iris-setosa']
 ['5.4' '3.9' '1.7' '0.4' 'Iris-setosa']
 ['4.6' '3.4' '1.4' '0.3' 'Iris-setosa']
 ['5' '3.4' '1.5' '0.2' 'Iris-setosa']
 ['4.4' '2.9' '1.4' '0.2' 'Iris-setosa']
 ['4.9' '3.1' '1.5' '0.1' 'Iris-setosa']]

求出鸢尾属植物萼片长度的平均值、中位数和标准差

import numpy as np
outfile = r'D:\Ajupyter\Dadawhale numpy\iris.csv'
sepalLength = np.loadtxt(outfile,dtype = float,delimiter = ',',skiprows = 1,usecols = [0])
print(sepalLength[0:10])
[5.1 4.9 4.7 4.6 5.  5.4 4.6 5.  4.4 4.9]
print(np.mean(sepalLength))
5.843333333333334
print(np.median(sepalLength))
5.8
print(np.std(sepalLength))
0.8253012917851409

创建一种标准化形式的萼片长度

import numpy as np
outfile = r'D:\Ajupyter\Dadawhale numpy\iris.csv'
sepalLength = np.loadtxt(outfile,dtype = float,delimiter = ',',skiprows = 1,usecols = [0])

aMax = np.amax(sepalLength)
aMin = np.amin(sepalLength)
x = (sepalLength - aMin) / (aMax - aMin)
print(x[0:10])
[0.22222222 0.16666667 0.11111111 0.08333333 0.19444444 0.30555556
 0.08333333 0.19444444 0.02777778 0.16666667]

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值