numpy 5 study task

from sklearn.datasets import load_iris
data = load_iris()
import numpy as np
import pandas as pd
# List the attributes and methods available on data
print(dir(data))
['DESCR', 'data', 'feature_names', 'filename', 'frame', 'target', 'target_names']
# Show the dataset description
print(data.DESCR)
print(type(data))
.. _iris_dataset:

Iris plants dataset
--------------------

**Data Set Characteristics:**

    :Number of Instances: 150 (50 in each of three classes)
    :Number of Attributes: 4 numeric, predictive attributes and the class
    :Attribute Information:
        - sepal length in cm
        - sepal width in cm
        - petal length in cm
        - petal width in cm
        - class:
                - Iris-Setosa
                - Iris-Versicolour
                - Iris-Virginica
                
    :Summary Statistics:

    ============== ==== ==== ======= ===== ====================
                    Min  Max   Mean    SD   Class Correlation
    ============== ==== ==== ======= ===== ====================
    sepal length:   4.3  7.9   5.84   0.83    0.7826
    sepal width:    2.0  4.4   3.05   0.43   -0.4194
    petal length:   1.0  6.9   3.76   1.76    0.9490  (high!)
    petal width:    0.1  2.5   1.20   0.76    0.9565  (high!)
    ============== ==== ==== ======= ===== ====================

    :Missing Attribute Values: None
    :Class Distribution: 33.3% for each of 3 classes.
    :Creator: R.A. Fisher
    :Donor: Michael Marshall (MARSHALL%PLU@io.arc.nasa.gov)
    :Date: July, 1988

The famous Iris database, first used by Sir R.A. Fisher. The dataset is taken
from Fisher's paper. Note that it's the same as in R, but not as in the UCI
Machine Learning Repository, which has two wrong data points.

This is perhaps the best known database to be found in the
pattern recognition literature.  Fisher's paper is a classic in the field and
is referenced frequently to this day.  (See Duda & Hart, for example.)  The
data set contains 3 classes of 50 instances each, where each class refers to a
type of iris plant.  One class is linearly separable from the other 2; the
latter are NOT linearly separable from each other.

.. topic:: References

   - Fisher, R.A. "The use of multiple measurements in taxonomic problems"
     Annual Eugenics, 7, Part II, 179-188 (1936); also in "Contributions to
     Mathematical Statistics" (John Wiley, NY, 1950).
   - Duda, R.O., & Hart, P.E. (1973) Pattern Classification and Scene Analysis.
     (Q327.D83) John Wiley & Sons.  ISBN 0-471-22361-1.  See page 218.
   - Dasarathy, B.V. (1980) "Nosing Around the Neighborhood: A New System
     Structure and Classification Rule for Recognition in Partially Exposed
     Environments".  IEEE Transactions on Pattern Analysis and Machine
     Intelligence, Vol. PAMI-2, No. 1, 67-71.
   - Gates, G.W. (1972) "The Reduced Nearest Neighbor Rule".  IEEE Transactions
     on Information Theory, May 1972, 431-433.
   - See also: 1988 MLC Proceedings, 54-64.  Cheeseman et al"s AUTOCLASS II
     conceptual clustering system finds 3 classes in the data.
   - Many, many more ...
<class 'sklearn.utils.Bunch'>
data.target_names
array(['setosa', 'versicolor', 'virginica'], dtype='<U10')
data.feature_names
['sepal length (cm)',
 'sepal width (cm)',
 'petal length (cm)',
 'petal width (cm)']
species = []
for i in data.target:
    if i==0:
        species.append('Iris-setosa')
    elif i==1:
        species.append('Iris-versicolor')
    else:
        species.append('Iris-virginica')
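The loop above maps each integer class code to a label string. A minimal sketch of the same mapping done with NumPy integer indexing follows; the arrays `target` and `names` are small hypothetical stand-ins for `data.target` and the label strings, not the real dataset.

```python
import numpy as np

# Toy stand-ins (assumed values) for data.target and the label strings used above.
target = np.array([0, 0, 1, 2, 1])
names = np.array(['Iris-setosa', 'Iris-versicolor', 'Iris-virginica'])

# Integer (fancy) indexing: each class code selects the label at that position.
species_vec = names[target].tolist()
print(species_vec)
```

This avoids the explicit if/elif chain, at the cost of requiring that every code is a valid index into `names`.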
data.data.shape
(150, 4)
data_spes = data.target_names
data = pd.DataFrame(data.data, columns=data.feature_names)
data['species'] = species
data.head()
   sepal length (cm)  sepal width (cm)  petal length (cm)  petal width (cm)      species
0                5.1               3.5                1.4               0.2  Iris-setosa
1                4.9               3.0                1.4               0.2  Iris-setosa
2                4.7               3.2                1.3               0.2  Iris-setosa
3                4.6               3.1                1.5               0.2  Iris-setosa
4                5.0               3.6                1.4               0.2  Iris-setosa
# Write to a CSV file
data.to_csv('./task.csv', sep=',', header=True, index=False)
data1 = data.values
# np.float was removed in NumPy 1.24; the builtin float is equivalent
data1 = data1[:,:4].astype(float)
print(data1)
[[5.1 3.5 1.4 0.2]
 [4.9 3.  1.4 0.2]
 [4.7 3.2 1.3 0.2]
 [4.6 3.1 1.5 0.2]
 [5.  3.6 1.4 0.2]
 [5.4 3.9 1.7 0.4]
 [4.6 3.4 1.4 0.3]
 [5.  3.4 1.5 0.2]
 [4.4 2.9 1.4 0.2]
 [4.9 3.1 1.5 0.1]
 [5.4 3.7 1.5 0.2]
 [4.8 3.4 1.6 0.2]
 [4.8 3.  1.4 0.1]
 [4.3 3.  1.1 0.1]
 [5.8 4.  1.2 0.2]
 [5.7 4.4 1.5 0.4]
 [5.4 3.9 1.3 0.4]
 [5.1 3.5 1.4 0.3]
 [5.7 3.8 1.7 0.3]
 [5.1 3.8 1.5 0.3]
 [5.4 3.4 1.7 0.2]
 [5.1 3.7 1.5 0.4]
 [4.6 3.6 1.  0.2]
 [5.1 3.3 1.7 0.5]
 [4.8 3.4 1.9 0.2]
 [5.  3.  1.6 0.2]
 [5.  3.4 1.6 0.4]
 [5.2 3.5 1.5 0.2]
 [5.2 3.4 1.4 0.2]
 [4.7 3.2 1.6 0.2]
 [4.8 3.1 1.6 0.2]
 [5.4 3.4 1.5 0.4]
 [5.2 4.1 1.5 0.1]
 [5.5 4.2 1.4 0.2]
 [4.9 3.1 1.5 0.2]
 [5.  3.2 1.2 0.2]
 [5.5 3.5 1.3 0.2]
 [4.9 3.6 1.4 0.1]
 [4.4 3.  1.3 0.2]
 [5.1 3.4 1.5 0.2]
 [5.  3.5 1.3 0.3]
 [4.5 2.3 1.3 0.3]
 [4.4 3.2 1.3 0.2]
 [5.  3.5 1.6 0.6]
 [5.1 3.8 1.9 0.4]
 [4.8 3.  1.4 0.3]
 [5.1 3.8 1.6 0.2]
 [4.6 3.2 1.4 0.2]
 [5.3 3.7 1.5 0.2]
 [5.  3.3 1.4 0.2]
 [7.  3.2 4.7 1.4]
 [6.4 3.2 4.5 1.5]
 [6.9 3.1 4.9 1.5]
 [5.5 2.3 4.  1.3]
 [6.5 2.8 4.6 1.5]
 [5.7 2.8 4.5 1.3]
 [6.3 3.3 4.7 1.6]
 [4.9 2.4 3.3 1. ]
 [6.6 2.9 4.6 1.3]
 [5.2 2.7 3.9 1.4]
 [5.  2.  3.5 1. ]
 [5.9 3.  4.2 1.5]
 [6.  2.2 4.  1. ]
 [6.1 2.9 4.7 1.4]
 [5.6 2.9 3.6 1.3]
 [6.7 3.1 4.4 1.4]
 [5.6 3.  4.5 1.5]
 [5.8 2.7 4.1 1. ]
 [6.2 2.2 4.5 1.5]
 [5.6 2.5 3.9 1.1]
 [5.9 3.2 4.8 1.8]
 [6.1 2.8 4.  1.3]
 [6.3 2.5 4.9 1.5]
 [6.1 2.8 4.7 1.2]
 [6.4 2.9 4.3 1.3]
 [6.6 3.  4.4 1.4]
 [6.8 2.8 4.8 1.4]
 [6.7 3.  5.  1.7]
 [6.  2.9 4.5 1.5]
 [5.7 2.6 3.5 1. ]
 [5.5 2.4 3.8 1.1]
 [5.5 2.4 3.7 1. ]
 [5.8 2.7 3.9 1.2]
 [6.  2.7 5.1 1.6]
 [5.4 3.  4.5 1.5]
 [6.  3.4 4.5 1.6]
 [6.7 3.1 4.7 1.5]
 [6.3 2.3 4.4 1.3]
 [5.6 3.  4.1 1.3]
 [5.5 2.5 4.  1.3]
 [5.5 2.6 4.4 1.2]
 [6.1 3.  4.6 1.4]
 [5.8 2.6 4.  1.2]
 [5.  2.3 3.3 1. ]
 [5.6 2.7 4.2 1.3]
 [5.7 3.  4.2 1.2]
 [5.7 2.9 4.2 1.3]
 [6.2 2.9 4.3 1.3]
 [5.1 2.5 3.  1.1]
 [5.7 2.8 4.1 1.3]
 [6.3 3.3 6.  2.5]
 [5.8 2.7 5.1 1.9]
 [7.1 3.  5.9 2.1]
 [6.3 2.9 5.6 1.8]
 [6.5 3.  5.8 2.2]
 [7.6 3.  6.6 2.1]
 [4.9 2.5 4.5 1.7]
 [7.3 2.9 6.3 1.8]
 [6.7 2.5 5.8 1.8]
 [7.2 3.6 6.1 2.5]
 [6.5 3.2 5.1 2. ]
 [6.4 2.7 5.3 1.9]
 [6.8 3.  5.5 2.1]
 [5.7 2.5 5.  2. ]
 [5.8 2.8 5.1 2.4]
 [6.4 3.2 5.3 2.3]
 [6.5 3.  5.5 1.8]
 [7.7 3.8 6.7 2.2]
 [7.7 2.6 6.9 2.3]
 [6.  2.2 5.  1.5]
 [6.9 3.2 5.7 2.3]
 [5.6 2.8 4.9 2. ]
 [7.7 2.8 6.7 2. ]
 [6.3 2.7 4.9 1.8]
 [6.7 3.3 5.7 2.1]
 [7.2 3.2 6.  1.8]
 [6.2 2.8 4.8 1.8]
 [6.1 3.  4.9 1.8]
 [6.4 2.8 5.6 2.1]
 [7.2 3.  5.8 1.6]
 [7.4 2.8 6.1 1.9]
 [7.9 3.8 6.4 2. ]
 [6.4 2.8 5.6 2.2]
 [6.3 2.8 5.1 1.5]
 [6.1 2.6 5.6 1.4]
 [7.7 3.  6.1 2.3]
 [6.3 3.4 5.6 2.4]
 [6.4 3.1 5.5 1.8]
 [6.  3.  4.8 1.8]
 [6.9 3.1 5.4 2.1]
 [6.7 3.1 5.6 2.4]
 [6.9 3.1 5.1 2.3]
 [5.8 2.7 5.1 1.9]
 [6.8 3.2 5.9 2.3]
 [6.7 3.3 5.7 2.5]
 [6.7 3.  5.2 2.3]
 [6.3 2.5 5.  1.9]
 [6.5 3.  5.2 2. ]
 [6.2 3.4 5.4 2.3]
 [5.9 3.  5.1 1.8]]
# 2 Compute summary statistics
print("Mean: ", data1[:, 0].mean(), end="   ")
print(data['sepal length (cm)'].mean())
print("Median: ", np.median(data1[:, 0]), end="   ")
print(data['sepal length (cm)'].median())
# Population standard deviation (ddof=0); ddof=1 gives the sample standard deviation (divides by n-1)
print("Std: ", data1[:, 0].std(), end="   ")
print(data['sepal length (cm)'].std(ddof=0))
Mean:  5.843333333333334   5.843333333333335
Median:  5.8   5.8
Std:  0.8253012917851409   0.8253012917851409
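The two standard deviations above agree only because `ddof=0` is passed to pandas explicitly: NumPy's `std` defaults to `ddof=0`, while pandas' `Series.std` defaults to `ddof=1`. A small sketch of the difference on toy data (the array `x` is an illustrative assumption, not part of the dataset):

```python
import numpy as np

# Toy data (assumed): mean 5, sum of squared deviations 32.
x = np.array([2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0])

pop_std = x.std()            # ddof=0: divide by n (population standard deviation)
sample_std = x.std(ddof=1)   # ddof=1: divide by n-1 (sample standard deviation)
print(pop_std, sample_std)
```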
data1[:,0]
array([5.1, 4.9, 4.7, 4.6, 5. , 5.4, 4.6, 5. , 4.4, 4.9, 5.4, 4.8, 4.8,
       4.3, 5.8, 5.7, 5.4, 5.1, 5.7, 5.1, 5.4, 5.1, 4.6, 5.1, 4.8, 5. ,
       5. , 5.2, 5.2, 4.7, 4.8, 5.4, 5.2, 5.5, 4.9, 5. , 5.5, 4.9, 4.4,
       5.1, 5. , 4.5, 4.4, 5. , 5.1, 4.8, 5.1, 4.6, 5.3, 5. , 7. , 6.4,
       6.9, 5.5, 6.5, 5.7, 6.3, 4.9, 6.6, 5.2, 5. , 5.9, 6. , 6.1, 5.6,
       6.7, 5.6, 5.8, 6.2, 5.6, 5.9, 6.1, 6.3, 6.1, 6.4, 6.6, 6.8, 6.7,
       6. , 5.7, 5.5, 5.5, 5.8, 6. , 5.4, 6. , 6.7, 6.3, 5.6, 5.5, 5.5,
       6.1, 5.8, 5. , 5.6, 5.7, 5.7, 6.2, 5.1, 5.7, 6.3, 5.8, 7.1, 6.3,
       6.5, 7.6, 4.9, 7.3, 6.7, 7.2, 6.5, 6.4, 6.8, 5.7, 5.8, 6.4, 6.5,
       7.7, 7.7, 6. , 6.9, 5.6, 7.7, 6.3, 6.7, 7.2, 6.2, 6.1, 6.4, 7.2,
       7.4, 7.9, 6.4, 6.3, 6.1, 7.7, 6.3, 6.4, 6. , 6.9, 6.7, 6.9, 5.8,
       6.8, 6.7, 6.7, 6.3, 6.5, 6.2, 5.9])
# Min-max normalization of the first column (overwrites data1[:, 0] in place)
_range = np.max(data1[:, 0])-np.min(data1[:, 0])
data1[:, 0] = (data1[:, 0]-data1[:,0].min())/_range
data1[:, 0]
array([0.22222222, 0.16666667, 0.11111111, 0.08333333, 0.19444444,
       0.30555556, 0.08333333, 0.19444444, 0.02777778, 0.16666667,
       0.30555556, 0.13888889, 0.13888889, 0.        , 0.41666667,
       0.38888889, 0.30555556, 0.22222222, 0.38888889, 0.22222222,
       0.30555556, 0.22222222, 0.08333333, 0.22222222, 0.13888889,
       0.19444444, 0.19444444, 0.25      , 0.25      , 0.11111111,
       0.13888889, 0.30555556, 0.25      , 0.33333333, 0.16666667,
       0.19444444, 0.33333333, 0.16666667, 0.02777778, 0.22222222,
       0.19444444, 0.05555556, 0.02777778, 0.19444444, 0.22222222,
       0.13888889, 0.22222222, 0.08333333, 0.27777778, 0.19444444,
       0.75      , 0.58333333, 0.72222222, 0.33333333, 0.61111111,
       0.38888889, 0.55555556, 0.16666667, 0.63888889, 0.25      ,
       0.19444444, 0.44444444, 0.47222222, 0.5       , 0.36111111,
       0.66666667, 0.36111111, 0.41666667, 0.52777778, 0.36111111,
       0.44444444, 0.5       , 0.55555556, 0.5       , 0.58333333,
       0.63888889, 0.69444444, 0.66666667, 0.47222222, 0.38888889,
       0.33333333, 0.33333333, 0.41666667, 0.47222222, 0.30555556,
       0.47222222, 0.66666667, 0.55555556, 0.36111111, 0.33333333,
       0.33333333, 0.5       , 0.41666667, 0.19444444, 0.36111111,
       0.38888889, 0.38888889, 0.52777778, 0.22222222, 0.38888889,
       0.55555556, 0.41666667, 0.77777778, 0.55555556, 0.61111111,
       0.91666667, 0.16666667, 0.83333333, 0.66666667, 0.80555556,
       0.61111111, 0.58333333, 0.69444444, 0.38888889, 0.41666667,
       0.58333333, 0.61111111, 0.94444444, 0.94444444, 0.47222222,
       0.72222222, 0.36111111, 0.94444444, 0.55555556, 0.66666667,
       0.80555556, 0.52777778, 0.5       , 0.58333333, 0.80555556,
       0.86111111, 1.        , 0.58333333, 0.55555556, 0.5       ,
       0.94444444, 0.55555556, 0.58333333, 0.47222222, 0.72222222,
       0.66666667, 0.72222222, 0.41666667, 0.69444444, 0.66666667,
       0.66666667, 0.55555556, 0.61111111, 0.52777778, 0.44444444])
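The cell above rescales the first column in place, so every later step sees the normalized values rather than centimetres. A hedged sketch of a variant that scales all columns and returns a copy instead; the helper name `min_max_scale` and the toy array are assumptions for illustration.

```python
import numpy as np

def min_max_scale(a):
    """Return a min-max scaled copy of a 2-D array, column by column."""
    a = np.asarray(a, dtype=float)
    col_min = a.min(axis=0)
    col_range = a.max(axis=0) - col_min
    # Broadcasting applies each column's min and range to that column.
    return (a - col_min) / col_range

toy = np.array([[1.0, 10.0], [2.0, 20.0], [4.0, 40.0]])
scaled = min_max_scale(toy)
print(scaled)
```

A constant column has a range of zero and would produce a division by zero here, so a real pipeline would need to handle that case.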
# 4 5th and 95th percentiles
five = np.percentile(data1[:,0], 5)
ninety_five = np.percentile(data1[:,0], 95)
print(five)
print(ninety_five)
0.08333333333333327
0.8208333333333329
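Both percentiles can be requested in a single call by passing a sequence, and `np.nanpercentile` skips NaN entries, which matters once missing values are introduced. A minimal sketch on toy data (the arrays are assumptions, not the iris column):

```python
import numpy as np

values = np.arange(101, dtype=float)      # toy data: 0.0, 1.0, ..., 100.0

# One call returns both percentiles.
p5, p95 = np.percentile(values, [5, 95])

# With a missing value, nanpercentile computes over the remaining entries.
with_gap = values.copy()
with_gap[0] = np.nan
p5_nan = np.nanpercentile(with_gap, 5)
print(p5, p95, p5_nan)
```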
# 5 Set 20 random positions to np.nan
np.random.seed(20201201)
rows = np.random.randint(0, 150, 20)
cols = np.random.randint(0, 4, 20)
print(rows)
print(cols)
[ 53 107  74 131  40 144  17  27 106 101 101  95  49 104  23 144  87   3
  51  27]
[2 3 1 3 3 3 2 2 3 0 2 0 2 1 3 0 3 1 2 1]
for i in range(20):
    data1[rows[i], cols[i]] = np.nan
print(data1[53, 1])
2.3
# Number of missing values in the first column
np.sum(np.isnan(data1[:,0]))
3
# Positions of missing values in the first column
np.where(np.isnan(data1[:,0]))
(array([ 95, 101, 144], dtype=int64),)
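# Filter the NumPy array by two conditions
# Note: column 0 was min-max normalized above, so data1[:, 0] < 5.0 holds for every non-NaN row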
data1[(data1[:,0]<5.0) & (data1[:,2]>1.5)]
<ipython-input-27-e7dcbf429571>:2: RuntimeWarning: invalid value encountered in less
  data1[(data1[:,0]<5.0) & (data1[:,2]>1.5)]
<ipython-input-27-e7dcbf429571>:2: RuntimeWarning: invalid value encountered in greater
  data1[(data1[:,0]<5.0) & (data1[:,2]>1.5)]
array([[0.30555556, 3.9       , 1.7       , 0.4       ],
       [0.13888889, 3.4       , 1.6       , 0.2       ],
       [0.38888889, 3.8       , 1.7       , 0.3       ],
       [0.30555556, 3.4       , 1.7       , 0.2       ],
       [0.22222222, 3.3       , 1.7       ,        nan],
       [0.13888889, 3.4       , 1.9       , 0.2       ],
       [0.19444444, 3.        , 1.6       , 0.2       ],
       [0.19444444, 3.4       , 1.6       , 0.4       ],
       [0.11111111, 3.2       , 1.6       , 0.2       ],
       [0.13888889, 3.1       , 1.6       , 0.2       ],
       [0.19444444, 3.5       , 1.6       , 0.6       ],
       [0.22222222, 3.8       , 1.9       , 0.4       ],
       [0.22222222, 3.8       , 1.6       , 0.2       ],
       [0.75      , 3.2       , 4.7       , 1.4       ],
       [0.72222222, 3.1       , 4.9       , 1.5       ],
       [0.61111111, 2.8       , 4.6       , 1.5       ],
       [0.38888889, 2.8       , 4.5       , 1.3       ],
       [0.55555556, 3.3       , 4.7       , 1.6       ],
       [0.16666667, 2.4       , 3.3       , 1.        ],
       [0.63888889, 2.9       , 4.6       , 1.3       ],
       [0.25      , 2.7       , 3.9       , 1.4       ],
       [0.19444444, 2.        , 3.5       , 1.        ],
       [0.44444444, 3.        , 4.2       , 1.5       ],
       [0.47222222, 2.2       , 4.        , 1.        ],
       [0.5       , 2.9       , 4.7       , 1.4       ],
       [0.36111111, 2.9       , 3.6       , 1.3       ],
       [0.66666667, 3.1       , 4.4       , 1.4       ],
       [0.36111111, 3.        , 4.5       , 1.5       ],
       [0.41666667, 2.7       , 4.1       , 1.        ],
       [0.52777778, 2.2       , 4.5       , 1.5       ],
       [0.36111111, 2.5       , 3.9       , 1.1       ],
       [0.44444444, 3.2       , 4.8       , 1.8       ],
       [0.5       , 2.8       , 4.        , 1.3       ],
       [0.55555556, 2.5       , 4.9       , 1.5       ],
       [0.5       , 2.8       , 4.7       , 1.2       ],
       [0.58333333,        nan, 4.3       , 1.3       ],
       [0.63888889, 3.        , 4.4       , 1.4       ],
       [0.69444444, 2.8       , 4.8       , 1.4       ],
       [0.66666667, 3.        , 5.        , 1.7       ],
       [0.47222222, 2.9       , 4.5       , 1.5       ],
       [0.38888889, 2.6       , 3.5       , 1.        ],
       [0.33333333, 2.4       , 3.8       , 1.1       ],
       [0.33333333, 2.4       , 3.7       , 1.        ],
       [0.41666667, 2.7       , 3.9       , 1.2       ],
       [0.47222222, 2.7       , 5.1       , 1.6       ],
       [0.30555556, 3.        , 4.5       , 1.5       ],
       [0.47222222, 3.4       , 4.5       , 1.6       ],
       [0.66666667, 3.1       , 4.7       , 1.5       ],
       [0.55555556, 2.3       , 4.4       ,        nan],
       [0.36111111, 3.        , 4.1       , 1.3       ],
       [0.33333333, 2.5       , 4.        , 1.3       ],
       [0.33333333, 2.6       , 4.4       , 1.2       ],
       [0.5       , 3.        , 4.6       , 1.4       ],
       [0.41666667, 2.6       , 4.        , 1.2       ],
       [0.19444444, 2.3       , 3.3       , 1.        ],
       [0.36111111, 2.7       , 4.2       , 1.3       ],
       [0.38888889, 2.9       , 4.2       , 1.3       ],
       [0.52777778, 2.9       , 4.3       , 1.3       ],
       [0.22222222, 2.5       , 3.        , 1.1       ],
       [0.38888889, 2.8       , 4.1       , 1.3       ],
       [0.55555556, 3.3       , 6.        , 2.5       ],
       [0.77777778, 3.        , 5.9       , 2.1       ],
       [0.55555556, 2.9       , 5.6       , 1.8       ],
       [0.61111111,        nan, 5.8       , 2.2       ],
       [0.91666667, 3.        , 6.6       , 2.1       ],
       [0.16666667, 2.5       , 4.5       ,        nan],
       [0.83333333, 2.9       , 6.3       ,        nan],
       [0.66666667, 2.5       , 5.8       , 1.8       ],
       [0.80555556, 3.6       , 6.1       , 2.5       ],
       [0.61111111, 3.2       , 5.1       , 2.        ],
       [0.58333333, 2.7       , 5.3       , 1.9       ],
       [0.69444444, 3.        , 5.5       , 2.1       ],
       [0.38888889, 2.5       , 5.        , 2.        ],
       [0.41666667, 2.8       , 5.1       , 2.4       ],
       [0.58333333, 3.2       , 5.3       , 2.3       ],
       [0.61111111, 3.        , 5.5       , 1.8       ],
       [0.94444444, 3.8       , 6.7       , 2.2       ],
       [0.94444444, 2.6       , 6.9       , 2.3       ],
       [0.47222222, 2.2       , 5.        , 1.5       ],
       [0.72222222, 3.2       , 5.7       , 2.3       ],
       [0.36111111, 2.8       , 4.9       , 2.        ],
       [0.94444444, 2.8       , 6.7       , 2.        ],
       [0.55555556, 2.7       , 4.9       , 1.8       ],
       [0.66666667, 3.3       , 5.7       , 2.1       ],
       [0.80555556, 3.2       , 6.        , 1.8       ],
       [0.52777778, 2.8       , 4.8       , 1.8       ],
       [0.5       , 3.        , 4.9       , 1.8       ],
       [0.58333333, 2.8       , 5.6       , 2.1       ],
       [0.80555556, 3.        , 5.8       , 1.6       ],
       [0.86111111, 2.8       , 6.1       , 1.9       ],
       [1.        , 3.8       , 6.4       ,        nan],
       [0.58333333, 2.8       , 5.6       , 2.2       ],
       [0.55555556, 2.8       , 5.1       , 1.5       ],
       [0.5       , 2.6       , 5.6       , 1.4       ],
       [0.94444444, 3.        , 6.1       , 2.3       ],
       [0.55555556, 3.4       , 5.6       , 2.4       ],
       [0.58333333, 3.1       , 5.5       , 1.8       ],
       [0.47222222, 3.        , 4.8       , 1.8       ],
       [0.72222222, 3.1       , 5.4       , 2.1       ],
       [0.66666667, 3.1       , 5.6       , 2.4       ],
       [0.72222222, 3.1       , 5.1       , 2.3       ],
       [0.41666667, 2.7       , 5.1       , 1.9       ],
       [0.69444444, 3.2       , 5.9       , 2.3       ],
       [0.66666667, 3.        , 5.2       , 2.3       ],
       [0.55555556, 2.5       , 5.        , 1.9       ],
       [0.61111111, 3.        , 5.2       , 2.        ],
       [0.52777778, 3.4       , 5.4       , 2.3       ],
       [0.44444444, 3.        , 5.1       , 1.8       ]])
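Comparisons against NaN evaluate to False, so rows with NaN in a compared column are dropped silently, and older NumPy versions emit the RuntimeWarning shown above. A sketch that makes the NaN handling explicit; the toy array and the `valid` mask are illustrative assumptions.

```python
import numpy as np

# Toy array (assumed): column 0 and column 1 stand in for the two filtered columns.
toy = np.array([[4.8, 1.9],
                [np.nan, 1.7],
                [5.4, 1.3],
                [4.9, np.nan]])

# Rows where both compared columns are present.
valid = ~np.isnan(toy[:, [0, 1]]).any(axis=1)

# errstate silences the "invalid value" warning on NumPy versions that raise it.
with np.errstate(invalid='ignore'):
    mask = valid & (toy[:, 0] < 5.0) & (toy[:, 1] > 1.5)

selected = toy[mask]
print(selected)
```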
# 8 Rows without NaN
data2 = data1[np.sum(np.isnan(data1), axis=1)<1,:]
print(data2)
[[0.22222222 3.5        1.4        0.2       ]
 [0.16666667 3.         1.4        0.2       ]
 [0.11111111 3.2        1.3        0.2       ]
 [0.19444444 3.6        1.4        0.2       ]
 [0.30555556 3.9        1.7        0.4       ]
 [0.08333333 3.4        1.4        0.3       ]
 [0.19444444 3.4        1.5        0.2       ]
 [0.02777778 2.9        1.4        0.2       ]
 [0.16666667 3.1        1.5        0.1       ]
 [0.30555556 3.7        1.5        0.2       ]
 [0.13888889 3.4        1.6        0.2       ]
 [0.13888889 3.         1.4        0.1       ]
 [0.         3.         1.1        0.1       ]
 [0.41666667 4.         1.2        0.2       ]
 [0.38888889 4.4        1.5        0.4       ]
 [0.30555556 3.9        1.3        0.4       ]
 [0.38888889 3.8        1.7        0.3       ]
 [0.22222222 3.8        1.5        0.3       ]
 [0.30555556 3.4        1.7        0.2       ]
 [0.22222222 3.7        1.5        0.4       ]
 [0.08333333 3.6        1.         0.2       ]
 [0.13888889 3.4        1.9        0.2       ]
 [0.19444444 3.         1.6        0.2       ]
 [0.19444444 3.4        1.6        0.4       ]
 [0.25       3.4        1.4        0.2       ]
 [0.11111111 3.2        1.6        0.2       ]
 [0.13888889 3.1        1.6        0.2       ]
 [0.30555556 3.4        1.5        0.4       ]
 [0.25       4.1        1.5        0.1       ]
 [0.33333333 4.2        1.4        0.2       ]
 [0.16666667 3.1        1.5        0.2       ]
 [0.19444444 3.2        1.2        0.2       ]
 [0.33333333 3.5        1.3        0.2       ]
 [0.16666667 3.6        1.4        0.1       ]
 [0.02777778 3.         1.3        0.2       ]
 [0.22222222 3.4        1.5        0.2       ]
 [0.05555556 2.3        1.3        0.3       ]
 [0.02777778 3.2        1.3        0.2       ]
 [0.19444444 3.5        1.6        0.6       ]
 [0.22222222 3.8        1.9        0.4       ]
 [0.13888889 3.         1.4        0.3       ]
 [0.22222222 3.8        1.6        0.2       ]
 [0.08333333 3.2        1.4        0.2       ]
 [0.27777778 3.7        1.5        0.2       ]
 [0.75       3.2        4.7        1.4       ]
 [0.72222222 3.1        4.9        1.5       ]
 [0.61111111 2.8        4.6        1.5       ]
 [0.38888889 2.8        4.5        1.3       ]
 [0.55555556 3.3        4.7        1.6       ]
 [0.16666667 2.4        3.3        1.        ]
 [0.63888889 2.9        4.6        1.3       ]
 [0.25       2.7        3.9        1.4       ]
 [0.19444444 2.         3.5        1.        ]
 [0.44444444 3.         4.2        1.5       ]
 [0.47222222 2.2        4.         1.        ]
 [0.5        2.9        4.7        1.4       ]
 [0.36111111 2.9        3.6        1.3       ]
 [0.66666667 3.1        4.4        1.4       ]
 [0.36111111 3.         4.5        1.5       ]
 [0.41666667 2.7        4.1        1.        ]
 [0.52777778 2.2        4.5        1.5       ]
 [0.36111111 2.5        3.9        1.1       ]
 [0.44444444 3.2        4.8        1.8       ]
 [0.5        2.8        4.         1.3       ]
 [0.55555556 2.5        4.9        1.5       ]
 [0.5        2.8        4.7        1.2       ]
 [0.63888889 3.         4.4        1.4       ]
 [0.69444444 2.8        4.8        1.4       ]
 [0.66666667 3.         5.         1.7       ]
 [0.47222222 2.9        4.5        1.5       ]
 [0.38888889 2.6        3.5        1.        ]
 [0.33333333 2.4        3.8        1.1       ]
 [0.33333333 2.4        3.7        1.        ]
 [0.41666667 2.7        3.9        1.2       ]
 [0.47222222 2.7        5.1        1.6       ]
 [0.30555556 3.         4.5        1.5       ]
 [0.47222222 3.4        4.5        1.6       ]
 [0.66666667 3.1        4.7        1.5       ]
 [0.36111111 3.         4.1        1.3       ]
 [0.33333333 2.5        4.         1.3       ]
 [0.33333333 2.6        4.4        1.2       ]
 [0.5        3.         4.6        1.4       ]
 [0.41666667 2.6        4.         1.2       ]
 [0.19444444 2.3        3.3        1.        ]
 [0.36111111 2.7        4.2        1.3       ]
 [0.38888889 2.9        4.2        1.3       ]
 [0.52777778 2.9        4.3        1.3       ]
 [0.22222222 2.5        3.         1.1       ]
 [0.38888889 2.8        4.1        1.3       ]
 [0.55555556 3.3        6.         2.5       ]
 [0.77777778 3.         5.9        2.1       ]
 [0.55555556 2.9        5.6        1.8       ]
 [0.91666667 3.         6.6        2.1       ]
 [0.66666667 2.5        5.8        1.8       ]
 [0.80555556 3.6        6.1        2.5       ]
 [0.61111111 3.2        5.1        2.        ]
 [0.58333333 2.7        5.3        1.9       ]
 [0.69444444 3.         5.5        2.1       ]
 [0.38888889 2.5        5.         2.        ]
 [0.41666667 2.8        5.1        2.4       ]
 [0.58333333 3.2        5.3        2.3       ]
 [0.61111111 3.         5.5        1.8       ]
 [0.94444444 3.8        6.7        2.2       ]
 [0.94444444 2.6        6.9        2.3       ]
 [0.47222222 2.2        5.         1.5       ]
 [0.72222222 3.2        5.7        2.3       ]
 [0.36111111 2.8        4.9        2.        ]
 [0.94444444 2.8        6.7        2.        ]
 [0.55555556 2.7        4.9        1.8       ]
 [0.66666667 3.3        5.7        2.1       ]
 [0.80555556 3.2        6.         1.8       ]
 [0.52777778 2.8        4.8        1.8       ]
 [0.5        3.         4.9        1.8       ]
 [0.58333333 2.8        5.6        2.1       ]
 [0.80555556 3.         5.8        1.6       ]
 [0.86111111 2.8        6.1        1.9       ]
 [0.58333333 2.8        5.6        2.2       ]
 [0.55555556 2.8        5.1        1.5       ]
 [0.5        2.6        5.6        1.4       ]
 [0.94444444 3.         6.1        2.3       ]
 [0.55555556 3.4        5.6        2.4       ]
 [0.58333333 3.1        5.5        1.8       ]
 [0.47222222 3.         4.8        1.8       ]
 [0.72222222 3.1        5.4        2.1       ]
 [0.66666667 3.1        5.6        2.4       ]
 [0.72222222 3.1        5.1        2.3       ]
 [0.41666667 2.7        5.1        1.9       ]
 [0.69444444 3.2        5.9        2.3       ]
 [0.66666667 3.         5.2        2.3       ]
 [0.55555556 2.5        5.         1.9       ]
 [0.61111111 3.         5.2        2.        ]
 [0.52777778 3.4        5.4        2.3       ]
 [0.44444444 3.         5.1        1.8       ]]
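The row filter above counts NaNs per row and keeps rows with a count below one. An equivalent and perhaps more readable form uses `any(axis=1)`; the sketch below checks on a toy array (an assumption) that both masks agree.

```python
import numpy as np

toy = np.array([[0.2, 3.5],
                [np.nan, 3.0],
                [0.1, np.nan],
                [0.3, 3.9]])

# Keep rows in which no element is NaN.
keep = ~np.isnan(toy).any(axis=1)
clean = toy[keep]

# Counting form, as used in the cell above.
keep_count = np.sum(np.isnan(toy), axis=1) < 1
print(clean)
```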
# 9 Correlation coefficient between two columns
np.corrcoef(data2[:,0], data2[:, 2])
array([[1.        , 0.87522764],
       [0.87522764, 1.        ]])
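`np.corrcoef` returns the full correlation matrix; the off-diagonal entry is the Pearson coefficient, r = Σ(x - x̄)(y - ȳ) / sqrt(Σ(x - x̄)² · Σ(y - ȳ)²). A small sketch that computes it by hand on toy vectors (assumed data) and compares the result with NumPy:

```python
import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.0, 4.0, 5.0, 4.0, 5.0])

# Deviations from the mean.
dx = x - x.mean()
dy = y - y.mean()

# Pearson correlation from its definition.
r_manual = (dx * dy).sum() / np.sqrt((dx ** 2).sum() * (dy ** 2).sum())

# Off-diagonal element of the 2x2 correlation matrix.
r_numpy = np.corrcoef(x, y)[0, 1]
print(r_manual, r_numpy)
```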
# 10 Locate missing values
np.where(np.isnan(data1))
(array([  3,  17,  23,  27,  27,  40,  49,  51,  53,  74,  87,  95, 101,
        101, 104, 106, 107, 131, 144, 144], dtype=int64),
 array([1, 2, 3, 1, 2, 3, 2, 2, 2, 1, 3, 0, 0, 2, 1, 3, 3, 3, 0, 3],
       dtype=int64))
# 11 Fill missing values with 0
data1[np.where(np.isnan(data1))] = 0
np.where(np.isnan(data1))
(array([], dtype=int64), array([], dtype=int64))
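Filling with 0 is what the task asks for, but zeros then flow into every later computation on those columns. As a hedged alternative, not what the notebook does, missing entries can be replaced by the column mean computed with `np.nanmean`. The toy array is an assumption.

```python
import numpy as np

toy = np.array([[1.0, np.nan],
                [3.0, 4.0],
                [np.nan, 6.0]])

# Column means computed while ignoring NaN entries.
col_means = np.nanmean(toy, axis=0)

# Where an entry is NaN, take that column's mean; otherwise keep the value.
filled = np.where(np.isnan(toy), col_means, toy)
print(filled)
```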
np.unique(data['species'])
array(['Iris-setosa', 'Iris-versicolor', 'Iris-virginica'], dtype=object)
# Number of samples per species (unique value counts)
for i in np.unique(data['species']):
    print(i, ":  ", end='')
    print(np.sum(data['species']==i))

Iris-setosa :  50
Iris-versicolor :  50
Iris-virginica :  50
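The loop above scans the column once per species. `np.unique` can return the counts directly through `return_counts=True`; a minimal sketch on a toy label array (assumed values):

```python
import numpy as np

labels = np.array(['Iris-setosa', 'Iris-virginica', 'Iris-setosa',
                   'Iris-versicolor', 'Iris-setosa'])

# values is sorted; counts[i] is the number of occurrences of values[i].
values, counts = np.unique(labels, return_counts=True)
species_counts = {str(v): int(c) for v, c in zip(values, counts)}
print(species_counts)
```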
# Convert the third column to a categorical variable
data3 = data1[:, 2].copy()
data_length_bins = np.digitize(data3.astype('float'), [0,3,5,10])
label_map = {1: 'small', 2: 'medium', 3: 'large'}
data_3 = [label_map[x] for x in data_length_bins]
print(data_3)
['small', 'small', 'small', 'small', 'small', 'small', 'small', 'small', 'small', 'small', 'small', 'small', 'small', 'small', 'small', 'small', 'small', 'small', 'small', 'small', 'small', 'small', 'small', 'small', 'small', 'small', 'small', 'small', 'small', 'small', 'small', 'small', 'small', 'small', 'small', 'small', 'small', 'small', 'small', 'small', 'small', 'small', 'small', 'small', 'small', 'small', 'small', 'small', 'small', 'small', 'medium', 'small', 'medium', 'small', 'medium', 'medium', 'medium', 'medium', 'medium', 'medium', 'medium', 'medium', 'medium', 'medium', 'medium', 'medium', 'medium', 'medium', 'medium', 'medium', 'medium', 'medium', 'medium', 'medium', 'medium', 'medium', 'medium', 'large', 'medium', 'medium', 'medium', 'medium', 'medium', 'large', 'medium', 'medium', 'medium', 'medium', 'medium', 'medium', 'medium', 'medium', 'medium', 'medium', 'medium', 'medium', 'medium', 'medium', 'medium', 'medium', 'large', 'small', 'large', 'large', 'large', 'large', 'medium', 'large', 'large', 'large', 'large', 'large', 'large', 'large', 'large', 'large', 'large', 'large', 'large', 'large', 'large', 'medium', 'large', 'medium', 'large', 'large', 'medium', 'medium', 'large', 'large', 'large', 'large', 'large', 'large', 'large', 'large', 'large', 'large', 'medium', 'large', 'large', 'large', 'large', 'large', 'large', 'large', 'large', 'large', 'large', 'large']
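With its default `right=False`, `np.digitize` returns the index `i` such that `bins[i-1] <= x < bins[i]`, so a value equal to a bin edge falls into the higher bin. A sketch with the same edges on a few toy values (assumed), showing how the edges 3 and 5 are assigned:

```python
import numpy as np

bins = [0, 3, 5, 10]
lengths = np.array([1.4, 3.0, 4.7, 5.0, 6.9])   # toy values, in cm

# Bin indices: 1 for [0, 3), 2 for [3, 5), 3 for [5, 10).
idx = np.digitize(lengths, bins)

label_map = {1: 'small', 2: 'medium', 3: 'large'}
labels = [label_map[i] for i in idx]
print(labels)
```

A value of 10 or more would map to index 4, which is not in `label_map`, so the lookup would raise a `KeyError`. The entries that were filled with 0 earlier land in the 'small' bin.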
# Derived column: pi * column 0 * (column 2 squared) / 3
data4 = np.pi*data1[:, 0]*(data1[:, 2]**2)/3
print(data4)
[ 0.45611271  0.34208453  0.19664043  0.19634954  0.39909862  0.92473362
  0.17104227  0.45814893  0.05701409  0.39269908  0.71994832  0.37233691
  0.28507044  0.          0.62831853  0.91629786  0.54076118  0.
  1.17693369  0.52359878  0.92473362  0.52359878  0.08726646  0.67253354
  0.52505322  0.52127167  0.52127167  0.          0.5131268   0.29786953
  0.37233691  0.71994832  0.58904862  0.68416907  0.39269908  0.29321531
  0.58992129  0.34208453  0.04916011  0.52359878  0.34412075  0.09832021
  0.04916011  0.52127167  0.84008515  0.28507044  0.59573905  0.17104227
  0.65449847  0.         17.34944543  0.         18.15898731  0.
 13.54142789  8.24668072 12.85144106  1.90066356 14.15694734  3.98196869
  2.49436639  8.2100288   7.91215928 11.56629695  4.90088454 13.51582973
  7.65763209  7.33474618 11.19192383  5.75173255 10.72330292  8.37758041
 13.96845178 11.56629695 11.29489825 12.95267016 16.75516082 17.45329252
 10.01382658  4.98873278  5.04051088  4.77871149  6.63661448 12.86220392
  6.47953485 10.01382658 15.42172927 11.26319144  6.35678002  5.58505361
  6.75791486 11.07935009  6.98131701  2.21744081  6.6706484   0.
  7.1837752  10.21919366  2.0943951   6.8457631  20.94395102  0.
 28.35229192 18.24450845 21.52805455 41.81459822  3.53429174 34.63605901
 23.48515041 31.38945571 16.64520508 17.15920454 21.99842078 10.1810873
 11.34900346 17.15920454 19.35861029 44.39710374 47.08723789 12.36274887
 24.57249054  9.07949366 44.39710374 13.96845178 22.68229896 30.36872898
 12.73392222 12.5716066  19.15673387 28.37789008 33.55424576 42.8932117
 19.15673387 15.13200461 16.4200576  36.80143083 18.24450845 18.47867346
 11.39350936 22.05398043 21.89341014 19.671606   11.34900346 25.31454636
  0.         18.87748119 14.54441043 17.30435776 16.11637031 12.10560369]
# Append the derived values as a fifth column
np.c_[data1, data4]
array([[2.22222222e-01, 3.50000000e+00, 1.40000000e+00, 2.00000000e-01,
        4.56112711e-01],
       [1.66666667e-01, 3.00000000e+00, 1.40000000e+00, 2.00000000e-01,
        3.42084533e-01],
       [1.11111111e-01, 3.20000000e+00, 1.30000000e+00, 2.00000000e-01,
        1.96640429e-01],
       [8.33333333e-02, 0.00000000e+00, 1.50000000e+00, 2.00000000e-01,
        1.96349541e-01],
       [1.94444444e-01, 3.60000000e+00, 1.40000000e+00, 2.00000000e-01,
        3.99098622e-01],
       [3.05555556e-01, 3.90000000e+00, 1.70000000e+00, 4.00000000e-01,
        9.24733615e-01],
       [8.33333333e-02, 3.40000000e+00, 1.40000000e+00, 3.00000000e-01,
        1.71042267e-01],
       [1.94444444e-01, 3.40000000e+00, 1.50000000e+00, 2.00000000e-01,
        4.58148929e-01],
       [2.77777778e-02, 2.90000000e+00, 1.40000000e+00, 2.00000000e-01,
        5.70140889e-02],
       [1.66666667e-01, 3.10000000e+00, 1.50000000e+00, 1.00000000e-01,
        3.92699082e-01],
       [3.05555556e-01, 3.70000000e+00, 1.50000000e+00, 2.00000000e-01,
        7.19948316e-01],
       [1.38888889e-01, 3.40000000e+00, 1.60000000e+00, 2.00000000e-01,
        3.72336907e-01],
       [1.38888889e-01, 3.00000000e+00, 1.40000000e+00, 1.00000000e-01,
        2.85070444e-01],
       [0.00000000e+00, 3.00000000e+00, 1.10000000e+00, 1.00000000e-01,
        0.00000000e+00],
       [4.16666667e-01, 4.00000000e+00, 1.20000000e+00, 2.00000000e-01,
        6.28318531e-01],
       [3.88888889e-01, 4.40000000e+00, 1.50000000e+00, 4.00000000e-01,
        9.16297857e-01],
       [3.05555556e-01, 3.90000000e+00, 1.30000000e+00, 4.00000000e-01,
        5.40761180e-01],
       [2.22222222e-01, 3.50000000e+00, 0.00000000e+00, 3.00000000e-01,
        0.00000000e+00],
       [3.88888889e-01, 3.80000000e+00, 1.70000000e+00, 3.00000000e-01,
        1.17693369e+00],
       [2.22222222e-01, 3.80000000e+00, 1.50000000e+00, 3.00000000e-01,
        5.23598776e-01],
       [3.05555556e-01, 3.40000000e+00, 1.70000000e+00, 2.00000000e-01,
        9.24733615e-01],
       [2.22222222e-01, 3.70000000e+00, 1.50000000e+00, 4.00000000e-01,
        5.23598776e-01],
       [8.33333333e-02, 3.60000000e+00, 1.00000000e+00, 2.00000000e-01,
        8.72664626e-02],
       [2.22222222e-01, 3.30000000e+00, 1.70000000e+00, 0.00000000e+00,
        6.72533538e-01],
       [1.38888889e-01, 3.40000000e+00, 1.90000000e+00, 2.00000000e-01,
        5.25053217e-01],
       [1.94444444e-01, 3.00000000e+00, 1.60000000e+00, 2.00000000e-01,
        5.21271670e-01],
       [1.94444444e-01, 3.40000000e+00, 1.60000000e+00, 4.00000000e-01,
        5.21271670e-01],
       [2.50000000e-01, 0.00000000e+00, 0.00000000e+00, 2.00000000e-01,
        0.00000000e+00],
       [2.50000000e-01, 3.40000000e+00, 1.40000000e+00, 2.00000000e-01,
        5.13126800e-01],
       [1.11111111e-01, 3.20000000e+00, 1.60000000e+00, 2.00000000e-01,
        2.97869526e-01],
       [1.38888889e-01, 3.10000000e+00, 1.60000000e+00, 2.00000000e-01,
        3.72336907e-01],
       [3.05555556e-01, 3.40000000e+00, 1.50000000e+00, 4.00000000e-01,
        7.19948316e-01],
       [2.50000000e-01, 4.10000000e+00, 1.50000000e+00, 1.00000000e-01,
        5.89048623e-01],
       [3.33333333e-01, 4.20000000e+00, 1.40000000e+00, 2.00000000e-01,
        6.84169067e-01],
       [1.66666667e-01, 3.10000000e+00, 1.50000000e+00, 2.00000000e-01,
        3.92699082e-01],
       [1.94444444e-01, 3.20000000e+00, 1.20000000e+00, 2.00000000e-01,
        2.93215314e-01],
       [3.33333333e-01, 3.50000000e+00, 1.30000000e+00, 2.00000000e-01,
        5.89921287e-01],
       [1.66666667e-01, 3.60000000e+00, 1.40000000e+00, 1.00000000e-01,
        3.42084533e-01],
       [2.77777778e-02, 3.00000000e+00, 1.30000000e+00, 2.00000000e-01,
        4.91601073e-02],
       [2.22222222e-01, 3.40000000e+00, 1.50000000e+00, 2.00000000e-01,
        5.23598776e-01],
       [1.94444444e-01, 3.50000000e+00, 1.30000000e+00, 0.00000000e+00,
        3.44120751e-01],
       [5.55555556e-02, 2.30000000e+00, 1.30000000e+00, 3.00000000e-01,
        9.83202145e-02],
       [2.77777778e-02, 3.20000000e+00, 1.30000000e+00, 2.00000000e-01,
        4.91601073e-02],
       [1.94444444e-01, 3.50000000e+00, 1.60000000e+00, 6.00000000e-01,
        5.21271670e-01],
       [2.22222222e-01, 3.80000000e+00, 1.90000000e+00, 4.00000000e-01,
        8.40085147e-01],
       [1.38888889e-01, 3.00000000e+00, 1.40000000e+00, 3.00000000e-01,
        2.85070444e-01],
       [2.22222222e-01, 3.80000000e+00, 1.60000000e+00, 2.00000000e-01,
        5.95739051e-01],
       [8.33333333e-02, 3.20000000e+00, 1.40000000e+00, 2.00000000e-01,
        1.71042267e-01],
       [2.77777778e-01, 3.70000000e+00, 1.50000000e+00, 2.00000000e-01,
        6.54498469e-01],
       [1.94444444e-01, 3.30000000e+00, 0.00000000e+00, 2.00000000e-01,
        0.00000000e+00],
       [7.50000000e-01, 3.20000000e+00, 4.70000000e+00, 1.40000000e+00,
        1.73494454e+01],
       [5.83333333e-01, 3.20000000e+00, 0.00000000e+00, 1.50000000e+00,
        0.00000000e+00],
       [7.22222222e-01, 3.10000000e+00, 4.90000000e+00, 1.50000000e+00,
        1.81589873e+01],
       [3.33333333e-01, 2.30000000e+00, 0.00000000e+00, 1.30000000e+00,
        0.00000000e+00],
       [6.11111111e-01, 2.80000000e+00, 4.60000000e+00, 1.50000000e+00,
        1.35414279e+01],
       [3.88888889e-01, 2.80000000e+00, 4.50000000e+00, 1.30000000e+00,
        8.24668072e+00],
       [5.55555556e-01, 3.30000000e+00, 4.70000000e+00, 1.60000000e+00,
        1.28514411e+01],
       [1.66666667e-01, 2.40000000e+00, 3.30000000e+00, 1.00000000e+00,
        1.90066356e+00],
       [6.38888889e-01, 2.90000000e+00, 4.60000000e+00, 1.30000000e+00,
        1.41569473e+01],
       [2.50000000e-01, 2.70000000e+00, 3.90000000e+00, 1.40000000e+00,
        3.98196869e+00],
       [1.94444444e-01, 2.00000000e+00, 3.50000000e+00, 1.00000000e+00,
        2.49436639e+00],
       [4.44444444e-01, 3.00000000e+00, 4.20000000e+00, 1.50000000e+00,
        8.21002880e+00],
       [4.72222222e-01, 2.20000000e+00, 4.00000000e+00, 1.00000000e+00,
        7.91215928e+00],
       [5.00000000e-01, 2.90000000e+00, 4.70000000e+00, 1.40000000e+00,
        1.15662970e+01],
       [3.61111111e-01, 2.90000000e+00, 3.60000000e+00, 1.30000000e+00,
        4.90088454e+00],
       [6.66666667e-01, 3.10000000e+00, 4.40000000e+00, 1.40000000e+00,
        1.35158297e+01],
       [3.61111111e-01, 3.00000000e+00, 4.50000000e+00, 1.50000000e+00,
        7.65763209e+00],
       [4.16666667e-01, 2.70000000e+00, 4.10000000e+00, 1.00000000e+00,
        7.33474618e+00],
       [5.27777778e-01, 2.20000000e+00, 4.50000000e+00, 1.50000000e+00,
        1.11919238e+01],
       [3.61111111e-01, 2.50000000e+00, 3.90000000e+00, 1.10000000e+00,
        5.75173255e+00],
       [4.44444444e-01, 3.20000000e+00, 4.80000000e+00, 1.80000000e+00,
        1.07233029e+01],
       [5.00000000e-01, 2.80000000e+00, 4.00000000e+00, 1.30000000e+00,
        8.37758041e+00],
       [5.55555556e-01, 2.50000000e+00, 4.90000000e+00, 1.50000000e+00,
        1.39684518e+01],
       [5.00000000e-01, 2.80000000e+00, 4.70000000e+00, 1.20000000e+00,
        1.15662970e+01],
       [5.83333333e-01, 0.00000000e+00, 4.30000000e+00, 1.30000000e+00,
        1.12948983e+01],
       [6.38888889e-01, 3.00000000e+00, 4.40000000e+00, 1.40000000e+00,
        1.29526702e+01],
       [6.94444444e-01, 2.80000000e+00, 4.80000000e+00, 1.40000000e+00,
        1.67551608e+01],
       [6.66666667e-01, 3.00000000e+00, 5.00000000e+00, 1.70000000e+00,
        1.74532925e+01],
       [4.72222222e-01, 2.90000000e+00, 4.50000000e+00, 1.50000000e+00,
        1.00138266e+01],
       [3.88888889e-01, 2.60000000e+00, 3.50000000e+00, 1.00000000e+00,
        4.98873278e+00],
       [3.33333333e-01, 2.40000000e+00, 3.80000000e+00, 1.10000000e+00,
        5.04051088e+00],
       [3.33333333e-01, 2.40000000e+00, 3.70000000e+00, 1.00000000e+00,
        4.77871149e+00],
       [4.16666667e-01, 2.70000000e+00, 3.90000000e+00, 1.20000000e+00,
        6.63661448e+00],
       [4.72222222e-01, 2.70000000e+00, 5.10000000e+00, 1.60000000e+00,
        1.28622039e+01],
       [3.05555556e-01, 3.00000000e+00, 4.50000000e+00, 1.50000000e+00,
        6.47953485e+00],
       [4.72222222e-01, 3.40000000e+00, 4.50000000e+00, 1.60000000e+00,
        1.00138266e+01],
       [6.66666667e-01, 3.10000000e+00, 4.70000000e+00, 1.50000000e+00,
        1.54217293e+01],
       [5.55555556e-01, 2.30000000e+00, 4.40000000e+00, 0.00000000e+00,
        1.12631914e+01],
       [3.61111111e-01, 3.00000000e+00, 4.10000000e+00, 1.30000000e+00,
        6.35678002e+00],
       [3.33333333e-01, 2.50000000e+00, 4.00000000e+00, 1.30000000e+00,
        5.58505361e+00],
       [3.33333333e-01, 2.60000000e+00, 4.40000000e+00, 1.20000000e+00,
        6.75791486e+00],
       [5.00000000e-01, 3.00000000e+00, 4.60000000e+00, 1.40000000e+00,
        1.10793501e+01],
       [4.16666667e-01, 2.60000000e+00, 4.00000000e+00, 1.20000000e+00,
        6.98131701e+00],
       [1.94444444e-01, 2.30000000e+00, 3.30000000e+00, 1.00000000e+00,
        2.21744081e+00],
       [3.61111111e-01, 2.70000000e+00, 4.20000000e+00, 1.30000000e+00,
        6.67064840e+00],
       [0.00000000e+00, 3.00000000e+00, 4.20000000e+00, 1.20000000e+00,
        0.00000000e+00],
       [3.88888889e-01, 2.90000000e+00, 4.20000000e+00, 1.30000000e+00,
        7.18377520e+00],
       [5.27777778e-01, 2.90000000e+00, 4.30000000e+00, 1.30000000e+00,
        1.02191937e+01],
       [2.22222222e-01, 2.50000000e+00, 3.00000000e+00, 1.10000000e+00,
        2.09439510e+00],
       [3.88888889e-01, 2.80000000e+00, 4.10000000e+00, 1.30000000e+00,
        6.84576310e+00],
       [5.55555556e-01, 3.30000000e+00, 6.00000000e+00, 2.50000000e+00,
        2.09439510e+01],
       [0.00000000e+00, 2.70000000e+00, 0.00000000e+00, 1.90000000e+00,
        0.00000000e+00],
       [7.77777778e-01, 3.00000000e+00, 5.90000000e+00, 2.10000000e+00,
        2.83522919e+01],
       [5.55555556e-01, 2.90000000e+00, 5.60000000e+00, 1.80000000e+00,
        1.82445084e+01],
       [6.11111111e-01, 0.00000000e+00, 5.80000000e+00, 2.20000000e+00,
        2.15280545e+01],
       [9.16666667e-01, 3.00000000e+00, 6.60000000e+00, 2.10000000e+00,
        4.18145982e+01],
       [1.66666667e-01, 2.50000000e+00, 4.50000000e+00, 0.00000000e+00,
        3.53429174e+00],
       [8.33333333e-01, 2.90000000e+00, 6.30000000e+00, 0.00000000e+00,
        3.46360590e+01],
       [6.66666667e-01, 2.50000000e+00, 5.80000000e+00, 1.80000000e+00,
        2.34851504e+01],
       [8.05555556e-01, 3.60000000e+00, 6.10000000e+00, 2.50000000e+00,
        3.13894557e+01],
       [6.11111111e-01, 3.20000000e+00, 5.10000000e+00, 2.00000000e+00,
        1.66452051e+01],
       [5.83333333e-01, 2.70000000e+00, 5.30000000e+00, 1.90000000e+00,
        1.71592045e+01],
       [6.94444444e-01, 3.00000000e+00, 5.50000000e+00, 2.10000000e+00,
        2.19984208e+01],
       [3.88888889e-01, 2.50000000e+00, 5.00000000e+00, 2.00000000e+00,
        1.01810873e+01],
       [4.16666667e-01, 2.80000000e+00, 5.10000000e+00, 2.40000000e+00,
        1.13490035e+01],
       [5.83333333e-01, 3.20000000e+00, 5.30000000e+00, 2.30000000e+00,
        1.71592045e+01],
       [6.11111111e-01, 3.00000000e+00, 5.50000000e+00, 1.80000000e+00,
        1.93586103e+01],
       [9.44444444e-01, 3.80000000e+00, 6.70000000e+00, 2.20000000e+00,
        4.43971037e+01],
       [9.44444444e-01, 2.60000000e+00, 6.90000000e+00, 2.30000000e+00,
        4.70872379e+01],
       [4.72222222e-01, 2.20000000e+00, 5.00000000e+00, 1.50000000e+00,
        1.23627489e+01],
       [7.22222222e-01, 3.20000000e+00, 5.70000000e+00, 2.30000000e+00,
        2.45724905e+01],
       [3.61111111e-01, 2.80000000e+00, 4.90000000e+00, 2.00000000e+00,
        9.07949366e+00],
       [9.44444444e-01, 2.80000000e+00, 6.70000000e+00, 2.00000000e+00,
        4.43971037e+01],
       [5.55555556e-01, 2.70000000e+00, 4.90000000e+00, 1.80000000e+00,
        1.39684518e+01],
       [6.66666667e-01, 3.30000000e+00, 5.70000000e+00, 2.10000000e+00,
        2.26822990e+01],
       [8.05555556e-01, 3.20000000e+00, 6.00000000e+00, 1.80000000e+00,
        3.03687290e+01],
       [5.27777778e-01, 2.80000000e+00, 4.80000000e+00, 1.80000000e+00,
        1.27339222e+01],
       [5.00000000e-01, 3.00000000e+00, 4.90000000e+00, 1.80000000e+00,
        1.25716066e+01],
       [5.83333333e-01, 2.80000000e+00, 5.60000000e+00, 2.10000000e+00,
        1.91567339e+01],
       [8.05555556e-01, 3.00000000e+00, 5.80000000e+00, 1.60000000e+00,
        2.83778901e+01],
       [8.61111111e-01, 2.80000000e+00, 6.10000000e+00, 1.90000000e+00,
        3.35542458e+01],
       [1.00000000e+00, 3.80000000e+00, 6.40000000e+00, 0.00000000e+00,
        4.28932117e+01],
       [5.83333333e-01, 2.80000000e+00, 5.60000000e+00, 2.20000000e+00,
        1.91567339e+01],
       [5.55555556e-01, 2.80000000e+00, 5.10000000e+00, 1.50000000e+00,
        1.51320046e+01],
       [5.00000000e-01, 2.60000000e+00, 5.60000000e+00, 1.40000000e+00,
        1.64200576e+01],
       [9.44444444e-01, 3.00000000e+00, 6.10000000e+00, 2.30000000e+00,
        3.68014308e+01],
       [5.55555556e-01, 3.40000000e+00, 5.60000000e+00, 2.40000000e+00,
        1.82445084e+01],
       [5.83333333e-01, 3.10000000e+00, 5.50000000e+00, 1.80000000e+00,
        1.84786735e+01],
       [4.72222222e-01, 3.00000000e+00, 4.80000000e+00, 1.80000000e+00,
        1.13935094e+01],
       [7.22222222e-01, 3.10000000e+00, 5.40000000e+00, 2.10000000e+00,
        2.20539804e+01],
       [6.66666667e-01, 3.10000000e+00, 5.60000000e+00, 2.40000000e+00,
        2.18934101e+01],
       [7.22222222e-01, 3.10000000e+00, 5.10000000e+00, 2.30000000e+00,
        1.96716060e+01],
       [4.16666667e-01, 2.70000000e+00, 5.10000000e+00, 1.90000000e+00,
        1.13490035e+01],
       [6.94444444e-01, 3.20000000e+00, 5.90000000e+00, 2.30000000e+00,
        2.53145464e+01],
       [0.00000000e+00, 3.30000000e+00, 5.70000000e+00, 0.00000000e+00,
        0.00000000e+00],
       [6.66666667e-01, 3.00000000e+00, 5.20000000e+00, 2.30000000e+00,
        1.88774812e+01],
       [5.55555556e-01, 2.50000000e+00, 5.00000000e+00, 1.90000000e+00,
        1.45444104e+01],
       [6.11111111e-01, 3.00000000e+00, 5.20000000e+00, 2.00000000e+00,
        1.73043578e+01],
       [5.27777778e-01, 3.40000000e+00, 5.40000000e+00, 2.30000000e+00,
        1.61163703e+01],
       [4.44444444e-01, 3.00000000e+00, 5.10000000e+00, 1.80000000e+00,
        1.21056037e+01]])
# Sort the rows by the first column (argsort gives the row order)
data_sort = data1[np.argsort(data1[:,0]), :]
print(data_sort)
[[0.         3.3        5.7        0.        ]
 [0.         2.7        0.         1.9       ]
 [0.         3.         1.1        0.1       ]
 [0.         3.         4.2        1.2       ]
 [0.02777778 3.2        1.3        0.2       ]
 [0.02777778 2.9        1.4        0.2       ]
 [0.02777778 3.         1.3        0.2       ]
 [0.05555556 2.3        1.3        0.3       ]
 [0.08333333 3.6        1.         0.2       ]
 [0.08333333 3.4        1.4        0.3       ]
 [0.08333333 3.2        1.4        0.2       ]
 [0.08333333 0.         1.5        0.2       ]
 [0.11111111 3.2        1.6        0.2       ]
 [0.11111111 3.2        1.3        0.2       ]
 [0.13888889 3.4        1.9        0.2       ]
 [0.13888889 3.1        1.6        0.2       ]
 [0.13888889 3.         1.4        0.3       ]
 [0.13888889 3.4        1.6        0.2       ]
 [0.13888889 3.         1.4        0.1       ]
 [0.16666667 2.5        4.5        0.        ]
 [0.16666667 2.4        3.3        1.        ]
 [0.16666667 3.6        1.4        0.1       ]
 [0.16666667 3.1        1.5        0.2       ]
 [0.16666667 3.1        1.5        0.1       ]
 [0.16666667 3.         1.4        0.2       ]
 [0.19444444 3.5        1.6        0.6       ]
 [0.19444444 3.2        1.2        0.2       ]
 [0.19444444 3.4        1.5        0.2       ]
 [0.19444444 3.         1.6        0.2       ]
 [0.19444444 3.3        0.         0.2       ]
 [0.19444444 2.         3.5        1.        ]
 [0.19444444 3.6        1.4        0.2       ]
 [0.19444444 2.3        3.3        1.        ]
 [0.19444444 3.4        1.6        0.4       ]
 [0.19444444 3.5        1.3        0.        ]
 [0.22222222 3.8        1.9        0.4       ]
 [0.22222222 3.8        1.6        0.2       ]
 [0.22222222 2.5        3.         1.1       ]
 [0.22222222 3.4        1.5        0.2       ]
 [0.22222222 3.5        1.4        0.2       ]
 [0.22222222 3.5        0.         0.3       ]
 [0.22222222 3.3        1.7        0.        ]
 [0.22222222 3.7        1.5        0.4       ]
 [0.22222222 3.8        1.5        0.3       ]
 [0.25       4.1        1.5        0.1       ]
 [0.25       3.4        1.4        0.2       ]
 [0.25       0.         0.         0.2       ]
 [0.25       2.7        3.9        1.4       ]
 [0.27777778 3.7        1.5        0.2       ]
 [0.30555556 3.7        1.5        0.2       ]
 [0.30555556 3.9        1.7        0.4       ]
 [0.30555556 3.9        1.3        0.4       ]
 [0.30555556 3.4        1.5        0.4       ]
 [0.30555556 3.         4.5        1.5       ]
 [0.30555556 3.4        1.7        0.2       ]
 [0.33333333 2.5        4.         1.3       ]
 [0.33333333 2.6        4.4        1.2       ]
 [0.33333333 2.4        3.7        1.        ]
 [0.33333333 3.5        1.3        0.2       ]
 [0.33333333 2.4        3.8        1.1       ]
 [0.33333333 4.2        1.4        0.2       ]
 [0.33333333 2.3        0.         1.3       ]
 [0.36111111 2.7        4.2        1.3       ]
 [0.36111111 3.         4.1        1.3       ]
 [0.36111111 2.8        4.9        2.        ]
 [0.36111111 2.9        3.6        1.3       ]
 [0.36111111 3.         4.5        1.5       ]
 [0.36111111 2.5        3.9        1.1       ]
 [0.38888889 3.8        1.7        0.3       ]
 [0.38888889 2.5        5.         2.        ]
 [0.38888889 2.9        4.2        1.3       ]
 [0.38888889 2.8        4.1        1.3       ]
 [0.38888889 4.4        1.5        0.4       ]
 [0.38888889 2.8        4.5        1.3       ]
 [0.38888889 2.6        3.5        1.        ]
 [0.41666667 2.7        4.1        1.        ]
 [0.41666667 2.8        5.1        2.4       ]
 [0.41666667 2.6        4.         1.2       ]
 [0.41666667 4.         1.2        0.2       ]
 [0.41666667 2.7        5.1        1.9       ]
 [0.41666667 2.7        3.9        1.2       ]
 [0.44444444 3.         5.1        1.8       ]
 [0.44444444 3.         4.2        1.5       ]
 [0.44444444 3.2        4.8        1.8       ]
 [0.47222222 2.7        5.1        1.6       ]
 [0.47222222 2.2        5.         1.5       ]
 [0.47222222 3.4        4.5        1.6       ]
 [0.47222222 2.2        4.         1.        ]
 [0.47222222 3.         4.8        1.8       ]
 [0.47222222 2.9        4.5        1.5       ]
 [0.5        3.         4.9        1.8       ]
 [0.5        2.6        5.6        1.4       ]
 [0.5        2.9        4.7        1.4       ]
 [0.5        2.8        4.7        1.2       ]
 [0.5        3.         4.6        1.4       ]
 [0.5        2.8        4.         1.3       ]
 [0.52777778 2.9        4.3        1.3       ]
 [0.52777778 2.2        4.5        1.5       ]
 [0.52777778 3.4        5.4        2.3       ]
 [0.52777778 2.8        4.8        1.8       ]
 [0.55555556 2.7        4.9        1.8       ]
 [0.55555556 2.8        5.1        1.5       ]
 [0.55555556 2.9        5.6        1.8       ]
 [0.55555556 3.3        4.7        1.6       ]
 [0.55555556 3.3        6.         2.5       ]
 [0.55555556 2.5        4.9        1.5       ]
 [0.55555556 2.3        4.4        0.        ]
 [0.55555556 2.5        5.         1.9       ]
 [0.55555556 3.4        5.6        2.4       ]
 [0.58333333 3.1        5.5        1.8       ]
 [0.58333333 2.8        5.6        2.2       ]
 [0.58333333 2.8        5.6        2.1       ]
 [0.58333333 3.2        0.         1.5       ]
 [0.58333333 3.2        5.3        2.3       ]
 [0.58333333 0.         4.3        1.3       ]
 [0.58333333 2.7        5.3        1.9       ]
 [0.61111111 3.         5.5        1.8       ]
 [0.61111111 3.         5.2        2.        ]
 [0.61111111 3.2        5.1        2.        ]
 [0.61111111 0.         5.8        2.2       ]
 [0.61111111 2.8        4.6        1.5       ]
 [0.63888889 2.9        4.6        1.3       ]
 [0.63888889 3.         4.4        1.4       ]
 [0.66666667 3.         5.2        2.3       ]
 [0.66666667 3.1        4.7        1.5       ]
 [0.66666667 3.1        5.6        2.4       ]
 [0.66666667 3.1        4.4        1.4       ]
 [0.66666667 3.         5.         1.7       ]
 [0.66666667 3.3        5.7        2.1       ]
 [0.66666667 2.5        5.8        1.8       ]
 [0.69444444 3.         5.5        2.1       ]
 [0.69444444 2.8        4.8        1.4       ]
 [0.69444444 3.2        5.9        2.3       ]
 [0.72222222 3.2        5.7        2.3       ]
 [0.72222222 3.1        5.4        2.1       ]
 [0.72222222 3.1        4.9        1.5       ]
 [0.72222222 3.1        5.1        2.3       ]
 [0.75       3.2        4.7        1.4       ]
 [0.77777778 3.         5.9        2.1       ]
 [0.80555556 3.         5.8        1.6       ]
 [0.80555556 3.6        6.1        2.5       ]
 [0.80555556 3.2        6.         1.8       ]
 [0.83333333 2.9        6.3        0.        ]
 [0.86111111 2.8        6.1        1.9       ]
 [0.91666667 3.         6.6        2.1       ]
 [0.94444444 2.6        6.9        2.3       ]
 [0.94444444 3.8        6.7        2.2       ]
 [0.94444444 3.         6.1        2.3       ]
 [0.94444444 2.8        6.7        2.        ]
 [1.         3.8        6.4        0.        ]]
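The row sort above can be tried on a much smaller array. This is a minimal sketch, assuming a toy array `demo` with made-up values (it is not the `data1` used above). Passing `kind="stable"` is an optional choice, not something the original code does: it keeps rows with equal keys in their original relative order.

```python
import numpy as np

# Toy array standing in for data1 (hypothetical values)
demo = np.array([[0.5, 3.0, 1.4],
                 [0.1, 2.9, 4.2],
                 [0.5, 3.4, 1.5],
                 [0.0, 3.3, 5.7]])

# argsort returns the row indices that put column 0 in ascending order;
# a stable sort keeps tied rows (0.5 and 0.5) in their original order
order = np.argsort(demo[:, 0], kind="stable")

# Fancy indexing with that order rearranges whole rows
demo_sorted = demo[order]
print(order)
print(demo_sorted)
```

Indexing with `demo[order]` is equivalent to `demo[order, :]`, the form used in the original line.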
# Find the mode: get the unique values, count how often each occurs, then take the one with the largest count.
data1_3 = np.unique(data1[:,2])
print(data1_3)
[0.  1.  1.1 1.2 1.3 1.4 1.5 1.6 1.7 1.9 3.  3.3 3.5 3.6 3.7 3.8 3.9 4.
 4.1 4.2 4.3 4.4 4.5 4.6 4.7 4.8 4.9 5.  5.1 5.2 5.3 5.4 5.5 5.6 5.7 5.8
 5.9 6.  6.1 6.3 6.4 6.6 6.7 6.9]
times = [np.sum(data1[:,2]==i) for i in data1_3]
print(times)
[6, 1, 1, 2, 7, 11, 12, 7, 4, 2, 1, 2, 2, 1, 1, 1, 3, 4, 3, 4, 2, 4, 7, 3, 5, 4, 5, 4, 7, 2, 2, 2, 3, 6, 3, 3, 2, 2, 3, 1, 1, 1, 2, 1]
times = np.array(times)
loc = np.where(times==np.max(times))
# Print the mode
print(data1_3[loc])
[1.5]
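The counting step can probably be shortened by letting `np.unique` return the counts itself. This is a minimal sketch, assuming a small made-up column `col` in place of `data1[:,2]`.

```python
import numpy as np

# Hypothetical column standing in for data1[:, 2]
col = np.array([1.5, 1.4, 1.5, 0.0, 1.4, 1.5, 4.7])

# return_counts=True yields the sorted unique values and how often each occurs
values, counts = np.unique(col, return_counts=True)

# Keep every value whose count equals the maximum (handles ties as well)
modes = values[counts == counts.max()]
print(modes)
```

Like the `np.where` version above, this returns all tied values if more than one shares the highest count.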
# Index of the first value greater than a given threshold
judge = data1[:,3]>1.0
print(judge)
[False False False False False False False False False False False False
 False False False False False False False False False False False False
 False False False False False False False False False False False False
 False False False False False False False False False False False False
 False False  True  True  True  True  True  True  True False  True  True
 False  True False  True  True  True  True False  True  True  True  True
  True  True  True  True  True  True  True False  True False  True  True
  True  True  True False  True  True  True  True  True False  True  True
  True  True  True  True  True  True  True  True  True  True False False
  True  True  True  True  True  True  True  True  True  True  True  True
  True  True  True  True  True  True  True  True  True  True  True False
  True  True  True  True  True  True  True  True  True  True  True  True
 False  True  True  True  True  True]
for i in range(len(judge)):
    # judge is already boolean, so no comparison with True is needed
    if judge[i]:
        print(i)
        break
50
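The loop can also be written without explicit iteration. This is a minimal sketch, assuming a made-up column `col` in place of `data1[:,3]`. It relies on `np.argmax` returning the index of the first `True` in a boolean array, and the `-1` fallback for "no match" is a convention chosen here, not part of the original.

```python
import numpy as np

# Hypothetical column standing in for data1[:, 3]
col = np.array([0.2, 0.4, 1.0, 1.4, 0.3, 1.5])

mask = col > 1.0

# argmax on a boolean mask gives the position of the first True;
# it also returns 0 when nothing matches, so check mask.any() first
first = int(np.argmax(mask)) if mask.any() else -1
print(first)
```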