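1. Shuffling a dataset is an important preprocessing step. sklearn provides a function for this, shuffle in sklearn.utils, which applies the same random permutation to every array passed to it. Here is a simple example: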
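import numpy as np
from sklearn.utils import shuffle

# Ten samples (one name per row) and their matching labels.
data = np.array([['王大'], ['王二'], ['王三'], ['王四'], ['王五'], ['王六'], ['王七'], ['王八'], ['王九'], ['王十']])
label = np.array([1, 2, 3, 4, 5, 6, 7, 8, 9, 10])
# shuffle reorders both arrays with the same permutation,
# so each name stays paired with its label.
data, label = shuffle(data, label)
print('data = \n', data, '\nlabel = ', label)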
Output (the order differs from run to run, because no random_state is set):
data =
 [['王六']
 ['王五']
 ['王四']
 ['王二']
 ['王八']
 ['王三']
 ['王七']
 ['王十']
 ['王大']
 ['王九']]
label =  [ 6  5  4  2  8  3  7 10  1  9]
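The output above shows that each name keeps its label after shuffling (王六 is still 6, 王五 is still 5, and so on). As a minimal sketch, the same property can be checked in code, and the shuffle can be made repeatable by passing random_state. The shorter five-row dataset, the seed value 0, and the helper names original_pairs, pairs_kept, and same_order are assumptions made for illustration; they are not part of the original example.

```python
import numpy as np
from sklearn.utils import shuffle

# A shorter, illustrative dataset (assumption: five rows are enough for a check).
data = np.array([['王大'], ['王二'], ['王三'], ['王四'], ['王五']])
label = np.array([1, 2, 3, 4, 5])

# Record which label belongs to which name before shuffling.
original_pairs = dict(zip(data[:, 0].tolist(), label.tolist()))

# Passing the same random_state (0 is an arbitrary choice) gives the same
# permutation on every call, which makes the shuffle reproducible.
data_a, label_a = shuffle(data, label, random_state=0)
data_b, label_b = shuffle(data, label, random_state=0)

# Check 1: every name is still paired with its original label.
pairs_kept = all(
    original_pairs[name] == lab
    for name, lab in zip(data_a[:, 0].tolist(), label_a.tolist())
)

# Check 2: two calls with the same seed produce identical results.
same_order = np.array_equal(data_a, data_b) and np.array_equal(label_a, label_b)

print('pairs kept:', pairs_kept)
print('same order with same seed:', same_order)
```

Note that shuffle returns shuffled copies, so the original arrays are left unchanged unless the result is assigned back to the same names, as the first example does.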