Train-Test-Split

以鸢尾花的数据为例:

import numpy as np
import matplotlib.pyplot as plt
from sklearn import datasets

iris = datasets.load_iris()

X = iris.data
y = iris.target

X.shape

y.shape

### train_test_split

y

对第0149个索引进行乱序排列

shuffle_indexes = np.random.permutation(len(X))

shuffle_indexes

test_ratio = 0.2
test_size = int(len(X)*test_ratio)

test_size

test_indexes = shuffle_indexes[:test_size]
train_indexes =  shuffle_indexes[test_size:]

X_train = X[train_indexes]
y_train = y[train_indexes]

X_test = X[test_indexes]
y_test = y[test_indexes]

print(X_train.shape)
print(y_train.shape)

print(X_test.shape)
print(y_test.shape)
### sklearn中的train_test_split

from sklearn .model_selection import train_test_split

X_train,X_test,y_train,y_test = train_test_split(X,y,test_size=0.2,random_state=666)

print(X_train.shape)
print(y_train.shape)

print(X_test.shape)
print(y_test.shape)

二者结果一致

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值