sklearn的train_test_split函数的random_state

最新推荐文章于 2024-06-23 22:18:44 发布

mumu157

最新推荐文章于 2024-06-23 22:18:44 发布

阅读量1.1w

点赞数 13

分类专栏： sklearn Python 文章标签： python sklearn random_state

本文链接：https://blog.csdn.net/zhu_1997/article/details/89214966

版权

Python 同时被 2 个专栏收录

25 篇文章 1 订阅

订阅专栏

sklearn

1 篇文章 0 订阅

订阅专栏

我们使用sklearn进行机器学习之前，一般使用train_test_split来进行数据集的分割,其参数random_state代表什么呢？

>>>from sklearn.model_selection import train_test_split

>>> x = [1,2,3,4,5,6,7,8,9,10]
>>> y = [1,2,3,4,5,6,7,8,9,10]
x_train, x_test, y_train, y_test = train_test_split(
...     x, y, test_size=0.3)# 测试集比例为30%， random_state默认为None
>>> x_train, x_test
([7, 8, 3, 1, 9, 5, 2], [10, 6, 4])

#重新分割
>>> x_train, x_test, y_train, y_test = train_test_split(
...		x, y, test_size=0.3)
>>> x_train, x_test
([7, 8, 5, 4, 9, 1, 2], [6, 10, 3])
>>>

可以看到，random_state默认状态下，两次分割的结果不一样

>>> x_train, x_test, y_train, y_test = train_test_split(
...		x, y, test_size=0.3, random_state=1)
>>> x_train, x_test
([5, 1, 4, 2, 8, 9, 6], [3, 10, 7])
>>> x_train, x_test, y_train, y_test = train_test_split(
...		x, y, test_size=0.3, random_state=1)
>>> x_train, x_test
([5, 1, 4, 2, 8, 9, 6], [3, 10, 7])