sklearn Study Notes 1: model_selection

leon_lavie

Article link: https://blog.csdn.net/leon_lavie/article/details/82977404

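This article introduces the model_selection module of the sklearn library, focusing on the train_test_split function, which randomly splits data into a training set and a test set. Setting the random_state parameter makes the split reproducible.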

I. Common functions

1. sklearn.model_selection.train_test_split: randomly splits arrays into a training set and a test set:

http://scikit-learn.org/stable/modules/generated/sklearn.model_selection.train_test_split.html#sklearn.model_selection.train_test_split

Parameters:

*arrays : sequence of indexables with same length / shape[0]

Allowed inputs are lists, numpy arrays, scipy-sparse matrices or pandas dataframes.

test_size : float, int or None, optional (default=0.25)

If float, should be between 0.0 and 1.0 and represent the proportion of the dataset to include in the test split. If int, represents the absolute number of test samples. If None, the value is set to the complement of the train size. By default, the value is set to 0.25. The default changed in version 0.21: it remains 0.25 only if train_size is unspecified; otherwise it complements the specified train_size.
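The arrays and test_size parameters described above can be tried out with a small sketch like the one below. The toy data (10 samples) and the variable names are made up for illustration and are not part of the original notes; random_state=0 is an arbitrary seed chosen only so that the split is repeatable.

```python
import numpy as np
from sklearn.model_selection import train_test_split

# Hypothetical toy data: 10 samples with 2 features each, one label per sample.
X = np.arange(20).reshape(10, 2)
y = np.arange(10)

# Float test_size: proportion of the dataset used for the test split
# (0.2 of 10 samples gives 2 test samples and 8 training samples).
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0
)
print(X_train.shape, X_test.shape)

# Int test_size: absolute number of test samples.
X_train_i, X_test_i, y_train_i, y_test_i = train_test_split(
    X, y, test_size=3, random_state=0
)
print(len(X_train_i), len(X_test_i))

# Neither test_size nor train_size given: the test split falls back to 0.25.
X_train_d, X_test_d = train_test_split(X, random_state=0)
print(len(X_train_d), len(X_test_d))

# Same random_state, same inputs: the split is identical on every call.
X_train_r, X_test_r, y_train_r, y_test_r = train_test_split(
    X, y, test_size=0.2, random_state=0
)
print(np.array_equal(X_test, X_test_r))
```

All arrays passed in one call are split with the same row indices, so each row of X_train still lines up with the matching entry of y_train.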