def train_test_split(*arrays, **options):
    """Split arrays or matrices into random train and test subsets

    Quick utility that wraps calls to ``check_arrays`` and
    ``next(iter(ShuffleSplit(n_samples)))`` and application to input
    data into a single call for splitting (and optionally subsampling)
    data in a oneliner.

    Parameters
    ----------
    *arrays : sequence of arrays or scipy.sparse matrices with same shape[0]
        Python lists or tuples occurring in arrays are converted to 1D numpy
        arrays.
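This is a passage from the source code. The *arrays parameter is where the so-called train_data and train_target go, and the docstring says clearly that each one is an array or a matrix. If your data is in a text file, read it into a list or store it with the numpy module. Further down, the source reads:
    n_arrays = len(arrays)
    if n_arrays == 0:
        raise ValueError("At least one array required as input")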
In other words, *arrays can take just a single array or matrix when that is all you need, in this form: X_train, X_test = cross_validation.train_test_split(train_data, test_size=0.4, random_state=0)
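To make the role of *arrays concrete, here is a rough standard-library sketch of a function that accepts any number of arrays and splits them all with the same shuffled indices. The name simple_split, the rounding of the test-set size, and the sample data are assumptions made for illustration. This is not scikit-learn's implementation and will not reproduce its exact shuffling.

```python
import random


def simple_split(*arrays, test_size=0.4, random_state=None):
    """Hypothetical stand-in for train_test_split, for illustration only."""
    # Same guard as in the quoted source: at least one array is required.
    if len(arrays) == 0:
        raise ValueError("At least one array required as input")

    # Every array must have the same number of rows (same shape[0]).
    n_samples = len(arrays[0])
    if any(len(a) != n_samples for a in arrays):
        raise ValueError("All arrays must have the same length")

    # Shuffle the row indices once so every array is split the same way.
    indices = list(range(n_samples))
    random.Random(random_state).shuffle(indices)

    # Assumption: the test-set size is simply rounded to the nearest integer.
    n_test = int(round(n_samples * test_size))
    test_idx, train_idx = indices[:n_test], indices[n_test:]

    # Return a train piece and a test piece for each input array, in order.
    result = []
    for a in arrays:
        result.append([a[i] for i in train_idx])
        result.append([a[i] for i in test_idx])
    return result


train_data = [[0, 1], [2, 3], [4, 5], [6, 7], [8, 9]]
train_target = [0, 1, 0, 1, 0]

# One array in, two pieces out.
X_train, X_test = simple_split(train_data, test_size=0.4, random_state=0)

# Two arrays in, four pieces out, with rows and labels kept aligned.
X_train2, X_test2, y_train, y_test = simple_split(
    train_data, train_target, test_size=0.4, random_state=0
)
```

With the real library the call pattern is the same: pass one array and unpack two results, or pass two arrays and unpack four. In newer scikit-learn releases the function is imported from sklearn.model_selection, since the cross_validation module was removed in version 0.20.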