Hold-out data set
Hold-out is when you split up your dataset into a ‘train’ and ‘test’ set. The training set is what the model is trained on, and the test set is used to see how well that model performs on unseen data. A common split when using the hold-out method is using 80% of data for training and the remaining 20% of the data for testing
Hold-out data set 是当您将数据集拆分为“训练”和“测试”集时。训练集是模型训练的对象,测试集用于查看模型在未见过的数据上的表现。使用留出方法时的常见拆分是使用 80% 的数据进行训练,其余 20% 的数据用于测试。