题目
In the Second ML assignment you have to compare the performance of three different classification algorithms, namely Naive Bayes, SVM, and Random Forest.
For this assignment you need to generate a random binary classification problem, and then train and test (using 10-fold cross validation) the three algorithms. For some algorithms inner cross validation (5-fold) for choosing the parameters is needed. Then, show the classification performace (per-fold and averaged) in the report, and briefly discussing the results.
步骤
1.Create a classification dataset (n_samples >= 1000, n_features >= 10)
from sklearn import datasets
from sklearn import cross_validation
data = datasets.make_classification(n_samples =