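I ran into a dataset for which I could not find a ready-made .mat file, so I worked out how to build one myself. To start, I use a simple, well-known dataset as an example. The classic introductory dataset in machine learning is the iris dataset. UCI provides iris.data but no iris.mat, which makes it good practice material. There are plenty of introductions to iris online; in the official description (iris.names), the parts that matter most are the following: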
The data set contains 3 classes of 50 instances each,
where each class refers to a type of iris plant. One class is
linearly separable from the other 2; the latter are NOT linearly
separable from each other.
The 35th sample should be: 4.9,3.1,1.5,0.2,"Iris-setosa"
where the error is in the fourth feature.
The 38th sample: 4.9,3.6,1.4,0.1,"Iris-setosa"
where the errors are in the second and third features.
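The two corrections above can be applied in code before the data is converted. The following is only a minimal sketch: the names `CORRECTIONS`, `parse_iris_line` and `apply_corrections` are hypothetical helpers made up for illustration, and it assumes the rows are kept in file order so that sample 35 is the 35th line of iris.data. Whether a given copy of iris.data still contains the errors should be checked first.

```python
# Corrected rows as listed in iris.names, keyed by 1-based sample number.
CORRECTIONS = {
    35: (4.9, 3.1, 1.5, 0.2, "Iris-setosa"),
    38: (4.9, 3.6, 1.4, 0.1, "Iris-setosa"),
}


def parse_iris_line(line):
    """Parse one 'a,b,c,d,label' line into four floats and a label string."""
    parts = line.strip().split(",")
    if len(parts) != 5:
        raise ValueError("expected 5 comma-separated fields: %r" % line)
    features = tuple(float(p) for p in parts[:4])
    label = parts[4].strip('"')
    return features + (label,)


def apply_corrections(rows, corrections=CORRECTIONS):
    """Return a copy of rows with the documented samples replaced."""
    fixed = list(rows)
    for sample_no, row in corrections.items():
        index = sample_no - 1  # sample numbers are 1-based, lists are 0-based
        if index < len(fixed):
            fixed[index] = row
    return fixed
```

A possible usage would be to read iris.data line by line, skip blank lines, pass each remaining line to `parse_iris_line`, and then call `apply_corrections` on the resulting list. The original list is left unchanged, so the two versions can be compared.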
5. Number of Instances: 150 (50 in each of three classes)
6. Number of Attributes: 4 numeric, predictive attributes and the class
7. Attribute Information:
1. sepal length in cm
2. sepal width in cm
3. petal length in cm