目前在研究智能学习算法的时候,学习了一点python使用支持向量机做分类
首先,导入必要的算法包
from sklearn import svm
import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split
使用pandas内的函数读取文件
data = pd.read_csv("D:/Bank_dataset.csv", header=None)
选择样本特征集和样本结果
y, x = np.split(data, (1,), axis=1)
split(数据,分割位置,轴=1(水平分割) or 0(垂直分割))
使用train_test_split将读入的数据划分成训练集和测试集
x_train, x_test, y_train, y_test = train_test_split(x, y, random_state=1, test_size=0.2)
再接下来要训练SVM分类器
clf = svm.SVC(C=0.1, kernel='linear', decision_function_shape='ovr')
clf.fit(x_train, y_train.values.ravel())
训练完之后就可以算测试集或者是训练集准确度了
print(clf.score(x_test, y_test)) # 测试集准确度
print(clf.score(x_train, y_train)) # 训练集准确度
还可以使用predict方法进行对特征的预测结果,可以预测出结果
print(clf.predict([[1,2,1]]))
完整代码如下
from sklearn import svm
import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split
if __name__ == '__main__':
data = pd.read_csv("D:/Bank_dataset.csv", header=None)
y, x = np.split(data, (1,), axis=1)
x_train, x_test, y_train, y_test = train_test_split(x, y, random_state=1, test_size=0.2)
clf = svm.SVC(C=0.1, kernel='linear', decision_function_shape='ovr')
clf.fit(x_train, y_train.values.ravel())
print(clf.score(x_test, y_test)) # 测试集准确度
print(clf.score(x_train, y_train)) # 训练集准确度
print(clf.predict([[1,2,1]]))
python中使用支持向量机做分类还是不难的