kaggle入门digits Recognizer

最新推荐文章于 2021-06-05 18:37:25 发布

VIP文章 qccc_dm

最新推荐文章于 2021-06-05 18:37:25 发布

阅读量477

点赞数

分类专栏：数据挖掘文章标签： kaggle 机器学习数据挖掘

本文链接：https://blog.csdn.net/qccc_dm/article/details/52817355

版权

经典的数字识别问题，调用Knn, randforest, svm&pca这3种方法。

主要利用的是sklearn库，pandas库, numpy库

1.knn是是看了别人的博客，然后自己动手重复了一下，后来发现这种方法的提取数据太冗长了，后续会贴出更精炼的code

from numpy import *
import operator
import csv
def loadTrainData():
    l = []
    with open('train.csv') as file:
        lines = csv.reader(file)
        for line in lines:
            l.append(line)
    l.remove(l[0])
    l = array(l)
    label = l[:,0]
    data = l[:,1:]
    return nomalizing(toInt(data)),toInt(label)
    #label 1*42000 data 42000*784
    #return data label

def toInt(array):
    array = mat(array)
    m,n = shape(array)
    newArray = zeros((m,n))
    for i in xrange(m):
        for j in xrange(n):
            newArray[i,j] = int(array[i,j])
    return newArray

def nomalizing(array):
    m

最低0.47元/天解锁文章

优惠劵

qccc_dm

关注关注

0
点赞
踩
1

收藏

觉得还不错? 一键收藏
0
评论
kaggle入门digits Recognizer

经典的数字识别问题，调用Knn, randforest, svm&pca这3种方法。主要利用的是sklearn库，pandas库, numpy库1.knn是是看了别人的博客，然后自己动手重复了一下，后来发现这种方法的提取数据太冗长了，后续会贴出更精炼的codefrom numpy import *import operatorimport csvdef load
复制链接

扫一扫