Python 10-Fold Cross-Validation: (Data Mining, Introduction, Part 6) 10-Fold Cross-Validation and K-Nearest Neighbors

# K Nearest Neighbor Classifier for Pima dataset

import heapq
import random


class Classifier:

    def __init__(self, bucketPrefix, testBucketNumber, dataFormat, k):
        """a classifier will be built from files with the bucketPrefix
        excluding the file with testBucketNumber. dataFormat is a string that
        describes how to interpret each line of the data files. For example,
        for the mpg data the format is:

        "class num num num num num comment"
        """
        self.medianAndDeviation = []
        self.k = k
        # reading the data in from the file
        self.format = dataFormat.strip().split('\t')
        self.data = []
        # for each of the buckets numbered 1 through 10:
        for i in range(1, 11):
            # if it is not the bucket we should ignore, read in the data
            if i != testBucketNumber:
                filename = "%s-%02i" % (bucketPrefix, i)
                f = open(filename)
                lines = f.readlines()
                f.close()
                # skip the first line of each bucket file
                for line in lines[1:]:
                    fields = line.strip().split('\t')
                    ignore = []
                    vector = []
                    # j (not i) so the bucket counter above is not shadowed
                    for j in range(len(fields)):
                        if self.format[j] == 'num':
                            vector.append(float(fields[j]))
                        elif self.format[j] == 'comment':
                            ignore.append(fields[j])
                        elif self.format[j] == 'class':
                            classification = fields[j]
                    self.data.append((classification, vector, ignore))
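The listing above stops inside the constructor, so the classification and evaluation steps are not shown. What follows is a minimal sketch, not the original author's code, of how (classification, vector, ignore) tuples like those collected in self.data could feed a k-nearest-neighbor vote and a ten-fold cross-validation loop. The names manhattan, knn_classify and cross_validate, the choice of Manhattan distance, and the in-memory toy buckets are all assumptions made for illustration. The sketch also skips the normalization that the medianAndDeviation attribute hints at.

```python
import heapq
from collections import Counter


def manhattan(v1, v2):
    """Sum of absolute coordinate differences of two equal-length vectors."""
    return sum(abs(a - b) for a, b in zip(v1, v2))


def knn_classify(data, item_vector, k):
    """Return the majority class among the k records nearest to item_vector.

    data is a list of (classification, vector, ignore) tuples.
    """
    # nsmallest picks the k closest records without sorting the whole list
    neighbors = heapq.nsmallest(
        k, data, key=lambda record: manhattan(item_vector, record[1]))
    votes = Counter(record[0] for record in neighbors)
    # most_common(1) returns [(class, count)] for the top-voted class
    return votes.most_common(1)[0][0]


def cross_validate(buckets, k):
    """Hold out each bucket once, train on the others, return the accuracy."""
    correct = 0
    total = 0
    for test_index, test_bucket in enumerate(buckets):
        # training set = every record that is not in the held-out bucket
        training = [record
                    for i, bucket in enumerate(buckets) if i != test_index
                    for record in bucket]
        for actual, vector, _ in test_bucket:
            if knn_classify(training, vector, k) == actual:
                correct += 1
            total += 1
    return correct / total


# Hypothetical toy data (not the Pima dataset): ten buckets, each holding
# one record of class '0' near the origin and one of class '1' far away.
buckets = [[('0', [float(i), 0.0], []), ('1', [float(i) + 100.0, 0.0], [])]
           for i in range(10)]

print(knn_classify(buckets[0] + buckets[1], [0.5, 0.0], 1))
print(cross_validate(buckets, 3))
```

In the file-based version above, the same loop would instead construct one Classifier per value of testBucketNumber from 1 to 10 and test it against the held-out bucket file.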
