python遇到错误跳过_python如何跳过错误继续运行，同时删除产生错误的文档

最新推荐文章于 2024-08-09 04:01:46 发布

weixin_39540020

最新推荐文章于 2024-08-09 04:01:46 发布

阅读量650

收藏 1

点赞数

文章标签： python遇到错误跳过

因为我用的package有bug有些文档不能处理当程序在读取这个文件的时候会出现mathdomainerror，所以我现在要实现的目的就是跳过这些error，同时删除产生error的文档。我的code如下所示：...

因为我用的package有bug有些文档不能处理当程序在读取这个文件的时候会出现math domain error，所以我现在要实现的目的就是跳过这些error，同时删除产生error的文档。

我的code如下所示：

首先建立一个excel文档，因为我需要把算的结果导出到excel里面，

然后就用了一个for loop直接运行上面所写的三个method,请前辈帮忙在我现有的code基础改一下达到我想要实现的目的。

import os,csv,nltk, math

from nltk.model.ngram import NgramModel

from nltk.probability import LidstoneProbDist

#open the csv file

fout = open("/Users//WN1.data.csv", "w")

outfilehandle = csv.writer(fout,

delimiter=",",

quotechar='"',

quoting=csv.QUOTE_NONNUMERIC)

localrow = []

localrow.append("File name")

localrow.append("Perplexity for unigram")

localrow.append("Perplexity for bigram")

localrow.append("Perplexity for trigram")

outfilehandle.writerow(localrow)

# unigram model

def unigram(file):

#read file

file_object = open(file)

ln=file_object.read()

words = nltk.word_tokenize(ln)

estimator = lambda fdist, bins: LidstoneProbDist(fdist, 0.2)

tt=NgramModel(1, words, estimator = estimator)

return tt.perplexity(words)

#bigram model

def bigram(file):

file_object = open(file)

ln=file_object.read()

words = nltk.word_tokenize(ln)

my_bigrams = nltk.bigrams(words)

#fdist = nltk.FreqDist(my_bigrams)

#lapalce smoothing

estimator = lambda fdist, bins: LidstoneProbDist(fdist, 0.2)

tt2=NgramModel(2, my_bigrams, estimator = estimator)

return tt2.perplexity(my_bigrams)

#trigram model

def trigram(file):

file_object = open(file)

ln=file_object.read()

words = nltk.word_tokenize(ln)

my_trigrams = nltk.trigrams(words)

#lapalce smoothing

estimator = lambda fdist, bins: LidstoneProbDist(fdist, 0.2)

tt3=NgramModel(3, my_trigrams, estimator = estimator)

return tt3.perplexity(my_trigrams)

#set the path of the folder

os.chdir("/Users/Documents/A")

s = os.getcwd()

#read files in the folder

files = os.listdir(s)

bg=0

for file in files:

uni = unigram(file)

bi=bigram(file)

tri=trigram(file)

localrow= []

localrow.append(file)

localrow.append(uni)

localrow.append(bi)

localrow.append(tri)

outfilehandle.writerow(localrow)

fout.close()

展开

weixin_39540020

关注

0
点赞
踩
1

收藏

觉得还不错? 一键收藏
0
评论
复制链接

分享到 QQ

分享到新浪微博

扫一扫

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。

余额充值