各位大佬,本人准备写一个通过正则表达式过滤非中文字字符的脚本,然后翻车了,求各位大佬指点迷津,本人python入门不久,有C和PHP的基础
代码如下
// An highlighted block
import re
ifn = r"C:\Users\zheng\Desktop\train.txt"
ofn = r"C:\Users\zheng\Desktop\train_output.txt"
infile = open(ifn,'rb')
outfile = open(ofn,'wb')
for eachline in infile.readlines():
lines = re.sub("[A-Za-z0-9\!\%\[\]\,\。]", eachline , 0 )
for x in lines:
print(x)
# outfile.write(s)
pass
infile.close
outfile.close