功能描述:从new_4.txt中读取出数据,然后用jieba分词,最后保存到new_5.txt中.
实验环境:Python3.7
代码实现:
import jieba
f_out = open('./new_5.txt','wb+')
with open('./new_4.txt','r',encoding = 'utf-8') as f:
for line in f.readlines():
seg = jieba.cut(line, cut_all = False)
s='/'.join(seg)
m = list(s)
for word in m:
f_out.write(word.encode('utf-8'))
f.close()
f_out.close()