Rewriting the Chapter Headings of a TXT File
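This script rewrites the inconsistent chapter headings in a TXT file into one uniform format. It relies on a simple regular expression and the code is deliberately minimal, so it is fairly limited; adapt it yourself to whatever mess your own file contains.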
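# Examples of heading formats this script can rewrite (<name> stands for the chapter name):
# ===1. <name>===
# ===2、<name>===
# ===<name>===
# ===第1章<name>===   ("第1章" means "Chapter 1")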
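import re

# Both files use the platform default encoding; pass encoding=... to open()
# if the text is not decoded correctly on your system.
# 'a+' appends, so delete test.txt before re-running the script.
with open("《十万个为什么[无限]》.txt", "r") as f, open("test.txt", "a+") as fo:
    chapter_count = 0
    for line in f.readlines():
        # Any line containing "===" is treated as a chapter heading.
        if re.search('===', line) is not None:
            chapter_count += 1
            chapter_name = ''
            # Last character of the name, just before the trailing "===\n".
            i = len(line) - 5
            # Walk backwards until a separator that ends the old numbering.
            while line[i] not in [' ', '章', '=', '、', '.']:
                chapter_name = line[i] + chapter_name
                i -= 1
            # Emit the heading as "第N章:<name>" ("Chapter N: <name>").
            chapter_name = "第" + str(chapter_count) + "章:" + chapter_name + '\n'
            fo.write(chapter_name)
        else:
            # Body text is copied through unchanged.
            fo.write(line)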
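
The backward scan above stops at the first separator it meets, so a chapter name that itself contains a space, a period, or the character 章 gets cut short. As one possible adaptation, here is a minimal sketch that lets a single regular expression strip the old numbering instead. The names `HEADING` and `renumber` are hypothetical and not part of the original script, and the pattern only covers the four heading formats listed at the top; other formats would need their own alternatives.

```python
import re

# Assumed pattern: "===", an optional old number ("1.", "2、" or "第1章"),
# the chapter name, then a closing "===".
HEADING = re.compile(r'^===\s*(?:第\d+章|\d+[.、])?\s*(.*?)\s*===\s*$')


def renumber(lines):
    """Yield the lines with each ===...=== heading rewritten as 第N章:<name>."""
    count = 0
    for line in lines:
        match = HEADING.match(line)
        if match:
            count += 1
            yield "第{}章:{}\n".format(count, match.group(1))
        else:
            # Anything that is not a heading passes through unchanged.
            yield line
```

Since `renumber` accepts any iterable of lines, it can be fed an open file object and its result passed to `writelines` on the output file, which would replace the loop in the script above.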