长markdown文档的拆分与合并

最新推荐文章于 2023-03-27 22:46:26 发布

zdlwhereyougo

最新推荐文章于 2023-03-27 22:46:26 发布

阅读量1.4k

点赞数 2

分类专栏：嗑盐杂烩文章标签：编辑器

本文链接：https://blog.csdn.net/weixin_39361931/article/details/126263013

版权

嗑盐杂烩专栏收录该内容

23 篇文章 3 订阅

订阅专栏

目的

使用vscode写长markdown文档时，实时渲染会卡顿，不方便检查，拆分成每一部分比较好检查。
在导出全文的时候，需要保证交叉引用不会出问题，最好再合并成一个文档。万能python实现了这个功能。这个功能不难，参考了站内【羊城迷鹿】的文章，对他的函数进行了改动，更符合我的要求。

from os.path import basename
import re
import os
from urllib.parse import quote
import warnings
warnings.filterwarnings('ignore')

def get_subtitle(content):
    # 从md文件中获取h1标题
    # 返回日期和索引
    pattern = re.compile(r'#{1}(.*?)\n')#正则表达
    heads = pattern.findall(content)
    # print(heads)
    # 防止出现空h1标题
    subtitle = [h[1:] for h in heads if h[0] == ' ' and len(h) > 2]#以井号开头
    indexes = [content.find(title+'\n')-2 for title in subtitle]
    indexes.append(len(content))
    return subtitle, indexes

def save_md(path, article):
    # 保存分割后的文件
    with open(path, 'w+', encoding='utf8') as f:
        f.write(article)

def split_and_save_md(filepath,savepath):
    file_path,fullflname = os.path.split(filepath)
    fname,ext = os.path.splitext(fullflname)
    with open(filepath, 'r+', encoding='utf8') as f:
        content = f.read()
        if not os.path.exists(savepath):
            os.mkdir(savepath)
        sub_title, indexes = get_subtitle(content)
        for i in range(len(sub_title)):
            article_path = savepath+'/'+fname+'_'+sub_title[i]+'.md'
            if os.path.exists(article_path):
                continue
            article = content[indexes[i]:indexes[i+1]]
            save_md(article_path, article)
# 再定义一个合并文件的

# 合并的代码
def combine_md(filepath,savepath):
    md_list = os.listdir(savepath)#列出保存路径下所有的md文件
    file_path,fullflname = os.path.split(filepath)
    fname,ext = os.path.splitext(fullflname)
    fname_split=fname+'_'
    contents = []
    for md in md_list:
        if fname_split in md:
            md_file =savepath + '\\' + md
            with open(md_file, 'r', encoding='utf-8') as file:
                contents.append(file.read() + "\n")
    #构造输出路径
    output_path=savepath+'\\'+fname+'.md'
    with open(output_path,"w", encoding='utf-8') as file:
        file.writelines(contents)


#拆分成小部分，更好查看,原始文件所在目录
filepath=r'D:\BaiduNetdiskWorkspace\V6.md'
#拆分后文件所在目录，为了方便，需要新建一个文件夹
savepath=r'D:\BaiduNetdiskWorkspace\newV6t'
#执行拆分命令
split_and_save_md(filepath,savepath)
#执行合并命令
# combine_md(filepath,savepath)

zdlwhereyougo

关注

2
点赞
踩
1

收藏

觉得还不错? 一键收藏
0
评论
长markdown文档的拆分与合并

在导出全文的时候，需要保证交叉引用不会出问题，最好再合并成一个文档。万能python实现了这个功能。这个功能不难，参考了站内【羊城迷鹿】的文章，对他的函数进行了改动，更符合我的要求。使用vscode写长markdown文档时，实时渲染会卡顿，不方便检查，拆分成每一部分比较好检查。...
复制链接

扫一扫