aspose.word删除分页符

Ask_Gra01

已于 2022-05-17 21:58:02 修改

阅读量1.3k

点赞数

分类专栏： aspose 文章标签： java 开发语言

于 2022-05-17 21:56:09 首次发布

本文链接：https://blog.csdn.net/sisto/article/details/124831069

版权

aspose 专栏收录该内容

1 篇文章 0 订阅

订阅专栏

Aspose.word用法都类似，此处使用aspose for java进行操作
项目需要将word去掉所有的分页符，再进行一级大纲为划分的分页

目标文件状态：

思考逻辑：遍历整个paragraphs节点下run节点，并取得分页符号节点后移除该节点

   public Document deletePageBreaker(String fileName) throws Exception{
        //获取文件
        InputStream inputStream = this.getClass().getClassLoader().getResourceAsStream(fileName + ".docx");
        Document doc = new Document(inputStream);
        for (Section section : doc.getSections()) {
            Body body = section.getBody();
            for (Paragraph paragraph : body.getParagraphs()) {
                for (Run run : paragraph.getRuns()) {
                    if("\f".equals(run.getText())){
                        run.remove();
                    }
                }
            }
        }
        return doc;
    }

但是此方法移除节点后会导致在原有的分页符位置中有换行符的残留，因为以文件节点的思路来说，run移除自身，但是原本的父级节点paragraph依旧存在（无内容）会以单个换行符进行占位

       InputStream inputStream = this.getClass().getClassLoader().getResourceAsStream(fileName + ".docx");
        Document doc = new Document(inputStream);
        for (Section section : doc.getSections()) {
            Body body = section.getBody();
            for (Paragraph paragraph : body.getParagraphs()) {
                for (int i = 0; i < paragraph.getRuns().getCount(); i++) {
                    Run run = paragraph.getRuns().get(i);
                    if("\f".equals(run.getText())&&paragraph.getRuns().getCount()==1){
                        paragraph.remove();
                    }
                }
            }
        }
        doc.save(HOME + "tee.docx");