Java使用aspose.word完美实现docx转doc

Java使用aspose.word完美实现docx转doc,同时打开转后的doc文件也不会报错,处理效率高于Docx4j

一、处理逻辑

  • 1、使用aspose先将docx字节数组转为html数组,
  • 2、将html数组转为所需要的格式的文档(doc或docx)

二、实现代码

代码实现

    /**
     * docx字节数组转doc字节数组
     * @Title: docxToDoc
     * @Description: docx字节数组转doc字节数组
     * @param content
     * @return: byte
     */
    private static byte[] docxToDoc(byte[] content) {
        // docx字节数组转成html字符串
        String htmlStr = byteToHtmlStr(content);
        // html字节数组转doc字节数组
        return AsposeWordUtils.htmlToWord(htmlStr.getBytes(StandardCharsets.UTF_8), SaveFormat.DOC);
    }
    
    /**
     * word字节数组转为html字符串
     * @Title: byteToHtmlStr
     * @Description: word字节数组转为html字符串
     * @param content
     * @return: String
     */
    private static String byteToHtmlStr(byte[] content) {
        String result = "";
        try {
            byte[] htmlContent = AsposeWordUtils.wordToHtml(content);
            InputStream is = new ByteArrayInputStream(htmlContent);
            InputStreamReader streamReader = new InputStreamReader(is, StandardCharsets.UTF_8);
            BufferedReader reader = new BufferedReader(streamReader);
            String line;
            StringBuilder html = new StringBuilder();
            while ((line = reader.readLine()) != null) {
                html.append(line);
            }
            reader.close();
            result = String.valueOf(html);
        } catch (IOException e) {
            logger.error("html转字符串异常", e);
        }
        return result;
    }

AsposeWordUtils工具类

/**
     * html字节数组转word字节数组
     * @Title: htmlToWordTest
     * @Description: html字节数组转word字节数组
     * @param content html字节数组
     * @param toType 值为SaveFormat.DOCX或SavaFormat.Doc对应的值
     * @return: byte
     */
    public static byte[] htmlToWord(byte[] content, Integer toType) {
        byte[] result = new byte[1];
        try {
            ByteArrayOutputStream os = new ByteArrayOutputStream();
            InputStream is = new ByteArrayInputStream(content);
            Document doc = new Document();
            DocumentBuilder builder = new DocumentBuilder(doc);
            InputStreamReader streamReader = new InputStreamReader(is, StandardCharsets.UTF_8);
            BufferedReader reader = new BufferedReader(streamReader);
            String line;
            StringBuilder html = new StringBuilder();
            while ((line = reader.readLine()) != null) {
                html.append(line);
            }
            reader.close();
            builder.insertHtml(String.valueOf(html));
            doc.save(os, toType);
            log.info("html转word成功!");
            result = os.toByteArray();
        } catch (Exception e) {
            log.error("html转word失败!", e);
        }
        return result;
    }

    /**
     * word字节数组转html字节数组
     * @Title: wordToHtml
     * @Description: word字节数组转html字节数组
     * @param content doc、docx字节数组
     * @return: byte
     */
    public static byte[] wordToHtml(byte[] content) {
        byte[] result = new byte[1] ;
        try {
            ByteArrayOutputStream os = new ByteArrayOutputStream();
            InputStream sbs = new ByteArrayInputStream(content);
            Document document = new Document(sbs);
            HtmlSaveOptions options = new HtmlSaveOptions(SaveFormat.HTML);
            options.setExportImagesAsBase64(true);
            document.save(os, options);
            log.info("html转word成功!");
            result = os.toByteArray();
        } catch (Exception e) {
            log.error("word转html失败!", e);
        }
        return result;
    }
  • 1
    点赞
  • 7
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值