简介
Apache PDFBox® - A Java PDF Library
The Apache PDFBox® library is an open source Java tool for working with PDF documents. This project allows creation of new PDF documents, manipulation of existing documents and the ability to extract content from documents. Apache PDFBox also includes several command-line utilities. Apache PDFBox is published under the Apache License v2.0.
特征
- 提取文本
- 拆分合并
- 预检
- 另存为图像
- 创建文件
用法介绍
可以使用命令行的方式,实现对pdf文件的拆分等操作。比如:
java -jar pdfbox-app-2.0.3.jar PDFSplit -split 1 -startPage 1 -outputPrefix $(basename 文件名称) 文件名称
实现对指定pdf文件,按页拆分并生成文件。