Java: Reading PDF bookmark names with iText

I am working with a single PDF containing multiple documents. Each document has a bookmark. I need to read the bookmark names for a reconciliation application that I am building. The code below is not working for me. I am trying to place the bookmark name in the title string. Can anyone provide any guidance? Thank you very much.

PdfReader reader = new PdfReader("C:\\Work\\Input.pdf");
List<HashMap<String, Object>> bookmarks = SimpleBookmark.getBookmark(reader);
for (int i = 0; i < bookmarks.size(); i++) {
    HashMap<String, Object> bm = bookmarks.get(i);
    String title = ((String) bm.get("Title"));
}

Solution

You are not taking into account that bookmarks are stored in a tree structure with branches and leaves (in the PDF specification, it's called the outline tree).

As @Todoy says in the comment section, your code works for the top level, but if you want to see all the titles, you need a recursive method that also looks at the "Kids" entry of each bookmark.

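// Imports use the iText 5 package names.
import com.itextpdf.text.DocumentException;
import com.itextpdf.text.pdf.PdfReader;
import com.itextpdf.text.pdf.SimpleBookmark;
import java.io.IOException;
import java.util.HashMap;
import java.util.List;

public void inspectPdf(String filename) throws IOException, DocumentException {
    PdfReader reader = new PdfReader(filename);
    // Top-level entries of the outline tree; null if the PDF has no bookmarks.
    List<HashMap<String, Object>> bookmarks = SimpleBookmark.getBookmark(reader);
    if (bookmarks != null) {
        for (int i = 0; i < bookmarks.size(); i++) {
            showTitle(bookmarks.get(i));
        }
    }
    reader.close();
}

@SuppressWarnings("unchecked")
public void showTitle(HashMap<String, Object> bm) {
    System.out.println((String) bm.get("Title"));
    // Child bookmarks are stored under "Kids"; the key is absent on a leaf.
    List<HashMap<String, Object>> kids = (List<HashMap<String, Object>>) bm.get("Kids");
    if (kids != null) {
        for (int i = 0; i < kids.size(); i++) {
            showTitle(kids.get(i));
        }
    }
}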

The showTitle() method is recursive. It calls itself if an examined bookmark entry has kids. With this code snippet, you can walk through all the branches and leaves of the outline tree.
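
Since the question is about collecting the names for a reconciliation step rather than printing them, the same recursion can gather the titles into a list. The sketch below is only an illustration, not part of the original answer. It assumes entries shaped like the ones above (a "Title" string and an optional "Kids" list) and builds them from plain java.util maps so that it runs without iText. The names BookmarkTitles, collectTitles and entry are hypothetical.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;

public class BookmarkTitles {

    // Appends each bookmark title to out, followed by the titles of its "Kids"
    // (depth-first, in document order).
    @SuppressWarnings("unchecked")
    static void collectTitles(List<HashMap<String, Object>> bookmarks, List<String> out) {
        if (bookmarks == null) {
            return; // no outline at all, or a leaf entry without kids
        }
        for (HashMap<String, Object> bm : bookmarks) {
            out.add((String) bm.get("Title"));
            collectTitles((List<HashMap<String, Object>>) bm.get("Kids"), out);
        }
    }

    // Hypothetical helper that builds one bookmark-like entry for the demo.
    static HashMap<String, Object> entry(String title, List<HashMap<String, Object>> kids) {
        HashMap<String, Object> bm = new HashMap<>();
        bm.put("Title", title);
        if (kids != null) {
            bm.put("Kids", kids);
        }
        return bm;
    }

    public static void main(String[] args) {
        // Made-up sample tree: one branch with two leaves, plus one top-level leaf.
        List<HashMap<String, Object>> kids = new ArrayList<>();
        kids.add(entry("Invoice 1", null));
        kids.add(entry("Invoice 2", null));

        List<HashMap<String, Object>> top = new ArrayList<>();
        top.add(entry("Invoices", kids));
        top.add(entry("Receipts", null));

        List<String> titles = new ArrayList<>();
        collectTitles(top, titles);
        System.out.println(titles); // prints [Invoices, Invoice 1, Invoice 2, Receipts]
    }
}
```

With a real file, the list returned by SimpleBookmark.getBookmark(reader) would take the place of the hand-built top list.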
