04-23.eri-test 答案:使用PDFbox确定文档中单词的坐标

\n

I\'m working on extract data from PDF files. This post helps me to determine for the coordinate position by word searching.

\n\n\n
\n \n
\n \n

take a look on this, I think it\'s what you need.

\n

https://jackson-brain.com/using-pdfbox-to-locate-text-coordinates-within-a-pdf-in-java/

\n\n

Here is the code:

\n\n
import java.io.File;\nimport java.io.IOException;\nimport java.text.DecimalFormat;\nimport java.util.ArrayList;\nimport java.util.Arrays;\nimport java.util.List;\n\nimport org.apache.pdfbox.exceptions.InvalidPasswordException;\nimport org.apache.pdfbox.pdmodel.PDDocument;\nimport org.apache.pdfbox.pdmodel.PDPage;\nimport org.apache.pdfbox.pdmodel.common.PDStream;\nimport org.apache.pdfbox.util.PDFTextStripper;\nimport org.apache.pdfbox.util.TextPosition;\n\npublic class PrintTextLocations extends PDFTextStripper {\n\npublic static StringBuilder tWord
\xe2\x80\xa6\n \n
\n
\n \n Open Full Answer\n \n
\n
\n\n\n\n
  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值