前提:提取文本内容的文件必须是应用服务生成PDF文件,而非扫描的pdf文档,当前pdfplumber的版本为0.5.28
第一步:在服务应用的终端中使用下述命令安装pdfplumber包
poetry add pdfplumber
在输入了上述命令后,会在终端中弹出下述相关安装信息
PS D:\Code\python\poetry-demo> poetry add pdfplumber
Using version ^0.5.28 for pdfplumber
Updating dependencies
Resolving dependencies...
Writing lock file
Package operations: 7 installs, 0 updates, 0 removals
• Installing chardet (4.0.0)
• Installing pycryptodome (3.10.1)
• Installing sortedcontainers (2.4.0)
• Installing pdfminer.six (20200517)
• Installing pillow (8.3.1)
• Installing wand (0.6.7)
• Installing pdfplumber (0.5.28)
同时可以看到在对应服务的site-packages目录下会新增下述几个目录:
pdfminer
pdfminer.six-20200517.dist-info
pdfplumber
pdfplumber-0.5.28.d