Reading PDF tables in Python: using Tabula to read a table from a PDF as strings

Python version: 3.7.6 (default, Jan 8 2020, 13:42:34) [Clang 4.0.1 (tags/RELEASE_401/final)]

Java version: openjdk version "13.0.2" 2020-01-14

OpenJDK Runtime Environment (build 13.0.2+8)

OpenJDK 64-Bit Server VM (build 13.0.2+8, mixed mode, sharing)

tabula-py version: 2.0.4

platform: Darwin-19.3.0-x86_64-i386-64bit

uname: uname_result(system='Darwin', node='MacBook-Pro-10.local', release='19.3.0', version='Darwin Kernel Version 19.3.0: Thu Jan 9 20:58:23 PST 2020; root:xnu-6153.81.5~1/RELEASE_X86_64', machine='x86_64', processor='i386')

linux_distribution: ('Darwin', '19.3.0', '')

mac_ver: ('10.15.3', ('', '', ''), 'x86_64')

None

'pages' argument isn't specified.Will extract only from page 1 by default.

Unnamed: 0 object

mpg object

cyl object

disp object

hp object

drat object

wt object

qsec object

vs object

am object

gear object

carb object

dtype: object

Unnamed: 0 mpg cyl disp hp drat wt qsec vs am gear carb

0 Mazda RX4 21.0 6 160.0 110 3.90 2.620 16.46 0 1 4 4

1 Mazda RX4 Wag 21.0 6 160.0 110 3.90 2.875 17.02 0 1 4 4

2 Datsun 710 22.8 4 108.0 93 3.85 2.320 18.61 1 1 4 1

3 Hornet 4 Drive 21.4 6 258.0 110 3.08 3.215 19.44 1 0 3 1

4 Hornet Sportabout 18.7 8 360.0 175 3.15 3.440 17.02 0 0 3 2

'pages' argument isn't specified.Will extract only from page 1 by default.

Unnamed: 0 object

mpg float64

cyl int64

disp float64

hp int64

drat float64

wt float64

qsec float64

vs int64

am int64

gear int64

carb int64

dtype: object

Unnamed: 0 mpg cyl disp hp drat wt qsec vs am gear carb

0 Mazda RX4 21.0 6 160.0 110 3.90 2.620 16.46 0 1 4 4

1 Mazda RX4 Wag 21.0 6 160.0 110 3.90 2.875 17.02 0 1 4 4

2 Datsun 710 22.8 4 108.0 93 3.85 2.320 18.61 1 1 4 1

3 Hornet 4 Drive 21.4 6 258.0 110 3.08 3.215 19.44 1 0 3 1

4 Hornet Sportabout 18.7 8 360.0 175 3.15 3.440 17.02 0 0 3 2
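The code that produced the output above is not included on this page. The environment dump looks like what `tabula.environment_info()` prints, and the two sets of dtypes are consistent with calling `tabula.read_pdf` once with `pandas_options={"dtype": str}` (every column is `object`) and once without it (numeric columns are inferred). The sketch below is a guess along those lines, not the original code: `read_first_table` is a hypothetical helper, and the small CSV sample only illustrates the same dtype choice with plain pandas, so it runs without a PDF or Java.

```python
import io

import pandas as pd


def read_first_table(pdf_path, as_strings=False):
    """Hypothetical helper: read the first table on page 1 of a PDF with tabula-py."""
    import tabula  # lazy import: requires tabula-py and a Java runtime

    # Assumption: {"dtype": str} is what yields the all-object columns shown above.
    pandas_options = {"dtype": str} if as_strings else None
    tables = tabula.read_pdf(pdf_path, pages=1, pandas_options=pandas_options)
    # tabula-py 2.x returns a list of DataFrames by default.
    return tables[0] if isinstance(tables, list) else tables


# pandas-only illustration of the same dtype choice (not tabula internals).
SAMPLE_CSV = "name,mpg,cyl,disp\nMazda RX4,21.0,6,160.0\nDatsun 710,22.8,4,108.0\n"

inferred = pd.read_csv(io.StringIO(SAMPLE_CSV))            # dtypes inferred per column
as_text = pd.read_csv(io.StringIO(SAMPLE_CSV), dtype=str)  # every cell stays a string

# Columns read as strings can still be converted selectively later.
mpg_numeric = pd.to_numeric(as_text["mpg"])

print(inferred.dtypes)
print(as_text.dtypes)
```

Reading everything as strings is mainly useful when values such as IDs or codes with leading zeros must not be reinterpreted as numbers; the columns that really are numeric can then be converted one at a time with `pd.to_numeric`.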
