python pdfplumber 转换多页PDF表格为Excel

llrraa2010

已于 2022-02-23 18:34:52 修改

阅读量1.5k

点赞数

分类专栏： python 文章标签： python 数据分析开发语言

于 2022-02-23 18:33:10 首次发布

本文链接：https://blog.csdn.net/llrraa2010/article/details/123096546

版权

python 专栏收录该内容

24 篇文章 0 订阅

订阅专栏

import pdfplumber as pr
import pandas as pd
pdf = pr.open('21.PDF')
ps = pdf.pages
i1 = 0
table1 = [[0 for i in range(20)] for j in range(200)]
for p in range(9):
    pg = ps[p]
    tables = pg.extract_tables()
    table = tables[0]
    print(table)
    df = pd.DataFrame(table[1:],columns = table[0])

    for i in range(len(table)):
        for j in range(len(table[i])):
            #table[i][j] = table[i][j].replace('\n','')
            table1[i1][j] = table[i][j]
        i1 = i1+1
    df1 = pd.DataFrame(table1[1:],columns = table1[0])
    df1.to_excel('1.xlsx')

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

llrraa2010

关注关注

0
点赞
踩
5

收藏

觉得还不错? 一键收藏
0
评论
python pdfplumber 转换多页PDF表格为Excel

import pdfplumber as primport pandas as pdpdf = pr.open('21.PDF')ps = pdf.pagesi1 = 0table1 = [[0 for i in range(20)] for j in range(200)]for p in range(9): pg = ps[p] tables = pg.extract_tables() table = tables[0] print(table) df
复制链接

扫一扫