PyPDF2 merge pdf and keep bookmarks

关于python 处理pdf的第三方库有很多,基础的是ReportLab
参考 reportlab user guide
Py2pdf 实际上是ReportLab 的Demo, ReportLab中没有直接合并pdf和添加bookmarks的工具, Py2pdf中有经过二次开发得到的这样的工具

from __future__ import print_function
from sys import argv
from PyPDF2 import PdfFileMerger, PdfFileReader
import os
class bookmark_class:
    def __init__(self, title, page_num, parent = 0):
        self.page_num = page_num
        self.parent = parent
        self.title = title
        
    def print_content(self):
        print("page_num: ", self.page_num)
        print("parent: ", self.parent)
        print("title: ", self.title)

# get bookmark info from the pdf outlines
def bookmark_list(bookmark_list, parent = 0):
    print("bookmark:", bookmark_list)
    result = []
    parent_suanz = parent
    for item in bookmark_list:
        if isinstance(item, list):
            result += bookmark_list(item, parent_suanz)
        else:
            bookmark = bookmark_class(item.get("/Title"), item.get("/Page"), 
                                      parent)
            result.append(bookmark)
            print("bookmark_content:")
            bookmark.print_content()
        parent_suanz = result[-1].page_num
    return result

# merge pdf, keep origin bookmarks and add a higher bookmark for each pdf
# if there is marks, one mark corresponds to one pdf
def merge_pdf(pdfs, target_file = "res.pdf", marks = None):
    if pdfs:
        merger = PdfFileMerger()
        page_num = 0
        bookmark_r = [None]
        for index, pdf_file in enumerate(pdfs):
            if os.path.isfile(pdf_file):
                try:
                    file_suanz = PdfFileReader(open(pdf_file, 'rb'))
                    suanz_num = file_suanz.getNumPages()
                    print("suanz_num, page_num: ", suanz_num, page_num)
                    bookmark_r += [None]*suanz_num
                    bookmarks = bookmark_list(file_suanz.getOutlines(), 
                                              page_num)
                    merger.append(file_suanz)
                    print("give bookmark pdffile, ", page_num)
                    mark = pdf_file
                    if marks:
                        if len(marks) > index:
                            mark = marks[index]
                    bookmark_r[page_num] = merger.addBookmark(mark, 
                                                              page_num)
                    for bookmark in bookmarks:
                        print("give back bookmark, ", 
                              page_num + bookmark.page_num)
                        print("use back bookmark, ", bookmark.parent)
                        bookmark_suanz = merger.addBookmark(
                            bookmark.title, 
                            page_num + bookmark.page_num, 
                            bookmark_r[bookmark.parent])
                        if not bookmark_r[page_num + bookmark.page_num]:
                            bookmark_r[page_num + bookmark.page_num] = \
                            bookmark_suanz
                    page_num += suanz_num
                except:
                    print("merge pdf, but {} read failed".format(pdf_file))
        merger.write(target_file)

要在Python中使用PyPDF2库添加水印,你可以按照以下步骤操作: 1. 首先,确保已经安装了PyPDF2库。你可以使用pip install PyPDF2命令来进行安装。 2. 导入PyPDF2模块,并打开要添加水印的PDF文件。 3. 创建一个PdfReader对象,用于读取PDF内容。 4. 创建一个PdfWriter对象,用于写入修改后的PDF内容。 5. 创建另一个PdfReader对象,用于读取包含水印的PDF文件。 6. 使用merge_page()方法将要添加水印的页面与水印文件的第一页合并。 7. 将合并后的页面添加到PdfWriter对象中。 8. 最后,使用write()方法将修改后的PDF内容写入新的PDF文件。 下面是一个示例代码,演示了如何使用PyPDF2库在Python中添加水印: ```python import PyPDF2 # 打开要添加水印的PDF文件 pdf_file = open('input.pdf', 'rb') # 创建PdfReader对象,读取PDF内容 pdf_reader = PyPDF2.PdfReader(pdf_file) # 创建PdfWriter对象,用于写入修改后的PDF内容 pdf_writer = PyPDF2.PdfWriter() # 打开包含水印的PDF文件 watermark_file = open('watermark.pdf', 'rb') # 创建PdfReader对象,读取水印文件 watermark_reader = PyPDF2.PdfReader(watermark_file) # 遍历每一页,并将水印添加到页面上 for page in pdf_reader.pages: # 合并页面与水印文件的第一页 page.merge_page(watermark_reader.pages<span class="em">1</span><span class="em">2</span><span class="em">3</span> #### 引用[.reference_title] - *1* [Python中通过PyPDF2实现PDF添加水印](https://blog.csdn.net/weixin_42311427/article/details/113493188)[target="_blank" data-report-click={"spm":"1018.2226.3001.9630","extra":{"utm_source":"vip_chatgpt_common_search_pc_result","utm_medium":"distribute.pc_search_result.none-task-cask-2~all~insert_cask~default-1-null.142^v93^chatsearchT3_1"}}] [.reference_item style="max-width: 50%"] - *2* *3* [python PyPDF2处理PDF文件](https://blog.csdn.net/username666/article/details/126363097)[target="_blank" data-report-click={"spm":"1018.2226.3001.9630","extra":{"utm_source":"vip_chatgpt_common_search_pc_result","utm_medium":"distribute.pc_search_result.none-task-cask-2~all~insert_cask~default-1-null.142^v93^chatsearchT3_1"}}] [.reference_item style="max-width: 50%"] [ .reference_list ]
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值