批量将PDF文档每一页转化为图片格式
一、案例:
需要批量将pdf文档每一页转为为png格式的图片,并已该pdf文档名+页码作为图片的命名。
二、代码:
import os
import fitz # PyMuPDF
from PIL import Image
def is_chart(image):
# Implement your chart detection logic here
return True # For demonstration purposes, always return True
def extract_and_save_charts(input_folder, output_folder):
for filename in os.listdir(input_folder):
if filename.endswith('.pdf'):
input_path = os.path.join(input_folder, filename)
pdf_document = fitz.open(input_path)
for page_num in range(pdf_document.page_count):
page = pdf_document[page_num]
# Convert the PDF page to an image (PNG format)