for page in PDFPage.create_pages(document):
interpreter.process_page(page)
layout = device.get_result()
for x in layout:
with open('a.txt','a') as f:
f.write(x.get_text().encode('utf-8'))
代码如上,报错了
解决办法:
加上判断语句 if (isinstance(x, LTTextBoxHorizontal)):
使用isinstance函数,判断是否是LTTextBoxHorizontal格式