Python Office Automation: Cut the Tedious Manual Work, a 9-to-5 Five-Day Week Is No Dream (Worth Bookmarking!)

2301_76268112

Published 2024-04-27 11:24:17

Tags: python c# windows

Article link: https://blog.csdn.net/2301_76268112/article/details/138245215

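This article shows how to use Python for batch office tasks: merging data read from multiple Excel files into a single file, converting Word documents to PDF, and generating contracts from a Word template and an Excel sheet. It covers reading and writing Excel files, editing Word documents, and batch file conversion.

(Summary generated automatically by CSDN.)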

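    import xlrd
    import xlwt
    from xlutils.copy import copy


    def all_to_one(root, allFile_url, file_name, title=None, have_title=True):
        # Signature reconstructed from the call in the main block; `top` (the
        # number of header rows) is the module-level value set there.
        # add_row() opens file_name, so the output workbook must exist first.
        create_excel(file_name, title)

        list_row_data = []
        for f, file_url in enumerate(allFile_url):
            # Open the workbook (xlrd 2.0 and later reads only .xls files)
            print('Opening %s' % file_url)
            excel = xlrd.open_workbook(file_url)
            # Get a sheet by index; here, the first sheet
            table = excel.sheet_by_index(0)
            print('Rows: %d, columns: %d' % (table.nrows, table.ncols))

            # Walk over every row of the sheet
            for i in range(table.nrows):
                # Keep the header rows (the first `top` rows) from the first
                # file only: top is 2 for a two-row header, 1 for a one-row header
                if have_title and i < top and f != 0:
                    continue
                list_row_data.append(table.row_values(i))  # whole row as a list

        print('Total rows collected: %d' % len(list_row_data))
        # Write everything to the combined workbook
        add_row(list_row_data, file_name)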

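Create an Excel file named file_name, with title as its header row:

    def create_excel(file_name, title):
        print('Creating file %s' % file_name)
        a = xlwt.Workbook()
        # Add a new sheet
        table = a.add_sheet('sheet1', cell_overwrite_ok=True)
        # Writing the header is left commented out: the merge keeps the
        # header rows of the first source file instead
        # for i in range(len(title)):
        #     table.write(0, i, title[i])
        a.save(file_name)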

Append n rows of data to the file:

    def add_row(list_row_data, file_name):
        # Open the existing workbook
        allExcel1 = xlrd.open_workbook(file_name)
        sheet = allExcel1.sheet_by_index(0)
        # Make a writable copy to append to (copy comes from xlutils.copy)
        allExcel2 = copy(allExcel1)
        sheet2 = allExcel2.get_sheet(0)

        # Write the new rows below the rows already in the sheet
        for i, row_data in enumerate(list_row_data):
            for j, value in enumerate(row_data):
                sheet2.write(sheet.nrows + i, j, value)
        # Save, overwriting the original file
        allExcel2.save(file_name)
        print('Merge complete')

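    if __name__ == '__main__':
        # Folder holding the source workbooks ("报表合并" = "report merging").
        # The backslash is a special character in strings; the r prefix makes
        # this a raw string, so sequences such as "\t", "\r" or "\01" are not
        # interpreted as escapes.
        file_dir = r'.\01 报表合并\word'
        # Number of header rows at the top of each workbook
        top = 2
        # Name of the output workbook that receives the merged data
        file_name = 'save_demo.xls'

        # Get the folder path, its sub-folders, and its files
        root, dirs, files = get_allfile_msg(file_dir)
        # Join the folder path and each file name into a list of full paths
        allFile_url = get_allfile_url(root, files)
        # have_title defaults to True; when True, the header rows are skipped
        # in every file except the first
        all_to_one(root, allFile_url, file_name=file_name, title=None, have_title=True)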
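The main block calls `get_allfile_msg` and `get_allfile_url`, whose definitions are not shown in the listing. Below is a minimal sketch of what they might look like, inferred only from how they are called. The `.xls`/`.xlsx` filter, the sorting, and the use of `os.path.join` are assumptions, not the author's code.

```python
import os


def get_allfile_msg(file_dir):
    """Return (root, dirs, files) for the top level of file_dir only."""
    for root, dirs, files in os.walk(file_dir):
        # Assumption: only Excel workbooks are wanted, and Office lock
        # files (names starting with "~$") should be ignored.
        excel_files = sorted(
            name for name in files
            if name.lower().endswith(('.xls', '.xlsx'))
            and not name.startswith('~$')
        )
        # Returning inside the loop stops after the top-level folder.
        return root, dirs, excel_files
    # os.walk yielded nothing: file_dir is missing or unreadable.
    return file_dir, [], []


def get_allfile_url(root, files):
    """Join the folder path and each file name into a full path."""
    return [os.path.join(root, name) for name in files]
```

Because xlrd 2.0 and later reads only the legacy `.xls` format, narrowing the filter to `.xls` may be the safer choice when a recent xlrd is installed.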


![Image](https://img-blog.csdnimg.cn/img_convert/34645fb5050d45affb6d0bf7b76ca703.png)
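The header-skipping rule in the merge loop (keep the first `top` rows from the first file only) can be checked without any Excel files. Here is a small sketch using plain lists in place of sheets; `merge_rows` and the sample data are made up for illustration.

```python
def merge_rows(tables, top=1, have_title=True):
    """Concatenate row lists, keeping the `top` header rows of the first table only."""
    merged = []
    for f, rows in enumerate(tables):
        for i, row in enumerate(rows):
            # Same condition as the merge loop: skip the header rows of
            # every table except the first one.
            if have_title and i < top and f != 0:
                continue
            merged.append(row)
    return merged


# Two hypothetical "sheets", each with a one-row header.
jan = [['name', 'amount'], ['Alice', 10], ['Bob', 20]]
feb = [['name', 'amount'], ['Carol', 30]]

for row in merge_rows([jan, feb], top=1):
    print(row)
```

Setting `top` larger than the real header height silently drops data rows from every file after the first, so it is worth checking the value against the actual workbooks.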


![Image](https://img-blog.csdnimg.cn/img_convert/430fa487bec08fe621120e91e911932f.png)
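The script's comment about the `r` prefix is worth verifying, because a Windows path written as an ordinary string can change silently. A short sketch (the folder name `reports\table` is invented for the demonstration):

```python
# Without the r prefix, "\01" is read as an octal escape (character code 1)
# and "\t" as a tab, so the string no longer contains any backslash.
plain = '.\01 reports\table'
# With the r prefix every backslash is kept literally.
raw = r'.\01 reports\table'

print(repr(plain))
print(repr(raw))
# The raw string is longer: "\01" collapsed 3 characters into 1, "\t" 2 into 1.
print(len(raw) - len(plain))
```

Building paths with `os.path.join` or `pathlib.Path` avoids hand-written backslashes altogether.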


### Batch Word-to-PDF conversion

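    import os

    import pythoncom
    import win32com.client


    class Word_2_PDF(object):

        def __init__(self, filepath, Debug=False):
            """
            :param filepath: full path of the Word document to open
            :param Debug: if True, show the Word window while converting
            """
            self.wordApp = win32com.client.Dispatch('word.Application')
            self.wordApp.Visible = Debug
            self.myDoc = self.wordApp.Documents.Open(filepath)

        def export_pdf(self, output_file_path):
            """
            Export the open Word document as a PDF file.
            :param output_file_path: full path of the PDF to write
            :return: None
            """
            # 17 = wdExportFormatPDF, Item=7 = wdExportDocumentWithMarkup,
            # CreateBookmarks=0 = wdExportCreateNoBookmarks
            self.myDoc.ExportAsFixedFormat(output_file_path, 17, Item=7, CreateBookmarks=0)

        def close(self):
            # Close the document without saving changes, then quit Word
            self.myDoc.Close(0)
            self.wordApp.Quit()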

    if __name__ == '__main__':
        rootpath = os.getcwd()  # folder to scan for Word documents
        pythoncom.CoInitialize()

        for parent, dirnames, filenames in os.walk(rootpath):
            for filename in filenames:
                title, ext = os.path.splitext(filename)  # drop .doc/.docx
                # Skip non-Word files and Word's "~$" temporary lock files
                if ext.lower() not in ('.doc', '.docx') or filename.startswith('~$'):
                    continue
                # Join with parent (not rootpath) so that files in sub-folders
                # resolve correctly; each PDF is saved next to its source
                a = Word_2_PDF(os.path.join(parent, filename), True)
                a.export_pdf(os.path.join(parent, title + '.pdf'))
                a.close()
        print('Conversion complete')
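The file-selection part of the conversion loop can be separated from the COM calls and tested on any platform. Below is a minimal sketch, assuming each PDF is written next to its source document; `iter_word_files` is a hypothetical helper, not part of the original script.

```python
import os


def iter_word_files(rootpath):
    """Yield (source_path, pdf_path) for every Word document under rootpath."""
    for parent, dirnames, filenames in os.walk(rootpath):
        for filename in filenames:
            stem, ext = os.path.splitext(filename)
            # Accept .doc and .docx only; "~$" marks Word's temporary lock files.
            if ext.lower() not in ('.doc', '.docx') or filename.startswith('~$'):
                continue
            # Join with `parent`, not rootpath, so that files in sub-folders
            # get the right path.
            yield (os.path.join(parent, filename),
                   os.path.join(parent, stem + '.pdf'))
```

Each pair can then be handed to the converter, for example `Word_2_PDF(src).export_pdf(dst)`. Using `os.path.splitext` rather than `split('.')[0]` keeps file names that contain extra dots intact.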


![Image](https://img-blog.csdnimg.cn/img_convert/7fd6852f736391e11d8720b1a8d68c12.png)


### Contract generation

    from os import listdir

    from docx import Document
    from openpyxl import load_workbook


    def replace_text(document, old_text, new_text):
        """Replace old_text with new_text in every paragraph and table cell."""
        # Body paragraphs: replace run by run
        for paragraph in document.paragraphs:
            for run in paragraph.runs:
                if old_text in run.text:
                    run.text = run.text.replace(old_text, new_text)
        # Tables: replace in every cell
        for table in document.tables:
            for row in table.rows:
                for cell in row.cells:
                    if old_text in cell.text:
                        cell.text = cell.text.replace(old_text, new_text)


    # Find the Word template and the Excel data file in the current folder
    docx_name = xlsx_name = None
    for file in listdir():
        print(file, 'listdir')
        if '模板.docx' in file:  # "模板" = "template"
            docx_name = file
        if '信息.xlsx' in file:  # "信息" = "information"
            xlsx_name = file

    # Read the Excel data from the first sheet
    wb = load_workbook(xlsx_name)
    sheetx = wb[wb.sheetnames[0]]

    # Column whose value names each generated file
    filename_pos = 1

    # One contract per data row; the data starts at row 3
    for row in range(3, sheetx.max_row + 1):
        # openpyxl may report empty trailing cells, so skip rows whose
        # first column is empty
        if sheetx.cell(row=row, column=1).value is None:
            continue
        document = Document(docx_name)
        for col in range(1, sheetx.max_column + 1):
            # Row 1 holds the placeholder text to look for in the template
            old_text = sheetx.cell(row=1, column=col).value
            if old_text is None:
                continue
            # The current row holds the new value for that placeholder
            new_text = sheetx.cell(row=row, column=col).value
            replace_text(document, str(old_text), str(new_text))
        # Name the file after the value in column filename_pos of this row
        filename = str(sheetx.cell(row=row, column=filename_pos).value)
        document.save('%s.docx' % filename)
    print('Contracts generated!')
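The loop assumes a particular sheet layout: row 1 holds the placeholder strings that appear in the template, and each row from row 3 onward holds the values for one contract. The following pure-Python sketch shows that mapping, with invented placeholders and data and a plain string standing in for the Word template.

```python
def build_replacements(header_row, data_row):
    """Pair each placeholder in the header row with the value in the same column."""
    return {str(old): str(new) for old, new in zip(header_row, data_row)}


def fill_template(template, replacements):
    """Apply every placeholder -> value substitution to a text template."""
    for old, new in replacements.items():
        template = template.replace(old, new)
    return template


# Hypothetical sheet contents: placeholders in the header, one contract per row.
header = ['#NO#', '#PARTY_A#', '#AMOUNT#']
rows = [
    ['C-001', 'Acme Ltd', 12000],
    ['C-002', 'Globex', 8500],
]
template = 'Contract #NO#: #PARTY_A# agrees to pay #AMOUNT#.'

filename_pos = 1  # 1-based column used to name each output file, as in the script
for row in rows:
    text = fill_template(template, build_replacements(header, row))
    print('%s.docx -> %s' % (row[filename_pos - 1], text))
```

Placeholders should be distinctive (the `#...#` delimiters here are just one option), since a placeholder that is a substring of another, or of ordinary contract text, would be replaced in the wrong places.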


![Image](https://img-blog.csdnimg.cn/img_convert/d2ccf470bf1c8257c0c6eb1e5eed43ee.png)
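One caveat with run-by-run replacement: Word often splits a visually continuous piece of text into several runs, so a placeholder that spans two runs is never matched. One common workaround is to join the run texts, replace, and write the result back into the first run, at the cost of losing per-run formatting. In the sketch below, plain strings stand in for python-docx runs, and `replace_across_runs` is a hypothetical helper.

```python
def replace_across_runs(run_texts, old_text, new_text):
    """Replace old_text even when it is split across adjacent runs.

    Returns a new list of the same length. If a replacement was made, the
    whole paragraph text goes into the first run and the others are emptied.
    """
    joined = ''.join(run_texts)
    if old_text not in joined:
        return list(run_texts)
    replaced = joined.replace(old_text, new_text)
    return [replaced] + [''] * (len(run_texts) - 1)


# A placeholder split across two runs, as Word sometimes stores it.
runs = ['Party A: #PAR', 'TY_A#', ' (buyer)']

# Run-by-run replacement changes nothing: no single run holds the placeholder.
print([r.replace('#PARTY_A#', 'Acme Ltd') for r in runs])
# Joining first lets the placeholder match.
print(replace_across_runs(runs, '#PARTY_A#', 'Acme Ltd'))
```

With python-docx the same idea would be applied to `paragraph.runs` by assigning to each `run.text`. Typing each placeholder into the template in one go, without editing it afterwards, also makes split runs less likely.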

