Introduction
Python can read and write Excel files with the following two modules:
- xlrd reads the contents of Excel files.
- xlwt writes Excel files.
I. Setup
1. Installation
Install both modules before use:
$ pip install xlwt
$ pip install xlrd
2. Import
import xlrd
import xlwt
II. Using the xlrd module
1. Basic operations
1.1 Open an Excel file
workbook = xlrd.open_workbook('xx.xls')
1.2 Get information about the sheets
# Get the names of all sheets
sheet_name = workbook.sheet_names()
# Get a sheet name by index
sheet2_name = workbook.sheet_names()[1]
# Get a sheet by index (workbook.sheet_by_name(sheet2_name) does the same by name),
# then read its name, column count, and row count
sheet2 = workbook.sheet_by_index(1)
print('sheet2 name: {}\nsheet2 columns: {}\nsheet2 rows: {}'.format(sheet2.name, sheet2.ncols, sheet2.nrows))
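1.4 Read an entire row or column
# sheet1 is assumed to be the first sheet; the snippets use it without defining it
sheet1 = workbook.sheet_by_index(0)
# Get an entire row
print(sheet1.row_values(3))
# Get an entire column
print(sheet1.col_values(3))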
1.5 Read a specific cell
# Get the cell in row 2, column 1 (indices are 0-based, so this is (1, 0))
print(sheet1.cell(1, 0).value)
print(sheet1.cell_value(1, 0))
print(sheet1.row(1)[0].value)
2. Cell types
2.1 Read the data type of a cell
Note: ctype values are 0 empty, 1 string, 2 number, 3 date, 4 boolean, 5 error.
# Get the data type of a cell's content
print(sheet1.cell(1, 0).ctype)
Type list:
Type | Value |
---|---|
empty | 0 |
string | 1 |
number | 2 |
date | 3 |
boolean | 4 |
error | 5 |
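The table can be turned into a small lookup so that debug output shows names instead of bare integers. This is a minimal sketch based only on the six values listed above; the names CTYPE_NAMES and ctype_name are hypothetical and not part of xlrd.

```python
# Mapping taken from the type list above (ctype value -> type name).
CTYPE_NAMES = {
    0: 'empty',
    1: 'string',
    2: 'number',
    3: 'date',
    4: 'boolean',
    5: 'error',
}


def ctype_name(ctype):
    """Return a readable name for a ctype value (hypothetical helper)."""
    return CTYPE_NAMES.get(ctype, 'unknown ({})'.format(ctype))


print(ctype_name(2))  # number
print(ctype_name(3))  # date
```

With a real sheet, this could be called as ctype_name(sheet1.cell(1, 0).ctype).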
2.2 Read cells with special formats
- Date type
# Use xlrd's xldate_as_tuple to convert the value to a date
from datetime import datetime, date
if sheet1.cell(3, 6).ctype == 3:
    print(sheet1.cell(3, 6).value)  # 41463.0
    # datemode is an attribute of the workbook opened earlier
    date_value = xlrd.xldate_as_tuple(sheet1.cell(3, 6).value, workbook.datemode)
    print(date_value)  # (2013, 7, 8, 0, 0, 0)
    print(date(*date_value[:3]))  # 2013-07-08
    print(date(*date_value[:3]).strftime('%Y/%m/%d'))  # 2013/07/08
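To see why 41463.0 becomes 2013-07-08, the conversion can be sketched with the standard library alone. This is a simplified illustration, not xlrd's implementation: the name serial_to_datetime is hypothetical, datemode 0 is assumed to mean the 1900 date system and datemode 1 the 1904 system, and serials below 61 in the 1900 system are rejected because Excel counts a nonexistent 29 February 1900.

```python
from datetime import datetime, timedelta

# Epochs for the two Excel date systems (datemode 0 = 1900-based, 1 = 1904-based).
_EPOCHS = {0: datetime(1899, 12, 30), 1: datetime(1904, 1, 1)}


def serial_to_datetime(serial, datemode=0):
    """Convert an Excel serial date to a datetime (hypothetical helper)."""
    if datemode == 0 and serial < 61:
        # Excel treats 1900 as a leap year, so early serials are ambiguous.
        raise ValueError('serials below 61 are ambiguous in the 1900 date system')
    # The integer part counts days; the fraction is the time of day.
    # Rounding to whole seconds avoids float noise in the time part.
    return _EPOCHS[datemode] + timedelta(seconds=round(serial * 86400))


print(serial_to_datetime(41463.0).strftime('%Y/%m/%d'))  # 2013/07/08
```

In real code, xlrd.xldate_as_tuple with workbook.datemode remains the safer choice, since it takes the date system from the file itself.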
- Number type
if sheet1.cell(3, 5).ctype == 2:
    print(sheet1.cell(3, 5).value)  # 133111.0
    num_value = int(sheet1.cell(3, 5).value)  # Convert to an integer
    print(num_value)  # 133111
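One caveat with the int() call: xlrd returns numeric cells as floats, and int() silently truncates any fractional part. A minimal sketch of a more careful conversion follows; number_value is a hypothetical helper, not an xlrd function.

```python
def number_value(value):
    """Return an int for whole-number floats, otherwise the value unchanged.

    Hypothetical helper: a bare int() call would turn 2.7 into 2,
    so only floats without a fractional part are converted.
    """
    if isinstance(value, float) and value.is_integer():
        return int(value)
    return value


print(number_value(133111.0))  # 133111
print(number_value(2.7))       # 2.7
```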
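3. Read the contents of merged cells
3.1 Direct reading
This uses the merged_cells attribute. Each entry has four values, (row, row_range, col, col_range), where [row, row_range) is a half-open interval (row is included, row_range is not). The same applies to col, and indices start at 0. For example:
- (0, 1, 0, 8) means columns 1 through 8 of row 1 are merged.
- (2, 6, 0, 1) means rows 3 through 6 of column 1 are merged.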
# Pass formatting_info=True (the default is False) when opening the file; otherwise merged_cells may come back empty.
print(sheet1.merged_cells)  # Result: [(0, 1, 0, 8), (2, 6, 0, 1)]
# Read the content of each of the two merged ranges:
print(sheet1.cell(0, 0).value)
print(sheet1.cell_value(2, 0))
Rule: read the cell at the low row and col indices returned by merged_cells.
3.2 A simpler way
merge_value = []
for (row, row_range, col, col_range) in sheet1.merged_cells:
    merge_value.append((row, col))
print(merge_value)  # [(0, 0), (2, 0)]
for v in merge_value:
    print(sheet1.cell(v[0], v[1]).value)
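The same rule can be applied in the other direction: given any cell, find the top-left cell of the merged range that contains it, since that is the cell holding the value. The sketch below works on plain tuples in the merged_cells layout, so it runs without a workbook. The name merged_anchor is hypothetical and not part of xlrd.

```python
def merged_anchor(merged_cells, row, col):
    """Return the (row, col) that holds the value for the given cell.

    merged_cells is a list of (row_lo, row_hi, col_lo, col_hi) tuples
    with half-open ranges. Hypothetical helper, not an xlrd function.
    """
    for row_lo, row_hi, col_lo, col_hi in merged_cells:
        if row_lo <= row < row_hi and col_lo <= col < col_hi:
            return (row_lo, col_lo)
    return (row, col)  # the cell is not part of any merged range


merged = [(0, 1, 0, 8), (2, 6, 0, 1)]  # sample data from the example above
print(merged_anchor(merged, 4, 0))  # (2, 0)
print(merged_anchor(merged, 0, 5))  # (0, 0)
print(merged_anchor(merged, 1, 1))  # (1, 1)
```

With a real sheet, the result can be passed straight on, for example sheet1.cell_value(*merged_anchor(sheet1.merged_cells, 4, 0)).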
III. Using the xlwt module
1. Basic operations
1.1 Create a Workbook
Create an Excel file and specify UTF-8 as the encoding:
workbook = xlwt.Workbook(encoding = 'utf-8')
1.2 Create a Worksheet
worksheet = workbook.add_sheet('My Worksheet')
1.3 Write to Excel
# Arguments: row, column, value
worksheet.write(1, 0, label='this is test')
1.4 Save
workbook.save('Excel_test.xls')
Running the code creates Excel_test.xls in the current directory.
2. Styles
2.1 Set a cell style
style = xlwt.XFStyle()  # Initialize the style
font = xlwt.Font()  # Create a font for the style
font.name = 'Times New Roman'
font.bold = True  # Bold
font.underline = True  # Underline
font.italic = True  # Italic
style.font = font  # Attach the font to the style
# Write without a style
worksheet.write(0, 0, 'Unformatted value')
# Write with a style
worksheet.write(1, 0, 'Formatted value', style)
2.2 Set the column width
worksheet.col(0).width = 3333  # Units of 1/256 of a character width, so about 13 characters
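A value such as 3333 is easier to choose when it is derived from a character count. The sketch below assumes that xlwt measures column width in units of 1/256 of the width of the character "0" in the default font. The helper names and the two-character padding are hypothetical choices for illustration.

```python
def chars_to_width(num_chars):
    """Convert a width in characters to column-width units (1/256 of a character)."""
    return int(num_chars * 256)


def fit_width(values, padding=2):
    """Estimate a width that fits the longest value plus some padding.

    Hypothetical helper; the default padding of 2 characters is arbitrary.
    """
    longest = max((len(str(v)) for v in values), default=0)
    return chars_to_width(longest + padding)


print(chars_to_width(13))                 # 3328
print(fit_width(['id', 'description']))   # 3328
```

The result can then be assigned as in the line above, for example worksheet.col(0).width = fit_width(column_values), where column_values stands for whatever list of cell values is being written.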
2.3 Set a date format
import datetime  # needed for datetime.datetime.now() below
style = xlwt.XFStyle()
# Other options: D-MMM-YY, D-MMM, MMM-YY, h:mm, h:mm:ss, M/D/YY h:mm, mm:ss, [h]:mm:ss, mm:ss.0
style.num_format_str = 'M/D/YY'
worksheet.write(0, 0, datetime.datetime.now(), style)
2.4 Set cell alignment
alignment = xlwt.Alignment() # Create Alignment
alignment.horz = xlwt.Alignment.HORZ_CENTER # May be: HORZ_GENERAL, HORZ_LEFT, HORZ_CENTER, HORZ_RIGHT, HORZ_FILLED, HORZ_JUSTIFIED, HORZ_CENTER_ACROSS_SEL, HORZ_DISTRIBUTED
alignment.vert = xlwt.Alignment.VERT_CENTER # May be: VERT_TOP, VERT_CENTER, VERT_BOTTOM, VERT_JUSTIFIED, VERT_DISTRIBUTED
style = xlwt.XFStyle() # Create Style
style.alignment = alignment # Add Alignment to Style
worksheet.write(0, 0, 'Cell Contents', style)
2.5 Set cell borders
# Please note: While I was able to find these constants within the source code, on my system (using LibreOffice,) I was only presented with a solid line, varying from thin to thick; no dotted or dashed lines.
borders = xlwt.Borders() # Create Borders
# DASHED = dashed line, NO_LINE = no border, THIN = thin solid line
borders.left = xlwt.Borders.DASHED
# May be: NO_LINE, THIN, MEDIUM, DASHED, DOTTED, THICK, DOUBLE, HAIR, MEDIUM_DASHED, THIN_DASH_DOTTED, MEDIUM_DASH_DOTTED, THIN_DASH_DOT_DOTTED, MEDIUM_DASH_DOT_DOTTED, SLANTED_MEDIUM_DASH_DOTTED, or 0x00 through 0x0D.
borders.right = xlwt.Borders.DASHED
borders.top = xlwt.Borders.DASHED
borders.bottom = xlwt.Borders.DASHED
borders.left_colour = 0x40
borders.right_colour = 0x40
borders.top_colour = 0x40
borders.bottom_colour = 0x40
style = xlwt.XFStyle() # Create Style
style.borders = borders # Add Borders to Style
worksheet.write(0, 0, 'Cell Contents', style)
2.6 Set a cell background color
# Create the Pattern
pattern = xlwt.Pattern()
# May be: NO_PATTERN, SOLID_PATTERN, or 0x00 through 0x12
pattern.pattern = xlwt.Pattern.SOLID_PATTERN
# May be: 8 through 63. 0 = Black, 1 = White, 2 = Red, 3 = Green, 4 = Blue, 5 = Yellow, 6 = Magenta, 7 = Cyan, 16 = Maroon, 17 = Dark Green, 18 = Dark Blue, 19 = Dark Yellow (almost brown), 20 = Dark Magenta, 21 = Teal, 22 = Light Gray, 23 = Dark Gray, the list goes on...
pattern.pattern_fore_colour = 5
style = xlwt.XFStyle()
# Add Pattern to Style
style.pattern = pattern
worksheet.write(0, 0, 'Cell Contents', style)
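3. Add formulas
worksheet.write(0, 0, 5)  # A1, input for the formulas below
worksheet.write(0, 1, 2)  # B1, input for the formulas below
# Should output "10" (A1[5] * B1[2])
worksheet.write(1, 0, xlwt.Formula('A1*B1'))
# Should output "7" (A1[5] + B1[2])
worksheet.write(1, 1, xlwt.Formula('SUM(A1,B1)'))
# Add a hyperlink (written to A3 so that it does not overwrite A1)
worksheet.write(2, 0, xlwt.Formula('HYPERLINK("http://www.google.com";"Google")'))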
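Formulas refer to cells in A1 notation, while worksheet.write takes 0-based (row, column) indices, which makes the two easy to mix up. The conversion can be sketched as follows. cell_ref is a hypothetical helper written for this illustration.

```python
def cell_ref(row, col):
    """Convert 0-based (row, col) indices to an A1-style reference."""
    letters = ''
    n = col + 1
    while n:
        # Column letters are a base-26 system without a zero digit.
        n, rem = divmod(n - 1, 26)
        letters = chr(ord('A') + rem) + letters
    return '{}{}'.format(letters, row + 1)


print(cell_ref(0, 0))   # A1
print(cell_ref(1, 1))   # B2
print(cell_ref(0, 26))  # AA1

# Build the text of a formula from indices instead of typing references by hand.
print('SUM({},{})'.format(cell_ref(0, 0), cell_ref(0, 1)))  # SUM(A1,B1)
```

The resulting string can be passed to xlwt.Formula in the same way as the literal formulas above.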
4. Merge columns and rows
# Merges row 0's columns 0 through 3.
worksheet.write_merge(0, 0, 0, 3, 'First Merge')
font = xlwt.Font() # Create Font
font.bold = True # Set font to Bold
style = xlwt.XFStyle() # Create Style
style.font = font # Add Bold Font to Style
# Merges row 1 through 2's columns 0 through 3.
worksheet.write_merge(1, 2, 0, 3, 'Second Merge', style)
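The arguments of write_merge are inclusive (row 1 through 2, columns 0 through 3), whereas the merged_cells tuples read by xlrd use half-open ranges. When copying merged ranges from a file that was read into one being written, the bounds need adjusting. This is a minimal sketch with a hypothetical helper name.

```python
def to_write_merge_args(merged_cell):
    """Convert an xlrd merged_cells tuple to write_merge's (r1, r2, c1, c2).

    xlrd ranges are half-open, so the upper bounds are reduced by one
    to get the inclusive bounds used by write_merge. Hypothetical helper.
    """
    row_lo, row_hi, col_lo, col_hi = merged_cell
    return (row_lo, row_hi - 1, col_lo, col_hi - 1)


print(to_write_merge_args((0, 1, 0, 8)))  # (0, 0, 0, 7)
print(to_write_merge_args((2, 6, 0, 1)))  # (2, 5, 0, 0)
```

The converted tuple can be unpacked in front of the label, for example worksheet.write_merge(*to_write_merge_args((0, 1, 0, 8)), 'First Merge').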
Summary
With these two modules, you can quickly read and edit Excel files in Python.