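When matching the log entries in 1.log in text (str) mode, some files fail with a decoding error, and specifying utf-8 gives the same result. The fix is to handle both the regex and the file in binary mode.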
import re

# Text-mode version: reading fails on lines that contain undecodable bytes.
re_compile = re.compile(r"meter_data received.seq = (\d+). mr_typ = (\d+). mac = (\w{2}:\w{2}:\w{2}:\w{2}:\w{2}:\w{2}). len = \d+")
with open('1.log', 'r') as f:
    for line in f:
        line = line.strip()
        print(line)
        m_meter_data_receive = re_compile.search(line)
        if m_meter_data_receive:
            # group(1) is seq, group(3) is the MAC address
            print(m_meter_data_receive.group(1), m_meter_data_receive.group(3))
To switch to binary mode, only the following parts need to change:
re_compile = re.compile(rb"meter_data received.seq = (\d+). mr_typ = (\d+). mac = (\w{2}:\w{2}:\w{2}:\w{2}:\w{2}:\w{2}). len = \d+")
with open('cco_20200816141112_part0.log', 'rb') as f:
With these changes, the match succeeds.
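For reference, the binary-mode approach can be put together as one self-contained sketch. The helper name `iter_meter_data` is made up for illustration and is not part of the original script. In a bytes pattern `\d` and `\w` match ASCII characters only, so the captured groups can be decoded as ASCII before printing.

```python
import re

# Bytes pattern: the rb prefix keeps backslashes literal and makes the
# regex operate on bytes instead of str.
METER_DATA_RE = re.compile(
    rb"meter_data received.seq = (\d+). mr_typ = (\d+). "
    rb"mac = (\w{2}:\w{2}:\w{2}:\w{2}:\w{2}:\w{2}). len = \d+"
)


def iter_meter_data(path):
    """Yield (seq, mac) as str for every matching line in the log at `path`.

    Hypothetical helper for illustration only.
    """
    # 'rb' skips decoding entirely, so invalid bytes cannot raise an error.
    with open(path, "rb") as f:
        for line in f:
            m = METER_DATA_RE.search(line)
            if m:
                # Groups are bytes; \d and \w in a bytes pattern only match
                # ASCII, so decoding them as ASCII is safe.
                yield m.group(1).decode("ascii"), m.group(3).decode("ascii")
```

Looping over `iter_meter_data('1.log')` then gives the same seq and MAC values as the script above, as plain strings rather than `b'...'` literals.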