第六章：文件系统-codecs:字符串编码和解码-非Unicode编码

最新推荐文章于 2023-11-14 11:40:04 发布

学习中的编程老菜鸟

最新推荐文章于 2023-11-14 11:40:04 发布

阅读量214

点赞数

分类专栏： Python标准库

Python标准库专栏收录该内容

819 篇文章 19 订阅

订阅专栏

6_10_6_非Unicode编码
尽管之前多大多数例子都使用Unicode编码，但实际上codecs还可以用于很多其他数据转换。例如，Python包含了处理base-64,bzip2,ROT-13,ZIP和其他数据格式的codecs。

import codecs
import io

buffer = io.StringIO()
stream = codecs.getwriter('rot_13')(buffer)

text = 'abcdefghijklmnopqrstuvwxyz'

stream.write(text)
stream.flush()

print('Original:',text)
print('ROT-13  :',buffer.getvalue())

如果转换可以被表述为有单个输入参数的函数，并且返回一个字节或Unicode串，那么这样的转换都可以注册为一个codec。对于“rot_13”codec，输入应当是一个Unicode串，输出也是一个Unicode串。
运行结果：
在这里插入图片描述
使用codecs包装一个数据流，可以提供比直接使用zlib更简单的接口。

import codecs
import io

from codecs_to_hex import to_hex

buffer = io.BytesIO()
stream = codecs.getwriter('zlib')(buffer)

text = b'abcdefghijklmnopqrstuvwxyz\n' * 50

stream.write(text)
stream.flush()

print('Original length :',len(text))
compressed_data = buffer.getvalue()
print('ZIP compressed  :',len(compressed_data))

buffer = io.BytesIO(compressed_data)
stream = codecs.getreader('zlib')(buffer)

first_line = stream.readline()
print('Read first line :',repr(first_line))

uncompressed_data = first_line + stream.read()
print('Uncompressed    :',len(uncompressed_data))
print('Same            :',text == uncompressed_data)