1. module --- codecs
用于处理unicode文件
2. BOM ---- byte-order Marker
使用Unicode character U+FEFF来表示
(for example:
“\fe \ff” for UTF-16 big-end "\ff \fe" for UTF-16 little-end)
3. module ---- unicodedata
the Unicode Character Database (UCD) which defines character properties for all Unicode characters