1、写中文文件乱码
with io.open(code_log_path,"a+",encoding='utf-8') as f:
f.write(unicode("\xEF\xBB\xBF", "utf-8"))
f.write(u'{}\n'.format(ss))
2、Python将字符串格式的unicode编码转换成unicode编码,如:\u53eb\u6211,需要转换成中文时有两种方式:
(1).使用eval:
eval("u"+"\'"+unicodestr+"\'")
(2)使用decode:
str1 = '\u4f60\u597d'
print str1.decode('unicode_escape')
你好
unicodestr.decode('unicode_escape') # 将转义字符\u读取出来
3、按行读文件
with open('order.txt', 'r+') as f:
while True:
ss= f.readline().decode("GBK").encode("utf-8")
if not ss:
break
print ss,
4、一次读多行
with open('test3.txt', 'r+') as f:
while f:
ss= f.readline()
ss2= f.readline()
5、设置系统默认编码
import sys
reload(sys)
sys.setdefaultencoding('utf8')
6、超出utf-8
with open("./test.log","r+",encoding="ISO-8859-1")as f: