python2.7 查找替换文件中的中文

最新推荐文章于 2024-09-04 11:20:05 发布

huycheaven

最新推荐文章于 2024-09-04 11:20:05 发布

阅读量4.4k

点赞数

分类专栏： python 文章标签： python乱码查找替换中文

本文链接：https://blog.csdn.net/huyuchaoheaven/article/details/81036072

版权

python 专栏收录该内容

1 篇文章 0 订阅

订阅专栏

Python中默认的编码格式是 ASCII 格式，在没修改编码格式时无法正确打印汉字，所以在读取中文时会报错。

解决方法为只要在文件开头加入 # -- coding: UTF-8 -- 或者 #coding=utf-8 就行了

注意：#coding=utf-8 的 = 号两边不要空格。

Python3.X 源码文件默认使用utf-8编码，所以可以正常解析中文，无需指定 UTF-8 编码。

另外：
批量查找替换文件中的中文

#中文前加u，表示unicode格式
old_str=u"百老汇影城"
name=u"四川太平洋"
print '模板中的字符串：',old_str

def test():
    cinemaNamePath='./../dist/pages/login/agreement/agreement.wxml'

    fileName = './agreement22222.wxml'

    f = file(cinemaNamePath)

    output = open(fileName, "w")

    for line in f:
	# 将line 解码成unicode 
        line = unicode(line,"utf-8")
        #在unicode中找unicode
        if line.find(old_str) > 0:
            print '找到了 字符串 %s',old_str
            line = line.replace(old_str, name)
            print '替换后的line = ',line
            #把替换后的line，写入到新文件。  将unicode编码成字符串，通过指定的编码(utf-8)
            output.write(line.encode('utf-8'))
        #print 'line = ',line    
    f.close()
    output.close()
#调用 test方法
test()