读取文件时报错Unicode编码错误,一般这种情况较多是在国标码(GBK)和utf8之间选择出现了问题。
这里是官方提供的LJSPEECH文件,只读,无法修改,于是copy一份,使用自己写的文件。
改法一:
# errors = 'ignore'
with open(self._metadata_path, "r", newline="", errors='ignore') as metadata:
flist = csv.reader(metadata, delimiter="|", quoting=csv.QUOTE_NONE)
self._flist = list(flist)
改法二:
# encoding = 'utf-8'
with open(self._metadata_path, "r", newline="", encoding='utf-8') as metadata:
flist = csv.reader(metadata, delimiter="|", quoting=csv.QUOTE_NONE)
self._flist = list(flist)