在合并两个csv表格时,表中有汉字,初始代码如下
data2013 = pd.read_csv(path1 + "-4.csv", encoding='utf-8', header=None, low_memory=False)
data2014 = pd.read_csv(path2 + "-4.csv", encoding='utf-8', header=None, low_memory=False)
mkdir('daily/lonely_2010-2019/' + path)
data2013.to_csv('daily/lonely_2010-2019/' + path + '/' + path + ".csv", encoding="utf_8", header=False, index=False,
mode='a')
data2014.to_csv('daily/lonely_2010-2019/' + path + '/' + path + ".csv", encoding="utf_8", header=False, index=False,
mode='a')
产生了错列和汉字乱码的问题
这种问题是因为编码格式问题,在改为
data2013 = pd.read_csv(path1 + "-4.csv", encoding='utf-8', header=None, low_memory=False)
data2014 = pd.read_csv(path2 + "-4.csv", encoding='utf-8', header=None, low_memory=False)
mkdir('daily/lonely_2010-2019/' + path)
data2013.to_csv('daily/lonely_2010-2019/' + path + '/' + path + ".csv", encoding="utf_8_sig", header=False, index=False,
mode='a')
data2014.to_csv('daily/lonely_2010-2019/' + path + '/' + path + ".csv", encoding="utf_8_sig", header=False, index=False,
mode='a')
可恢复正常。