Converting garbled Chinese strings in a list to UTF-8
http://www.newsmth.net/nForum/#!article/Python/38863
First decode the byte strings to unicode, then encode them as UTF-8.
# Python 2: each item in data is a GB18030-encoded byte string
feature_names = []
data = ['\xb9\xd8\xbc\xfc\xb4\xca_\xb4\xb4\xd2\xb5', '\xb9\xd8\xbc\xfc\xb4\xca_\xb3\xc9\xb7\xd6']
for i in data:
    # decode the GB18030 bytes to unicode, then re-encode as UTF-8
    utf8 = i.decode('gb18030').encode('utf-8')
    print utf8
    feature_names.append(utf8)
print feature_names
Output:
关键词_创业
关键词_成分
['\xe5\x85\xb3\xe9\x94\xae\xe8\xaf\x8d_\xe5\x88\x9b\xe4\xb8\x9a', '\xe5\x85\xb3\xe9\x94\xae\xe8\xaf\x8d_\xe6\x88\x90\xe5\x88\x86']
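
The snippet above is Python 2, where printing a list shows the escaped bytes of each str. As a rough sketch of the same decode-then-encode idea in Python 3, assuming the items arrive as GB18030-encoded bytes objects (the name utf8_bytes is only illustrative):

```python
# Python 3 sketch: the items are assumed to be GB18030-encoded bytes objects.
data = [
    b'\xb9\xd8\xbc\xfc\xb4\xca_\xb4\xb4\xd2\xb5',
    b'\xb9\xd8\xbc\xfc\xb4\xca_\xb3\xc9\xb7\xd6',
]

# Decoding yields str (Unicode text), which is usually what you keep in memory.
feature_names = [raw.decode('gb18030') for raw in data]

# Encode only where UTF-8 bytes are actually needed, e.g. writing a binary file.
utf8_bytes = [name.encode('utf-8') for name in feature_names]

print(feature_names)
```

In Python 3 the list prints as readable text because str is already Unicode, so the explicit UTF-8 encode step is only needed at I/O boundaries.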