数据:
11,"American President, The (1995)",Comedy|Drama|Romance
分割后理想情况:
11
"American President, The (1995)"
Comedy|Drama|Romance
实际单纯用split分割后
11
"American President
The (1995)"
Comedy|Drama|Romance
怎么解决引号内不分割?
方法如下:
fp = open(item_file, encoding='UTF-8')
for line in fp:
lex = shlex.shlex(line)
lex.whitespace=','
lex.quotes='"'
lex.whitespace_split = True
itemlist=list(lex)
if len(itemlist) < 3:
continue
[itemid, title, genres] = itemlist[0],itemlist[1],itemlist[2]
['11','"American President, The (1995)"', 'Comedy|Drama|Romance\n']