import pandas as pd

df1 = pd.read_csv('/Users/macbookair/Desktop/毕设/yelp/review.csv')
This raises an error:
ParserError: Error tokenizing data. C error: Expected 10 fields in line 42511, saw 14
I then changed the parameters, setting delimiter to '\t':
data1=pd.read_csv('/Users/macbookair/Desktop/毕设/yelp/review.csv', header=0, delimiter='\t')
In [7]: data1.info()
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 114661 entries, 0 to 114660
Data columns (total 1 columns):
review_id,user_id,business_id,stars,date,text,useful,funny,cool,type 114660 non-null object
dtypes: object(1)
memory usage: 895.9+ KB
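
The info() output above shows a single column whose name is the entire header line. That suggests the file really is comma-separated, and that delimiter='\t' only hid the error instead of fixing it, since each line is now read as one field. Below is a minimal sketch of one alternative: keep the comma delimiter and skip rows with the wrong number of fields. It assumes pandas 1.3 or later (where the on_bad_lines parameter is available). The three-column sample and the variable names are made up for illustration and stand in for review.csv.

```python
import io

import pandas as pd

# Hypothetical miniature stand-in for review.csv. The third line has an
# unquoted comma inside the text field, so it has 4 fields instead of 3.
sample = (
    "review_id,stars,text\n"
    "r1,5,Great food\n"
    "r2,3,Okay, but slow service\n"
    'r3,4,"Nice, friendly staff"\n'
)

# 1) Default parsing stops at the malformed row with a ParserError,
#    the same kind of error as in the note above.
try:
    pd.read_csv(io.StringIO(sample))
    default_failed = False
except pd.errors.ParserError as exc:
    default_failed = True
    print("ParserError:", exc)

# 2) With delimiter='\t' there are no tabs to split on, so every line
#    becomes a single field and the frame has exactly one column.
tab_df = pd.read_csv(io.StringIO(sample), header=0, delimiter="\t")
print(tab_df.shape[1])

# 3) Keep the comma delimiter and skip rows whose field count is wrong
#    (assumes pandas >= 1.3). Properly quoted commas are still parsed.
skip_df = pd.read_csv(io.StringIO(sample), on_bad_lines="skip")
print(skip_df)
```

Skipping rows discards data, so it may be worth comparing the row count against the file's line count to see how much is lost. If the extra fields come from unquoted commas in the text column, re-exporting the file with proper quoting is probably the cleaner fix.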