【Common pandas issues】
import pandas as pd

data = pd.read_csv('out.csv', delimiter=',')

# Single brackets select one column as a Series.
datas = data[data.author_id == data.viewer_id]['author_id']
print(type(datas))  # <class 'pandas.core.series.Series'>

# Double brackets (a list of column names) return a DataFrame.
datas = data[data.author_id == data.viewer_id][['author_id']]
print(type(datas))  # <class 'pandas.core.frame.DataFrame'>

# unique() is a Series method; calling it on a DataFrame fails:
# datas = data[data.author_id == data.viewer_id][['author_id']].unique()
# print(datas)  # AttributeError: 'DataFrame' object has no attribute 'unique'

datas = data[data.author_id == data.viewer_id]['author_id'].unique()
print(type(datas))  # <class 'numpy.ndarray'>
print(datas)  # ['--' '7' '4']
print(type(datas.tolist()))  # <class 'list'>
print(datas.tolist())  # ['--', '7', '4']

# The ids here are strings, so sort_values orders them lexicographically.
c = pd.DataFrame(datas, columns=['id']).sort_values(by='id', ascending=True)
print(c)
c = pd.DataFrame(datas.tolist(), columns=['id']).sort_values(by='id', ascending=True)
print(c)
# A set is unordered; the sort afterwards makes the row order deterministic.
c = pd.DataFrame(set(datas.tolist()), columns=['id']).sort_values(by='id', ascending=True)
print(c)
c = pd.DataFrame(tuple(datas.tolist()), columns=['id']).sort_values(by='id', ascending=True)
print(c)
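Since out.csv is not included, here is a minimal sketch of the same steps on a small in-memory DataFrame. The sample values are invented for illustration; only the column names author_id and viewer_id come from the code above. It also shows drop_duplicates(), which is one way to deduplicate when you are holding a DataFrame rather than a Series.

```python
import numpy as np
import pandas as pd

# Hypothetical sample data standing in for out.csv (values are invented).
data = pd.DataFrame({
    "author_id": [7, 4, 7, 9, 4],
    "viewer_id": [7, 4, 7, 1, 2],
})

mask = data.author_id == data.viewer_id  # boolean Series, one entry per row

as_series = data[mask]["author_id"]      # single brackets -> Series
as_frame = data[mask][["author_id"]]     # double brackets -> DataFrame

uniq = as_series.unique()                # values in order of first appearance
uniq_list = uniq.tolist()                # plain Python list

# A DataFrame has no unique(); drop_duplicates() removes repeated rows instead.
dedup_frame = as_frame.drop_duplicates()

print(type(as_series).__name__, type(as_frame).__name__)
print(isinstance(uniq, np.ndarray))
print(uniq_list)
print(len(dedup_frame))
```

Which form to use depends on what comes next: unique() is convenient when you only need the values, while drop_duplicates() keeps the result as a DataFrame with its original index.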
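1. unique() is a Series method (a DataFrame has no unique()). For an ordinary NumPy-backed column the result is a numpy.ndarray, which can be converted to a list with tolist().
2. datas = data[data.author_id==data.viewer_id][['author_id']]: datas is a DataFrame.
3. datas = data[data.author_id==data.viewer_id]['author_id']: datas is a Series.
4. pandas can build a DataFrame from most common data structures:
ndarray: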
pd.DataFrame(datas,columns=['id'])
list:
pd.DataFrame(datas.tolist(),columns=['id'])
tuple:
pd.DataFrame(tuple(datas.tolist()),columns=['id'])
set:
pd.DataFrame(set(datas.tolist()),columns=['id'])
dict:
dic = {'id': [i + 1 for i in range(3)]}  # for a dict, the keys become the column names
pd.DataFrame(dic, columns=['id'])
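To check point 4 concretely, the sketch below builds the same single-column DataFrame from several containers and compares the results. The values are invented sample data. The set is sorted before construction, which is my own choice here because a set has no defined order.

```python
import numpy as np
import pandas as pd

values = [7, 4, 9]  # invented sample values

from_list = pd.DataFrame(values, columns=["id"])
from_tuple = pd.DataFrame(tuple(values), columns=["id"])
from_array = pd.DataFrame(np.array(values), columns=["id"])

# For a dict, the keys are the column names and the values are the column data.
from_dict = pd.DataFrame({"id": values})

# A set has no defined order, so sort it first to get reproducible rows.
from_set = pd.DataFrame(sorted(set(values)), columns=["id"])

print(from_list.equals(from_tuple))
print(from_list.equals(from_dict))
print(from_array["id"].tolist())
print(from_set["id"].tolist())
```

Note the difference in how columns= behaves: for list-like input it names the columns, whereas for a dict it selects among the existing keys, so a name that is not a key yields an empty column.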