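NumPy handles numerical data well,
but that alone is not enough.
Besides numbers, real-world data also includes strings and time series (timestamps).
pandas is built on NumPy, so in addition to numerical data it can also handle strings, time series (timestamps), and more.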
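As a rough illustration of the string and timestamp handling mentioned above, here is a minimal sketch. The sample names and dates are made up for the example; `.str`, `pd.to_datetime`, and `.dt` are standard pandas accessors and functions.

```python
import pandas as pd

# String data: the .str accessor applies string methods element-wise.
names = pd.Series(["alice", "bob", "carol"])  # hypothetical sample data
upper_names = names.str.upper()
print(upper_names.tolist())

# Timestamp data: pd.to_datetime parses date strings into datetime values,
# and the .dt accessor exposes fields such as the month.
dates = pd.to_datetime(pd.Series(["2024-01-01", "2024-01-15", "2024-02-01"]))
months = dates.dt.month
print(months.tolist())
```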
Series: one-dimensional, with an index (row labels)
DataFrame: two-dimensional, with an index (row labels) and columns (column labels)
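The difference between the two structures can be sketched as follows. The labels and values here are assumptions chosen only for illustration.

```python
import pandas as pd

# A Series has one axis: the index (row labels).
s = pd.Series([10, 20, 30], index=["x", "y", "z"])
print(s.ndim, list(s.index))

# A DataFrame has two axes: the index (row labels) and columns (column labels).
df = pd.DataFrame(
    {"name": ["alice", "bob"], "age": [30, 25]},  # hypothetical sample data
    index=["r1", "r2"],
)
print(df.ndim, list(df.index), list(df.columns))
```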
Creating a Series
import pandas as pd
import numpy as np
data = pd.Series(np.array([3, 4, 5, 1, 2]), index=['b', 'c', 'd', 'e', 'f'])  # one-dimensional
print(data)
print(type(data))
b    3
c    4
d    5
e    1
f    2
dtype: int32
<class 'pandas.core.series.Series'>
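Once a Series exists, its labels and values can be read back separately, and single elements can be looked up by label or by position. A minimal sketch using the same data as above (the variable names other than `data` are introduced here for illustration):

```python
import numpy as np
import pandas as pd

data = pd.Series(np.array([3, 4, 5, 1, 2]), index=["b", "c", "d", "e", "f"])

labels = list(data.index)    # the row labels
values = data.values         # the underlying NumPy array
by_label = data["d"]         # look up by index label
by_position = data.iloc[0]   # look up by integer position

print(labels)
print(values)
print(by_label, by_position)
```

The integer dtype reported for such a Series (`int32` in the output above) depends on the platform and NumPy version, so `int64` may appear instead on other systems.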
Creating a Series from a dict
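# Create a Series from a dict
import string  # needed for string.ascii_lowercase

mydict = {string.ascii_lowercase[i]: i for i in range(10)}
print(mydict)
data2 = pd.Series(mydict)
print(data2)  # dict keys become the index, dict values become the values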
{'a': 0, 'b': 1, 'c': 2, 'd': 3, 'e': 4, 'f': 5, 'g': 6, 'h': 7, 'i': 8, 'j':