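pandas data types
Series: a one-dimensional array
Three ways to create a Series (from a list, from a list with an explicit index, from a dict)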
import pandas as pd
import numpy as np
s1 = pd.Series([1,'a',43,13,68])
s1
0 1
1 a
2 43
3 13
4 68
dtype: object
s1.index
RangeIndex(start=0, stop=5, step=1)
s1.values
array([1, 'a', 43, 13, 68], dtype=object)
s1[2]
s1[[2,3]]
2 43
3 13
dtype: object
type(s1[2])
int
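The `int` result above follows from the `object` dtype: a Series holding mixed values keeps each element as an ordinary Python object. A minimal sketch of the contrast with an all-integer Series (the names `mixed` and `numeric` are illustrative and not part of the original notes):

```python
import numpy as np
import pandas as pd

# Mixed values force the object dtype; elements stay plain Python objects.
mixed = pd.Series([1, 'a', 43, 13, 68])
# All-integer values get a NumPy integer dtype instead.
numeric = pd.Series([1, 4, 43, 13, 68])

print(mixed.dtype)                               # object
print(type(mixed.iloc[2]))                       # <class 'int'>
print(numeric.dtype.kind)                        # i (a signed integer dtype)
print(isinstance(numeric.iloc[2], np.integer))   # True
```

The exact integer width of `numeric` (for example `int64`) can depend on the platform, which is why the sketch checks only the dtype kind.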
s2 = pd.Series([1,'a',4,6,54],index=['d','c','a','b','e'])
s2
d 1
c a
a 4
b 6
e 54
dtype: object
s2.index
Index(['d', 'c', 'a', 'b', 'e'], dtype='object')
s2.values
array([1, 'a', 4, 6, 54], dtype=object)
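With a custom index, elements can be looked up either by label or by position. The following is a small sketch using the same `s2`; `.loc` and `.iloc` are the explicit pandas accessors for the two styles:

```python
import pandas as pd

s2 = pd.Series([1, 'a', 4, 6, 54], index=['d', 'c', 'a', 'b', 'e'])

# Label-based lookup: uses the custom index values.
print(s2.loc['a'])          # 4
print(s2.loc[['a', 'b']])   # Series with labels a and b

# Position-based lookup: ignores the labels and counts from 0.
print(s2.iloc[0])           # 1 (the element labeled 'd')
```

Using `.loc` and `.iloc` makes the intent explicit, which avoids ambiguity when index labels are themselves integers.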
data = {'lin':35000,'deng':13000,'huang':20000,'wang':14000}
s3 = pd.Series(data)
s3
lin 35000
deng 13000
huang 20000
wang 14000
dtype: int64
s3.index
Index(['lin', 'deng', 'huang', 'wang'], dtype='object')
s3.values
array([35000, 13000, 20000, 14000], dtype=int64)
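Because the dict keys become the index, values can be read back by key. As a hedged sketch, the dict can also be combined with an explicit `index` argument to select and reorder keys; `s4` and the key `'zhao'` are made up here for illustration and do not appear in the original notes:

```python
import pandas as pd

data = {'lin': 35000, 'deng': 13000, 'huang': 20000, 'wang': 14000}

# Keys become the index, so values can be looked up by key.
s3 = pd.Series(data)
print(s3['huang'])   # 20000

# An explicit index selects and reorders keys; 'zhao' is a made-up key
# that is absent from the dict, so its value is NaN.
s4 = pd.Series(data, index=['wang', 'lin', 'zhao'])
print(s4)
```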
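DataFrame: a two-dimensional array
Ways to create a DataFrame
1. Read from a plain-text file, an Excel sheet, or a MySQL database
2. Build from a dict of sequences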
data = {
    'name': ['lin', 'hong', 'lian'],
    'age': [18, 16, 17],
    'course': [90, 87, 88]
}
df1 = pd.DataFrame(data)
df1
|   | name | age | course |
|---|------|-----|--------|
| 0 | lin  | 18  | 90     |
| 1 | hong | 16  | 87     |
| 2 | lian | 17  | 88     |
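The first creation method in the list above (reading from a file or database) is not demonstrated in these notes. A minimal sketch under stated assumptions: the CSV text is held in memory with `io.StringIO` instead of a file on disk, and an in-memory SQLite database stands in for MySQL, so the table name `students` and both data sources are hypothetical. For Excel sheets, `pd.read_excel` plays the same role but needs an Excel engine such as openpyxl installed, so it is left out here.

```python
import io
import sqlite3

import pandas as pd

# Stand-in for a plain-text file on disk; pd.read_csv also accepts a path.
csv_text = "name,age,course\nlin,18,90\nhong,16,87\nlian,17,88\n"
df_csv = pd.read_csv(io.StringIO(csv_text))
print(df_csv)

# Stand-in for a database: an in-memory SQLite connection (not MySQL).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE students (name TEXT, age INTEGER, course INTEGER)")
conn.executemany(
    "INSERT INTO students VALUES (?, ?, ?)",
    [("lin", 18, 90), ("hong", 16, 87), ("lian", 17, 88)],
)
df_sql = pd.read_sql_query("SELECT * FROM students", conn)
conn.close()
print(df_sql)
```

Both calls return a DataFrame with the same rows and columns as `df1`.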
df1.dtypes
name object
age int64
course int64
dtype: object
df1.index
RangeIndex(start=0, stop=3, step=1)
df1.columns
Index(['name', 'age', 'course'], dtype='object')
df1['name']
0 lin
1 hong
2 lian
Name: name, dtype: object
df1[['name','age']]
|   | name | age |
|---|------|-----|
| 0 | lin  | 18  |
| 1 | hong | 16  |
| 2 | lian | 17  |
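The two column selections above return different types: a single label gives a Series, while a list of labels gives a DataFrame. A short sketch that checks this (the names `single` and `double` are illustrative):

```python
import pandas as pd

df1 = pd.DataFrame({
    'name': ['lin', 'hong', 'lian'],
    'age': [18, 16, 17],
    'course': [90, 87, 88],
})

single = df1['name']     # one label -> Series
double = df1[['name']]   # list of labels -> DataFrame, even with one column

print(type(single).__name__)   # Series
print(type(double).__name__)   # DataFrame
```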
df1.loc[1]
name hong
age 16
course 87
Name: 1, dtype: object
df1.loc[1:2]
|   | name | age | course |
|---|------|-----|--------|
| 1 | hong | 16  | 87     |
| 2 | lian | 17  | 88     |
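Note that `df1.loc[1:2]` returns two rows: label slices with `loc` include both endpoints, unlike ordinary Python slices. A small sketch contrasting this with position-based `iloc` (the names `by_label` and `by_position` are illustrative):

```python
import pandas as pd

df1 = pd.DataFrame({
    'name': ['lin', 'hong', 'lian'],
    'age': [18, 16, 17],
    'course': [90, 87, 88],
})

# Label slice: both endpoints are included, so rows 1 and 2 are returned.
by_label = df1.loc[1:2]
# Position slice: the stop is excluded, so only row 1 is returned.
by_position = df1.iloc[1:2]

print(len(by_label))      # 2
print(len(by_position))   # 1

# Rows and columns can be selected together with loc.
print(df1.loc[1:2, ['name', 'course']])
```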