01pandas的dataFrame的创建
import pandas as pd
import numpy as np
t1 = pd.DataFrame(np.array(range(12)).reshape(3,4))
t1
t1 = pd.DataFrame(np.arange(12).reshape(3,4),index=list('abc'),columns=list('efgh'))
t1
d1 = {'name':["xiaoming","xiaohong"],"age":[20,32],"tel":[10086,10010]}
d1
{‘age’: [20, 32], ‘name’: [‘xiaoming’, ‘xiaohong’], ‘tel’: [10086, 10010]}
pd.DataFrame(d1)
t1 = pd.DataFrame(d1)
type(t1)
pandas.core.frame.DataFrame
多个字典变成DataFrame
d2 = [{'name':'xiaohong','age':31,'tel':10098},{'name':'xiaoyue','age':12,'tel':9999},{'name':'xiaohua','age':34,'tel':123456}]
d2
[{‘age’: 31, ‘name’: ‘xiaohong’, ‘tel’: 10098},
{‘age’: 12, ‘name’: ‘xiaoyue’, ‘tel’: 9999},
{‘age’: 34, ‘name’: ‘xiaohua’, ‘tel’: 123456}]
t2 = pd.DataFrame(d2)
t2
02Dataframe的描述信息
import numpy as np
import pandas as pd
t1 = [{'name':'xiaohong','age':32,'tel':10010},{'name':'xiaogang','tel':10000},{'name':'xiaowang','age':22}]
print(t1)
t2 = pd.DataFrame(t1)
t2
[{‘age’: 32, ‘tel’: 10010, ‘name’: ‘xiaohong’},
{‘tel’: 10000, ‘name’: ‘xiaogang’},
{‘age’: 22, ‘name’: ‘xiaowang’}]
t2.shape
(3, 3)
t2.index
RangeIndex(start=0, stop=3, step=1)
t2.columns
Index([‘age’, ‘name’, ‘tel’], dtype=‘object’)
t2.values
array([[32.0, ‘xiaohong’, 10010.0],
[nan, ‘xiaogang’, 10000.0],
[22.0, ‘xiaowang’, nan]], dtype=object)
t2.dtypes
age float64
name object
tel float64
dtype: object
t2.ndim
2
t2.head()
t2.tail(3)
t2.info()
<class ‘pandas.core.frame.DataFrame’>
RangeIndex: 3 entries, 0 to 2
Data columns (total 3 columns):
age 2 non-null float64
name 3 non-null object
tel 2 non-null float64
dtypes: float64(2), object(1)
memory usage: 152.0+ bytes
t2.describe()
按列排序:
t2.sort_values(by = 'age') #升序排列
t2.sort_values(by = 'age',ascending = False)#降序排列