从零开始数据分析Kaggle项目—泰坦尼克号(三)
本节主要内容如何利用Pandas进行排序、算术计算以及函数describe()的使用。
# title: "Kaggle项目泰坦尼克号 1__1.3"
# author: "小鱼"
# date: "2021-12-15"
#加载所需的库
import numpy as np
import pandas as pd
df = pd.read_csv("train.csv")
df.head()
PassengerId | Survived | Pclass | Name | Sex | Age | SibSp | Parch | Ticket | Fare | Cabin | Embarked | |
---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 1 | 0 | 3 | Braund, Mr. Owen Harris | male | 22.0 | 1 | 0 | A/5 21171 | 7.2500 | NaN | S |
1 | 2 | 1 | 1 | Cumings, Mrs. John Bradley (Florence Briggs Th... | female | 38.0 | 1 | 0 | PC 17599 | 71.2833 | C85 | C |
2 | 3 | 1 | 3 | Heikkinen, Miss. Laina | female | 26.0 | 0 | 0 | STON/O2. 3101282 | 7.9250 | NaN | S |
3 | 4 | 1 | 1 | Futrelle, Mrs. Jacques Heath (Lily May Peel) | female | 35.0 | 1 | 0 | 113803 | 53.1000 | C123 | S |
4 | 5 | 0 | 3 | Allen, Mr. William Henry | male | 35.0 | 0 | 0 | 373450 | 8.0500 | NaN | S |
#创建ataFrame
df1 = pd.DataFrame(np.arange(8).reshape((2, 4)),
index=['2', '1'],
columns=['a', 'b', &#