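I. Overview
This note covers:
1. How to create a DataFrame
2. Selecting the rows where a given column has a given value
3. Selecting rows by condition
II. Creating a sample DataFrame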
import pandas as pd
import numpy as np

# np.nan marks missing values (the np.NaN alias was removed in NumPy 2.0)
data = {"name": ['A', 'B', 'C', 'D', np.nan, 'F', 'G', 'G'],
        "score": [85.0, 99.0, np.nan, 94.0, 65.0, 96.0, 100.0, 99.0]}
df = pd.DataFrame(data)
df
  name  score
0    A   85.0
1    B   99.0
2    C    NaN
3    D   94.0
4  NaN   65.0
5    F   96.0
6    G  100.0
7    G   99.0
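Since the sample data deliberately contains missing values, it can help to check where they are before filtering. The following is a minimal sketch that rebuilds the same frame; the names `data`, `missing_per_column` and `rows_with_missing` are illustrative and not part of the original.

```python
import numpy as np
import pandas as pd

data = {"name": ['A', 'B', 'C', 'D', np.nan, 'F', 'G', 'G'],
        "score": [85.0, 99.0, np.nan, 94.0, 65.0, 96.0, 100.0, 99.0]}
df = pd.DataFrame(data)

# Count the missing entries in each column
missing_per_column = df.isna().sum()

# Keep only the rows that have at least one missing value
rows_with_missing = df[df.isna().any(axis=1)]
print(missing_per_column)
print(rows_with_missing)
```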
III. Selecting rows where a column has a given value
3.1 Strings
3.1.1 Rows where name is G
df[df['name'] == 'G']
  name  score
6    G  100.0
7    G   99.0
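The same boolean-mask pattern extends to several values at once through `Series.isin`. The sketch below assumes the sample frame from section II; `f_or_g` and `mask` are illustrative names. It also shows that comparing a missing name with `==` gives False, so row 4 never matches.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({
    "name": ['A', 'B', 'C', 'D', np.nan, 'F', 'G', 'G'],
    "score": [85.0, 99.0, np.nan, 94.0, 65.0, 96.0, 100.0, 99.0],
})

# Rows whose name is either 'F' or 'G'
f_or_g = df[df['name'].isin(['F', 'G'])]

# The boolean mask behind df[df['name'] == 'G'];
# the missing name in row 4 compares as False
mask = df['name'] == 'G'
print(f_or_g)
print(mask)
```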
3.2 Numbers
3.2.1 A specific value
Rows where the score is 99
df[df['score'] == 99]
  name  score
1    B   99.0
7    G   99.0
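Exact `==` works here because 99 is stored exactly as a float. For values that come out of floating-point arithmetic, a tolerance-based comparison is usually safer. The sketch below uses `numpy.isclose` with its default tolerances; the `values` series is a made-up illustration and not part of the original data.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({
    "name": ['A', 'B', 'C', 'D', np.nan, 'F', 'G', 'G'],
    "score": [85.0, 99.0, np.nan, 94.0, 65.0, 96.0, 100.0, 99.0],
})

# 0.1 + 0.2 is not exactly 0.3 in binary floating point
values = pd.Series([0.1 + 0.2, 0.3, 0.5])
exact = values[values == 0.3]                         # misses index 0
close = values[np.isclose(values.to_numpy(), 0.3)]    # matches index 0 and 1

# The same idea on the sample frame; NaN is never "close" by default
score_99 = df[np.isclose(df['score'].to_numpy(), 99)]
print(score_99)
```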
3.2.2 Selecting by condition
1. Rows with a score above 90
df[df['score'] > 90]
  name  score
1    B   99.0
3    D   94.0
5    F   96.0
6    G  100.0
7    G   99.0
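Two equivalent spellings of this filter may be worth knowing: `DataFrame.query`, which takes the condition as a string, and `.loc`, which can pick a column in the same step. This is a sketch against the sample frame; `above_90` and `names_above_90` are illustrative names.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({
    "name": ['A', 'B', 'C', 'D', np.nan, 'F', 'G', 'G'],
    "score": [85.0, 99.0, np.nan, 94.0, 65.0, 96.0, 100.0, 99.0],
})

# Same rows as df[df['score'] > 90], written as a query string
above_90 = df.query('score > 90')

# Filter the rows and keep only the name column in one step
names_above_90 = df.loc[df['score'] > 90, 'name']
print(above_90)
print(names_above_90)
```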
2. Rows with a score greater than 90 and less than 99
df[(df['score'] > 90) & (df['score'] < 99)]
  name  score
3    D   94.0
5    F   96.0
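The parentheses around each comparison are required, because `&` binds more tightly than `>` and `<` in Python. For a simple range, `Series.between` is a more compact alternative. The sketch below assumes pandas 1.3 or later, where `inclusive` accepts the string `'neither'`; `in_range` is an illustrative name.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({
    "name": ['A', 'B', 'C', 'D', np.nan, 'F', 'G', 'G'],
    "score": [85.0, 99.0, np.nan, 94.0, 65.0, 96.0, 100.0, 99.0],
})

# Strictly between 90 and 99: both endpoints are excluded
in_range = df[df['score'].between(90, 99, inclusive='neither')]
print(in_range)
```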
3. Rows with a score below 90 or above 99
df[(df['score'] < 90) | (df['score'] > 99)]
  name  score
0    A   85.0
4  NaN   65.0
6    G  100.0
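Row 2, whose score is missing, does not appear above because every comparison with NaN is False. That matters when the same filter is written as a negation: `~` turns those False entries into True, so the missing row comes back. A sketch of the difference, with illustrative variable names:

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({
    "name": ['A', 'B', 'C', 'D', np.nan, 'F', 'G', 'G'],
    "score": [85.0, 99.0, np.nan, 94.0, 65.0, 96.0, 100.0, 99.0],
})

# Explicit "or" of two comparisons: the NaN score in row 2 is dropped
outside = df[(df['score'] < 90) | (df['score'] > 99)]

# Negating "between 90 and 99" also selects the row with the NaN score
negated = df[~df['score'].between(90, 99)]

# Adding notna() makes the negated form match the explicit one
negated_no_nan = df[~df['score'].between(90, 99) & df['score'].notna()]
print(outside)
print(negated)
```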
3.3 Selecting rows by maximum or minimum value
Maximum
df[df['score'] == df['score'].max()]
  name  score
6    G  100.0
Minimum
df[df['score'] == df['score'].min()]
  name  score
4  NaN   65.0
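Comparing against `max()` or `min()` returns every row that ties for the extreme value. Two alternatives are sketched below under the same sample data: `idxmax` gives the index label of the first maximum, and `nlargest` / `nsmallest` with `keep='all'` keep ties while skipping missing scores. The variable names are illustrative.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({
    "name": ['A', 'B', 'C', 'D', np.nan, 'F', 'G', 'G'],
    "score": [85.0, 99.0, np.nan, 94.0, 65.0, 96.0, 100.0, 99.0],
})

# Index label of the first maximum; NaN is skipped by default
top_label = df['score'].idxmax()
top_row = df.loc[top_label]          # a single row, returned as a Series

# Highest and lowest scoring rows, keeping any ties
top_rows = df.nlargest(1, 'score', keep='all')
bottom_rows = df.nsmallest(1, 'score', keep='all')
print(top_rows)
print(bottom_rows)
```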