1. 缺失值**
a) 可以用None或者np.nan来表示缺失的值
import pandas as pd
import numpy as np
data=[['mark',55,'Italy',4.5,'Europe'],
['John',33,'China',3.8,'Asian'],
['mary',40,'Japan',2.3,'Asian']]
df=pd.DataFrame(data=data,columns=['name','age','country','score','continent'],
index=[1001,1002,1003])
df.loc[1001,'score'] = None
df.loc[1004,:]=None
print(df)
b) 移除所有包含缺失数据的行
df=df.dropna()
注:
pd.dropna()同样不能改动原Dataframe,需要重新赋值
c) 只移除所有数据却缺失的行
df=df.dropna(how='all')