I am trying to remove NA values from a pandas DataFrame.
I used dropna(), which should drop every row that contains an NA value. However, it does not work.
Here is the code:

import pandas as pd
import numpy as np
prison_data = pd.read_csv('https://andrewshinsuke.me/docs/compas-scores-two-years.csv')

That is how I get the DataFrame. As shown below, the default read_csv method does convert the NA data points to np.nan:

np.isnan(prison_data.head()['out_custody'][4])
Out[2]: True
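
For reference, here is a minimal sketch of how dropna() behaves on a small frame. The frame `toy` and its values are hypothetical stand-ins, not rows from the COMPAS file; only the column name out_custody is borrowed from above. It illustrates two documented pandas behaviours that may be relevant here: dropna() returns a new DataFrame rather than modifying the original, and with default arguments it drops a row if any column in that row is NA.

```python
import numpy as np
import pandas as pd

# Hypothetical toy data, not taken from the COMPAS file.
toy = pd.DataFrame({
    "id": [1, 3, 4],
    "out_custody": ["2013-08-14", np.nan, "2013-04-14"],
})

# isna() works for any dtype; np.isnan() only accepts numeric input.
na_counts = toy.isna().sum()

# dropna() returns a new DataFrame; `toy` itself is left unchanged.
cleaned = toy.dropna()
print(len(toy), len(cleaned))  # 3 2

# subset= restricts the NA check to the listed columns.
cleaned_subset = toy.dropna(subset=["out_custody"])
```

If the result of dropna() is not assigned back to a name, the original frame still shows its NaN values. On a wide table, the default how='any' can also remove far more rows than expected, so passing subset= is often the safer choice.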

Conveniently, the head() of the DataFrame already contains a NaN value (in the out_custody column), so printing prison_data.head() gives:

   id                name   first         last  compas_screening_date   sex
0   1    miguel hernandez  miguel    hernandez             2013-08-14  Male
1   3         kevon dixon   kevon        dixon             2013-01-27  Male
2   4            ed philo      ed        philo             2013-04-14  Male
3   5         marcu brown   marcu        brown             2013-01-13  Male
4   6  bouthy pierrelouis  bouthy  pierrelouis             2013-03-26  Male
          dob  age          age_cat             race ...
0  1947-04-18   69  Greater than 45            Other ...
1  1982-01-22   34          25 - 45  African-America