Modifying values in pandas
http://pandas.pydata.org/pandas-docs/stable/getting_started/10min.html
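A minimal sketch of modifying values, assuming a small made-up DataFrame (the data and the replacement string are illustrative only, not taken from the real dataset). Assigning through .loc changes the frame in place, either one cell at a time or for every row matching a condition:

```python
import pandas as pd

# Hypothetical toy data; column names and values are assumptions for illustration.
df = pd.DataFrame({
    "age": [25, 38, 52],
    "income": ["<=50K.", ">50K.", "<=50K."],
})

# Set a single cell, addressed by row label and column name.
df.loc[0, "age"] = 26

# Set every row that matches a condition (here: strip the trailing dot).
df.loc[df["income"] == "<=50K.", "income"] = "<=50K"

print(df)
```

Using a single .loc call with both the row selector and the column name avoids chained indexing such as df[mask]["income"] = ..., which may write to a temporary copy instead of the original frame.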
Pie chart
import pandas
import matplotlib.pyplot as plt

data = pandas.read_csv(data_path)  # data_path: path to the CSV file
status = data['marital-status']
counts = status.value_counts()  # number of rows per category
counts.plot(kind='pie',
            figsize=(5, 6),
            autopct='%1.0f%%',  # label each slice with its percentage
            startangle=90,  # start the first slice at 90°
            # shadow=True,  # add shadow
            )
plt.title('marital-status')
plt.axis('equal')  # equal aspect ratio so the pie is drawn as a circle
plt.show()
Histogram
data['age'].plot(kind='hist', rwidth=0.9)
plt.xlabel("age")
plt.ylabel("count")
plt.show()
Selecting rows that contain a given value
For example, if the data has a column named income and we want to select the records where income is '<=50K.', then:
data_lt50 = data.loc[data['income']=='<=50K.']
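The selection above works through a boolean mask. A minimal sketch on made-up data (the toy rows are assumptions standing in for the real dataset) shows the intermediate mask, plus isin() for matching several values at once:

```python
import pandas as pd

# Hypothetical toy data standing in for the real dataset.
data = pd.DataFrame({
    "age": [39, 50, 28, 44],
    "income": ["<=50K.", ">50K.", "<=50K.", ">50K."],
})

# The comparison yields a boolean Series, one True/False per row.
mask = data["income"] == "<=50K."

# .loc keeps only the rows where the mask is True.
data_lt50 = data.loc[mask]

# isin() selects rows whose value matches any entry in the list.
data_any = data.loc[data["income"].isin(["<=50K.", ">50K."])]

print(data_lt50)
```

The filtered frame keeps the original row labels; call reset_index(drop=True) on it if a fresh 0-based index is needed.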