Pandas数据分析初学--Apply函数

最新推荐文章于 2022-10-27 12:57:15 发布

Booting-O

最新推荐文章于 2022-10-27 12:57:15 发布

阅读量563

点赞数 1

文章标签： python 数据分析大数据

本文链接：https://blog.csdn.net/Google_eason/article/details/105464987

版权

首先导入Pandas库

import pandas as pd

（1）. 将数据框命名为crime；

#因为这个表中的数据有不同，所以需要将col=0的列设为index
crime = pd.read_csv('US_Crime_Rates_1960_2014.csv',index_col=0)

（2）. 每一列(column)的数据类型是什么样的？

crime.info()

（3）. 将Year的数据类型转换为datetime64；

crime.Year = pd.to_datetime(crime.Year,format='%Y')

（4）. 将列Year设置为数据框的索引；

crime = crime.set_index('Year',drop=True)

（5）. 删除名为Total的列；

del crime['Total']
#打印一下，查看是否删除成功
crime.head()

（6）. 按照Year（每十年）对数据框进行分组并求和；

crimes = crime.resample('10AS').sum()
population = crime.resample('10AS').max() #人口是累计数，不能直接求和
crimes['Population'] = population
crimes

（7）. 何时是美国历史上生存最危险的年代？

crime.idxmax(0)

数据表Github链接：https://github.com/Booting-O/Pandas-Practice.git

关注