本需求是对2020年新冠肺炎疫情快速发展期进行分析,了解一下它的发展变化情况。首先,了解一下数据集:
import pandas as pd
import matplotlib.pyplot as plt
plt.rcParams['figure.figsize']=(10.0,6.0)
plt.rcParams['font.family']=['sans-serif']
plt.rcParams['font.sans-serif']=['SimHei']
df=pd.read_csv('https://www.gairuo.com/file/data/dataset/countries-aggregated.csv',parse_dates=['Date'])
df.tail()
'''
Date Country Confirmed Recovered Deaths
19339 2020-05-04 West Bank and Gaza 362 102 2
19340 2020-05-04 Western Sahara 6 5 0
19341 2020-05-04 Yemen 12 1 2
19342 2020-05-04 Zambia 137 78 3
19343 2020-05-04 Zimbabwe 34 5 4
'''
首先来看一下中国累计确诊人数趋势,可见爆发之初是快速上升的,如下:
(
df.assign(Da