现有数据:可见日期不连续
补全后:日期连续,缺失值用0填充
实现方法:
data = pd.read_table(r"./origdata/14-03-statics.txt", converters={'acc': str})
data = pd.DataFrame(data)
print(data)
data['date'] = pd.to_datetime(data['date'])
mux = pd.MultiIndex.from_product([data['acc'].unique(), pd.date_range(start='2014-03-01', end='2014-03-31')], names=['acc', 'date'])
data = data.set_index(['acc', 'date']).reindex(mux, fill_value=0).reset_index()
print(data)