If the data file comes from this link, the problem is that some missing values are recorded as '?'. So the necessary parameter is na_values='?'.
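To see why this parameter matters, here is a minimal sketch on a tiny in-memory sample. The sample rows and the column subset are made up for illustration and are not taken from the real file; only the ';' separator and the '?' placeholder follow the description above.

```python
import io
import pandas as pd

# Hypothetical three-row sample that mimics the file layout (assumption).
sample = (
    "Date;Time;Global_active_power;Voltage\n"
    "16/12/2006;17:24:00;4.216;234.840\n"
    "16/12/2006;17:25:00;?;?\n"
    "16/12/2006;17:26:00;5.374;233.290\n"
)

# Without na_values, the '?' forces the whole column to be read as text.
raw = pd.read_csv(io.StringIO(sample), sep=';')

# With na_values='?', the placeholder becomes NaN and the column stays numeric.
clean = pd.read_csv(io.StringIO(sample), sep=';', na_values='?')

print(raw['Global_active_power'].dtype)
print(clean['Global_active_power'].dtype)
print(clean['Global_active_power'].isna().sum())
```

With the placeholder declared, numeric operations such as mean() or resample() work directly on the columns. For the full file, the complete call is: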
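import pandas as pd

# Read the semicolon-separated file; '?' marks missing values.
# Note: infer_datetime_format and dict-style parse_dates are deprecated in newer pandas releases.
dataset = pd.read_csv('household_power_consumption.txt',
                      sep=';',
                      header=0,
                      low_memory=False,
                      infer_datetime_format=True,
                      parse_dates={'datetime': [0, 1]},  # combine the Date and Time columns
                      index_col=['datetime'],
                      na_values='?')
print(dataset.head())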
                     Global_active_power  Global_reactive_power  Voltage  \
datetime
2006-12-16 17:24:00                4.216                  0.418   234.84
2006-12-16 17:25:00                5.360                  0.436   233.63
2006-12-16 17:26:00                5.374                  0.498   233.29
2006-12-16 17:27:00                5.388                  0.502   233.74
2006-12-16 17:28:00                3.666                  0.528   235.68
Global_intensity Sub_metering_1 Sub_metering_2 \
datetime
2006-12-16 17:24:00 18.4 0.0 1