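- Example: compute the mean of one column in a CSV file and print it
- Tool: Jupyter
- Input: a CSV file with tens of thousands of rows; here the mean of the TEMP column is computed. A partial screenshot of the data:
Pure Python implementation: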
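%%timeit
import csv

def read_csv(file_name):
    # Read the whole CSV into a list of dicts keyed by the header row
    with open(file_name, 'r', newline='') as f:
        reader = csv.DictReader(f)
        return list(reader)

data = read_csv('D:/python/test.csv')
# Skip rows where TEMP is missing (marked 'NA') before converting to float
temp_list = [float(item['TEMP']) for item in data if item['TEMP'] != 'NA']
print(sum(temp_list) / len(temp_list))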
Output:
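The version above loads every row into a list before filtering. As a minimal sketch of an alternative, the same mean can be computed in one streaming pass that keeps only a running total and count. The helper name `mean_of_column` and the sample rows are hypothetical and stand in for the real file:

```python
import csv
import io

def mean_of_column(lines, column, missing="NA"):
    """Return the mean of a numeric CSV column, skipping missing markers."""
    total = 0.0
    count = 0
    # csv.DictReader accepts any iterable of lines, including an open file
    for row in csv.DictReader(lines):
        value = row[column]
        if value != missing:
            total += float(value)
            count += 1
    if count == 0:
        raise ValueError(f"no numeric values in column {column!r}")
    return total / count

# Hypothetical sample rows (not taken from the real data set)
sample = io.StringIO("No,TEMP\n1,-4.0\n2,NA\n3,6.0\n4,1.0\n")
print(mean_of_column(sample, "TEMP"))
```

For a real file, pass the handle returned by `open(file_name, newline='')` instead of the `io.StringIO` object.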
pandas implementation:
%%timeit
import pandas as pd
df = pd.read_csv('C:/Users/Administrator/BeijingPM20100101_20151231.csv')
# Take the TEMP column and compute the mean temperature; the code is shorter and runs much faster
a = df['TEMP'].mean()
print(a)
Output:
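The pandas version needs no explicit `'NA'` filter: by default `pd.read_csv` parses the string `NA` as a missing value (NaN), and `Series.mean()` skips NaN. The sketch below checks this on a few hypothetical rows (not taken from the real data set). `usecols` is an optional addition here that limits parsing to the one column needed:

```python
import io
import pandas as pd

# Hypothetical sample rows standing in for the real CSV file
sample = io.StringIO("No,TEMP\n1,-4.0\n2,NA\n3,6.0\n4,1.0\n")

# Parse only the TEMP column; 'NA' becomes NaN under the default na_values
df = pd.read_csv(sample, usecols=["TEMP"])

missing_count = df["TEMP"].isna().sum()  # number of rows parsed as NaN
temp_mean = df["TEMP"].mean()            # skipna=True by default

print(missing_count)
print(temp_mean)
```

On these rows the result matches the manual filter in the pure Python version: the one `NA` row is excluded and the mean is taken over the remaining three values.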