chapter02
python第三方模块将json字符串转换为python字典:
import json
path = 'ch02/usagov_bitly_data2012-03-16-1331923249.txt'
records = [json.loads(line) for line in open(path)] #list 解析
上面返回的列表的元素是字典类型。
在列表解析中加入判断语句:
time_zones = [rec['tz'] for rec in records if 'tz' in rec]
collections模块,collections.defaultdict类:
from collections import defaultdict
def get_counts2(sequence):
counts = defaultdict(int) #所有的值均会被初始化0
for x in sequence:
counts[x] += 1
return counts
collections模块,collections.Counter类:
from collections import Counter
counts = Counter(time_zones)
counts.most_common(10) #输出前10位最大