str1 ="北京101010100朝阳101010300顺义101010400怀柔101010500通州101010600昌平101010700延庆101010800丰台101010900石景山101011000大兴101011100房山101011200密云101011300门头沟101011400平谷101011500八达岭101011600佛爷顶101011700汤河口101011800密云上甸子101011900斋堂101012000霞云岭101012100北京城区101012200"
tuples = re.findall(ur"([\u4e00-\u9fa5]+)(\d+)",str1.decode('utf8')) #\u4e00-\u9fa5为中文Unicode编码
dic1 = dict(tuples)
for key in dic1:
print key.encode('utf-8')+':'+dic1[key].encode('utf-8')
tuples = re.findall(ur"([\u4e00-\u9fa5]+)(\d+)",str1.decode('utf8')) #\u4e00-\u9fa5为中文Unicode编码
dic1 = dict(tuples)
for key in dic1:
print key.encode('utf-8')+':'+dic1[key].encode('utf-8')