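Nationwide Data Summary
Let's start with an overview of the recruitment data from all nine cities.
Combining the Data
First read each city's CSV file, then merge them with the pandas concat function.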
import pandas as pd

beijing = pd.read_csv("beijing_data.csv")
shanghai = pd.read_csv("shanghai_data.csv")
shenzhen = pd.read_csv("shenzhen_data.csv")
guangzhou = pd.read_csv("guangzhou_data.csv")
hangzhou = pd.read_csv("hangzhou_data.csv")
nanjing = pd.read_csv("nanjing_data.csv")
wuhan = pd.read_csv("wuhan_data.csv")
xian = pd.read_csv("xian_data.csv")
chengdu = pd.read_csv("chengdu_data.csv")
all_data = pd.concat([beijing, shanghai, shenzhen, guangzhou, hangzhou, nanjing, wuhan, xian, chengdu], ignore_index=True)
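The same read-then-concat pattern can also be written as a loop, which makes it easy to tag each row with its city. The snippet below is only a sketch: it uses small in-memory CSV strings in place of the real files, and the column names (position, salary), the city column, and the load_all helper are assumptions for illustration, not the actual schema of the scraped data.

```python
import io

import pandas as pd

# Hypothetical stand-ins for two of the per-city CSV files; the column
# names ("position", "salary") are assumptions, not the real schema.
SAMPLE_FILES = {
    "beijing": "position,salary\nPython Engineer,15-25K\nData Analyst,10-18K\n",
    "shanghai": "position,salary\nBackend Developer,20-35K\n",
}


def load_all(files):
    """Read each city's CSV text and stack the results into one DataFrame."""
    frames = []
    for city, csv_text in files.items():
        df = pd.read_csv(io.StringIO(csv_text))
        df["city"] = city  # remember which source each row came from
        frames.append(df)
    # ignore_index=True renumbers the rows 0..n-1 instead of
    # repeating each file's own 0-based index
    return pd.concat(frames, ignore_index=True)


sample_all = load_all(SAMPLE_FILES)
```

With real files, io.StringIO(csv_text) would be replaced by the file path, for example f"{city}_data.csv".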
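Calculating the Average Salary
The scraped salaries are all ranges such as 15-25K, so a little processing is needed to compute an average salary for each position.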
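One simple way to reduce a range to a single number is to take its midpoint. The following is a minimal standalone sketch of that idea, kept separate from the get_num function in this post. It assumes every salary is a string of the form low-highK; the average_salary name and the choice to return None for anything that does not match are illustrative assumptions.

```python
import re

# Assumed format: "<low>-<high>K", e.g. "15-25K"; a lowercase "k" is accepted too.
SALARY_RANGE = re.compile(r"(\d+)-(\d+)K", re.IGNORECASE)


def average_salary(text):
    """Return the midpoint of a salary range (in thousands), or None if it cannot be parsed."""
    match = SALARY_RANGE.match(text.strip())
    if match is None:
        return None  # e.g. "negotiable" or an unexpected format
    low, high = int(match.group(1)), int(match.group(2))
    return (low + high) / 2
```

Returning None rather than raising keeps unparseable rows visible, so they can be counted or dropped explicitly later.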
import re
rege = r'(\d+)-(\d+)K'
def get_num(mystr):
    res = re.match(rege, mystr)
    res