高校录取分数分析

高校录取分数分析及预测

import pandas as pd
import os
import re

#moveDocLocation 调档分位次 15\16\17的平均值
def avg_moveDocLocation(historydata,year=2018):
    historydata=historydata.fillna(-1)
    historydata["matricDiffYear"]=historydata["matricDiffYear"].astype('int64')
    historydata=historydata.sort_values(by=["collegeCode","order","matricDiffYear"])
    collegecode=historydata["collegeCode"]
    unique_collegecode = collegecode.unique()
    result=pd.DataFrame()
    for code in unique_collegecode:
        temp=historydata.loc[historydata.collegeCode==code,["collegeCode","collegeName","order","matricDiffYear","moveDocLocation","averageLocation","moveDocGrade"]]
        name= temp.loc[0,"collegeName"]
        tt_rst=pd.DataFrame()
        for order in temp["order"].unique():
            temp_order=temp.loc[temp.order==order,:]
            temp_year1 = temp_order.loc[temp_order.matricDiffYear>=(year-3),:]
            temp_year2=temp_year1.loc[temp_year1.matricDiffYear<=(year-1),:]
            temp_year3=temp_year2.loc[temp_year2.moveDocLocation>0,:]
            temp_year4=temp_year3.loc[temp_year3.averageLocation>0,:]
            tmp_value=temp_order.loc[temp_order.matricDiffYear==year,"moveDocGrade"]
            if len(tmp_value)==0:
                tmp_moveDocGrade=0
            else:
                tmp_moveDocGrade=int(tmp_value)
            mean_mov_avg=list(temp_year4[["moveDocLocation","averageLocation"]].mean())
            temp_result = pd.DataFrame([code,name,order,mean_mov_avg[0],mean_mov_avg[1],tmp_moveDocGrade],
                                       index = ["collegecode","collegeName","order","moveDocLocation","averageLocation","moveDocGrade"])
            te
  • 0
    点赞
  • 3
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值