需求背景:公司和外包公司进行结算,需要对每天的作业量进行统计。作业量分为进出口量,进口量分为天津市区进口量、全国进口量以及其他。出口量类似。常规的方法是使用excel进行筛选。但人工筛选存在两个问题,大量的表格会耗费很多时间和精力,人工筛选也会出现一些纰漏。
代码如下:出口测试
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
from datetime import datetime
from pandas import Series, DataFrame
df = pd.read_excel('data/17out.xls')
df1=df.loc[(df["邮路"]=="WS30005000") | (df["邮路"] == "二枢纽-速处") | (df["邮路"] == "WS30006000")
| (df["邮路"] == "WS30008100")| (df["邮路"] == "WS30004000")].数量.sum()
df2 = df.loc[(df["邮路"]=="市内一次1路") | (df["邮路"] == "市内一次2路") | (df["邮路"] == "市内一次3路")
| (df["邮路"] == "市内一次4路")| (df["邮路"] == "市内一次5路")| (df["邮路"] == "市内一次6路")
| (df["邮路"] == "市内一次7路")| (df["邮路"] == "市内一次8路")| (df["邮路"] == "市内一次9路")
| (df["邮路"] == "市内一次10路")| (df["邮路"] == "市内一次11路")| (df["邮路"] == "市内一次12路")
| (df["邮路"] == "市内一次13路")
| (df["邮路"] == "市内二次1路")| (df["邮路"] == "市内二次2路")| (df["邮路"] == "市内二次3路")
| (df["邮路"] == "市内二次4路")| (df["邮路"] == "市内二次5路")| (df["邮路"] == "市内二次6路")
| (df["邮路"] == "市内二次7路")| (df["邮路"] == "市内二次8路")| (df["邮路"] == "市内二次9路")
| (df["邮路"] == "市内二次10路")| (df["邮路"] == "市内二次11路")| (df["邮路"] == "市内