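Being used to Java, I have habitually written Java programs to process and aggregate the report files generated on our servers. A shell script can handle such log files just as well, though. It is more efficient and adapts easily to changing requirements. Take the following log file as an example. For demonstration purposes, this is only a small excerpt of the log from the real environment: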
IP: 10.246.133.165
corename:search4newecrmonline-113,indexsize:4647684,indexNum:13546485,timesum:1076306,querycount:9624
IP: 10.246.133.165
corename:search4newecrmonline-114,indexsize:4859960,indexNum:14225152,timesum:1063298,querycount:5646
IP: 10.246.133.165
corename:search4newecrmonline-115,indexsize:4292468,indexNum:12561415,timesum:875506,querycount:3813
IP: 10.246.133.165
corename:search4newecrmonline-153,indexsize:4314760,indexNum:12604725,timesum:885560,querycount:4509
IP: 10.246.133.165
corename:search4newecrmonline-154,indexsize:3950384,indexNum:11540657,timesum:831698,querycount:5803
IP: 10.246.133.165
corename:search4newecrmonline-155,indexsize:3898128,indexNum:11439464,timesum:926018,querycount:8293
IP: 10.246.133.165
corename:search4newecrmonline-193,indexsize:4187096,indexNum:12269895,timesum:875416,querycount:5022
IP: 10.246.133.165
corename:search4newecrmonline-194,indexsize:4224724,indexNum:12356496,timesum:1112942,querycount:6041
IP: 10.246.133.165
corename:search4newecrmonline-195,indexsize:4292484,indexNum:12582706,timesum:2564479,querycount:12318
IP: 10.246.133.165
corename:search4newecrmonline-233,indexsize:4379768,indexNum:12851563,timesum:914215,querycount:3402
IP: 10.246.133.165
corename:search4newecrmonline-234,indexsize:4658288,indexNum:13489599,timesum:1553443,querycount:13432
IP: 10.246.133.165
corename:search4newecrmonline-235,indexsize:3888760,indexNum:11383280,timesum:821678,querycount:4243
IP: 10.246.133.165
corename:search4newecrmonline-273,indexsize:4463124,indexNum:13033226,timesum:1179811,querycount:9059
IP: 10.246.133.165
corename:search4newecrmonline-274,indexsize:3951476,indexNum:11559612,timesum:1057978,querycount:5684
IP: 10.246.133.165
corename:search4newecrmonline-275,indexsize:4312312,indexNum:12611877,timesum:868934,querycount:3365
IP: 10.246.133.165
corename:search4newecrmonline-313,indexsize:4006048,indexNum:11764585,timesum:830098,querycount:5760
IP: 10.246.133.165
corename:search4newecrmonline-314,indexsize:4733364,indexNum:13854315,timesum:1185240,querycount:11434
IP: 10.246.133.165
corename:search4newecrmonline-315,indexsize:4491000,indexNum:13126275,timesum:1034849,querycount:5061
IP: 10.246.133.165
corename:search4newecrmonline-33,indexsize:4098464,indexNum:12025008,timesum:830250,querycount:2831
IP: 10.246.133.165
corename:search4newecrmonline-34,indexsize:4393756,indexNum:12728587,timesum:2630776,querycount:14342
IP: 10.246.133.165
corename:search4newecrmonline-35,indexsize:4753732,indexNum:13850790,timesum:1087923,querycount:5018
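To total one of these fields, for example indexNum, the shell command below is all it takes. awk splits every matching line at the string 'indexNum:', so $2 starts with the number to add, and awk uses that leading number when $2 appears in arithmetic. The END block prints the total once all lines have been read. To total indexsize or another field instead, change the separator string accordingly. One line and the result is there. Convenient, isn't it?
awk -F 'indexNum:' '/search4newecrmonline-/ {sum += $2} END {printf "indexNum_sum:%.0f\n", sum}' ./*
Result (computed over the full log, not just the excerpt above):
indexNum_sum:4028632138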
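Since every record is a comma-separated list of key:value pairs, the same idea can be generalized so that the field name becomes a parameter. The following is only a minimal sketch under that assumption. sum_field is a hypothetical helper name, not something from the original setup, and it relies on the record layout shown in the excerpt.

```shell
#!/bin/sh
# sum_field FIELD [FILE...]
# Hypothetical helper: adds up the values of one key (e.g. indexsize,
# indexNum, timesum, querycount) across all "corename:" records.
# Assumes records look like "corename:NAME,key:value,key:value,...".
sum_field() {
    key=$1
    shift
    # Split each line on "," and ":" so keys and values become
    # alternating fields; add the field that follows the wanted key.
    awk -v key="$key" -F '[,:]' '
        /corename:/ {
            for (i = 1; i < NF; i++)
                if ($i == key) sum += $(i + 1)
        }
        # %.0f prints large totals as plain integers
        END { printf "%s_sum:%.0f\n", key, sum }
    ' "$@"
}
```

Calling `sum_field indexsize ./*` or `sum_field querycount ./*` would then total a different column without touching the awk program. With no file arguments, the function reads from standard input.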