The simplified server log file acs.log has the following format:
124.53.180.23 [25/Jun/2016:00:00:01 +0800] http://www.gvld.com/api/stock? user_type=100
24.128.233.214 [25/Jun/2016:00:01:00 +0800] http://www.gvld.com/message/cnt? uid=6806622
……………………….
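As a minimal sketch of how one line in this format can be split into its three fields (client IP, timestamp, URL): it assumes the timestamp is always enclosed in square brackets and tolerates missing spaces between fields. The names `LINE_RE` and `parse_line` are hypothetical and not part of the original code.

```python
import re

# Assumed layout: <IPv4> [<timestamp>] <URL>, with optional spaces between fields.
LINE_RE = re.compile(
    r'^\s*(\d{1,3}(?:\.\d{1,3}){3})'  # client IPv4 address
    r'\s*\[([^\]]+)\]'                # timestamp between square brackets
    r'\s*(\S.*?)\s*$'                 # rest of the line: the request URL
)

def parse_line(line):
    """Return (ip, timestamp, url) for a log line, or None if it does not match."""
    m = LINE_RE.match(line)
    return m.groups() if m else None

example = ("124.53.180.23 [25/Jun/2016:00:00:01 +0800] "
           "http://www.gvld.com/api/stock? user_type=100")
print(parse_line(example))
```

Lines that do not fit the assumed layout yield `None`, so a caller can skip them rather than fail.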
The following code extracts the API endpoints from the log. It is for reference only:
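import re

# Read the log file and return a list with one endpoint name per request
def log2list(path='acs.log'):
    # Capture everything after the last "/" of a www.gvld.com URL
    p = re.compile(r'http://www\.gvld\.com/.*/(.*)')
    l = []
    with open(path) as f:
        for row in f:
            for t1 in p.findall(row):
                # Drop the query string, keeping only the endpoint name
                l.append(t1.split('?')[0])
    return l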
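# Count how many times each endpoint appears
def count():
    l = log2list()
    a = {}
    for i in l:
        # One pass over the list instead of calling l.count(i) for every element
        a[i] = a.get(i, 0) + 1
    print(a)
    return a

count()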
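The same counting can also be sketched with the standard library's `urllib.parse.urlsplit` and `collections.Counter` in Python 3. This is an alternative under stated assumptions, not the original author's method: the endpoint is taken to be the last path segment of the URL, the URL is assumed to end at the first whitespace, and `URL_RE` and `count_endpoints` are hypothetical names.

```python
import re
from collections import Counter
from urllib.parse import urlsplit

# Matches the URL up to the first whitespace; the query string is not needed here.
URL_RE = re.compile(r'https?://\S+')

def count_endpoints(lines):
    """Count requests per endpoint, taking the last path segment as the endpoint."""
    counts = Counter()
    for line in lines:
        match = URL_RE.search(line)
        if match is None:
            continue  # skip blank or malformed lines
        path = urlsplit(match.group(0)).path
        name = path.rstrip('/').rsplit('/', 1)[-1]
        if name:
            counts[name] += 1
    return counts

sample = [
    "124.53.180.23 [25/Jun/2016:00:00:01 +0800] http://www.gvld.com/api/stock? user_type=100",
    "24.128.233.214[25/Jun/2016:00:01:00+0800]http://www.gvld.com/message/cnt? uid=6806622",
    "124.53.180.23 [25/Jun/2016:00:02:00 +0800] http://www.gvld.com/api/stock? user_type=200",
]
print(count_endpoints(sample).most_common())
```

Because `count_endpoints` accepts any iterable of lines, an open file object can be passed directly, for example `with open('acs.log') as f: count_endpoints(f)`, so the whole log is never held in memory at once.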