The problem is:
Determine the top 10 most common source IP addresses, and their hit rates, for a fleet of 1000 web servers within the last hour.
The following assumptions may be used…
- Web servers write their access logs locally in the Apache Combined Log Format.
- Web servers are accessible by ssh.
My solution is:
http://bo-yang.github.io/2014/07/07/determine-top-common-ip-addresses
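
For readers who want a feel for the counting step before following the link, here is a minimal Python sketch. It is not the linked solution. It assumes each log line starts with the standard Combined Log Format fields (`host ident authuser [timestamp]`), and that "hit rate" means hits per second over the one-hour window. The names `count_recent_ips` and `top_ips` are hypothetical, and fetching the logs over ssh is left out.

```python
import re
from collections import Counter
from datetime import datetime, timedelta

# Leading fields of a Combined Log Format line: host ident authuser [timestamp]
LOG_RE = re.compile(r'^(\S+) \S+ \S+ \[([^\]]+)\]')
# Apache's default timestamp, e.g. 07/Jul/2014:11:30:00 +0000
# (%b assumes English month names, i.e. the C/English locale)
TIME_FMT = "%d/%b/%Y:%H:%M:%S %z"


def count_recent_ips(lines, now, window=timedelta(hours=1)):
    """Count hits per source IP for lines whose timestamp is within `window` of `now`.

    `now` must be a timezone-aware datetime so it can be compared
    with the parsed log timestamps.
    """
    counts = Counter()
    cutoff = now - window
    for line in lines:
        match = LOG_RE.match(line)
        if not match:
            continue  # skip lines that do not look like access-log entries
        try:
            ts = datetime.strptime(match.group(2), TIME_FMT)
        except ValueError:
            continue  # skip unparseable timestamps
        if cutoff <= ts <= now:
            counts[match.group(1)] += 1
    return counts


def top_ips(per_server_counts, n=10, window_seconds=3600):
    """Merge per-server counters; return (ip, hits, hits_per_second) for the top n."""
    total = Counter()
    for counts in per_server_counts:
        total.update(counts)  # Counter.update adds counts together
    return [(ip, hits, hits / window_seconds) for ip, hits in total.most_common(n)]
```

One possible design, under these assumptions, is to run the counting step on each server and send back only the per-IP counts, so that the merge in `top_ips` handles 1000 small counters rather than 1000 full log files.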