转自 : Netflix_Linux_Perf_Analysis_60s.pdf (brendangregg.com)
Linux Performance Analysis in 60,000 Milliseconds
You login to a Linux server with a performance issue: what do you check in the first minute?
At Netflix we have a massive EC2 Linux cloud, and numerous performance analysis tools to monitor and
investigate its performance. These include Atlas for cloudwide monitoring, and Vector for ondemand
instance analysis. While those tools help us solve most issues, we sometimes need to login to an instance
and run some standard Linux performance tools.
In this post, the Netflix Performance Engineering team will show you the first 60 seconds of an optimized
performance investigation at the command line, using standard Linux tools you should have available.
In 60 seconds you can get a high level idea of system resource usage and running processes by running the
following ten commands. Look for errors and saturation metrics, as they are both easy to interpret, and then
resource utilization. Saturation is where a resource has more load than it can handle, and can be exposed
either as the length of a request queue, or time spent waiting.