Preface
- Linux RH7
- The ulimit command
- The lsof command
- /proc/{pid}/limits
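- Understanding Linux: system limits on open files
Today a program failed with a Too Many Open Files error. Its PID is 4837. The steps I used to investigate are below.
Checking the system's per-process file handle limit
Query it with the ulimit -n command (this reports the limit of the current shell session):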
> ulimit -n
65535
Or use the ulimit -a command and look at the open files row:
> ulimit -a
core file size (blocks, -c) 0
data seg size (kbytes, -d) unlimited
scheduling priority (-e) 0
file size (blocks, -f) unlimited
pending signals (-i) 255796
max locked memory (kbytes, -l) 64
max memory size (kbytes, -m) unlimited
open files (-n) 65535
pipe size (512 bytes, -p) 8
POSIX message queues (bytes, -q) 819200
real-time priority (-r) 0
stack size (kbytes, -s) 8192
cpu time (seconds, -t) unlimited
max user processes (-u) 255796
virtual memory (kbytes, -v) unlimited
file locks (-x) unlimited
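The number printed by ulimit -n is the soft limit of the current shell; a process started from another session or by a service manager can carry a different value. As a small sketch (assuming a bash-like shell), the soft and hard limits can be printed separately:

```shell
# Soft limit: the value actually enforced for this shell and the processes it starts.
soft=$(ulimit -Sn)
# Hard limit: the ceiling up to which an unprivileged process may raise its own soft limit.
hard=$(ulimit -Hn)
echo "open files: soft=$soft hard=$hard"
```

The variable names soft and hard are only for illustration.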
Checking Max open files for the process that hit Too Many Open Files
Every process has its own Max open files setting. The command to read it:
cat /proc/{pid}/limits |grep 'Max open files'
Result:
> cat /proc/4837/limits |grep 'Max open files'
Max open files 4096 4096 files
Or:
> cat /proc/4837/limits
Limit Soft Limit Hard Limit Units
Max cpu time unlimited unlimited seconds
Max file size unlimited unlimited bytes
Max data size unlimited unlimited bytes
Max stack size 8388608 unlimited bytes
Max core file size 0 unlimited bytes
Max resident set unlimited unlimited bytes
Max processes 255796 255796 processes
Max open files 4096 4096 files
Max locked memory 65536 65536 bytes
Max address space unlimited unlimited bytes
Max file locks unlimited unlimited locks
Max pending signals 255796 255796 signals
Max msgqueue size 819200 819200 bytes
Max nice priority 0 0
Max realtime priority 0 0
Max realtime timeout unlimited unlimited us
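To put the shell's value and the process's value side by side, a small helper can pull the soft and hard limits out of /proc/{pid}/limits. This is a minimal sketch: the function name proc_nofile_limits is made up for illustration, and $$ (the current shell) stands in for a real PID such as 4837.

```shell
# Print the soft and hard "Max open files" limits of a process.
# Usage: proc_nofile_limits PID   (hypothetical helper name)
proc_nofile_limits() {
    # Fields on that line: Max open files <soft> <hard> files
    awk '/^Max open files/ { print "soft=" $4, "hard=" $5 }' "/proc/$1/limits"
}

# Compare the shell's own soft limit with what /proc reports for a process.
echo "shell ulimit -n: $(ulimit -n)"
proc_nofile_limits "$$"
```

Replacing $$ with 4837 would show the limits of the failing process; reading another user's /proc entry may require root.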
How many file handles the process is using
The command:
lsof -p {pid} | wc -l
Result:
> lsof -p 4837 | wc -l
lsof: WARNING: can't stat() fuse.gvfsd-fuse file system /run/user/1001/gvfs
Output information may be incomplete.
4169
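This reveals the problem: ulimit reports a maximum of 65535 open files for the current shell, /proc/4837/limits reports a Max open files value of 4096, and lsof reports 4169 open files. Process 4837 has evidently gone past the limit shown in /proc/4837/limits.
As for why lsof reports 4169 open files rather than 4096, I have not figured that out yet.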
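One likely contributor to the gap (an assumption, not verified on this machine): the output of lsof -p includes a header line and rows that are not file descriptors, such as cwd, rtd, txt and mem (memory-mapped files), so piping it to wc -l overcounts. Counting the entries in /proc/{pid}/fd gives the number of descriptors that actually count against the limit. A minimal sketch, with hypothetical helper names and $$ standing in for the real PID:

```shell
# Count the file descriptors a process currently holds.
# Each entry in /proc/PID/fd is one open descriptor.
count_fds() {
    ls "/proc/$1/fd" | wc -l
}

# Count only lsof rows whose FD column (column 4 in the default layout)
# starts with a digit, skipping the header and rows such as cwd, rtd, txt, mem.
count_lsof_fds() {
    lsof -p "$1" 2>/dev/null | awk '$4 ~ /^[0-9]/' | wc -l
}

count_fds "$$"
```

Running count_fds 4837 and count_lsof_fds 4837 on the affected host should give numbers at or below 4096 if this explanation holds. Reading /proc/{pid}/fd of another user's process needs the same user or root.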
Finding the PID
> ps -ef | grep java