故障现象是hadoop集群中某个datanode无故失联,调阅log记录中发现
INFO org.apache.hadoop.hdfs.server.datanode.DataNode: Got finalize command for block pool BP-***
ERROR org.apache.hadoop.hdfs.server.datanode.DataNode: RECEIVED SIGNAL 15: SIGTERM
INFO org.apache.hadoop.hdfs.server.datanode.DataNode: SHUTDOWN_MSG:
signal 15含意是使用不带参数的kill命令时终止进程,初步判断,由于文件数据块的原因造成datanode失联
对配置文件hdfs-site.xml增加如下配置
<property>
<name>dfs.namenode.secondary.http-address</name>
INFO org.apache.hadoop.hdfs.server.datanode.DataNode: Got finalize command for block pool BP-***
ERROR org.apache.hadoop.hdfs.server.datanode.DataNode: RECEIVED SIGNAL 15: SIGTERM
INFO org.apache.hadoop.hdfs.server.datanode.DataNode: SHUTDOWN_MSG:
signal 15含意是使用不带参数的kill命令时终止进程,初步判断,由于文件数据块的原因造成datanode失联
对配置文件hdfs-site.xml增加如下配置
<property>
<name>dfs.namenode.secondary.http-address</name>