You may want to remove or add DataNodes in your HDFS cluster at some point. In fact, removing or adding nodes in Hadoop can be straightforward: it takes only a few simple operations, and they do not affect other running jobs. To make the removal or addition safe and efficient, however, we must pay attention to block replication and a few other points.
You should know the following mapping:
dfs.xxx ==> datanode ==> hdfs-site.xml
mapred.xxx ==> tasktracker ==> mapred-site.xml
This means you can operate on the DataNode and the TaskTracker separately, because the two have hardly any operations in common.
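The prefix convention above can be sketched as a small lookup. This is only an illustration of the mapping as stated in this post; the helper name `locate_property` and the table `PREFIX_TABLE` are hypothetical and not part of Hadoop.

```python
# Illustrative sketch: map a property-name prefix to the daemon and the
# config file named in this post. Not a Hadoop API; names are hypothetical.
PREFIX_TABLE = {
    "dfs.": ("datanode", "hdfs-site.xml"),
    "mapred.": ("tasktracker", "mapred-site.xml"),
}


def locate_property(name):
    """Return (daemon, config_file) for a property name, or None if the prefix is unknown."""
    for prefix, target in PREFIX_TABLE.items():
        if name.startswith(prefix):
            return target
    return None


if __name__ == "__main__":
    for prop in ("dfs.hosts.exclude", "mapred.hosts.exclude"):
        print(prop, "->", locate_property(prop))
```

A lookup like this is only a reminder of which file to edit for which daemon; the actual settings still have to be written into the XML files by hand.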
1. Removing DataNodes
Modify the cluster's hdfs-site.xml file on the namenode and add a property like this:
- <property>