Removing and adding DataNodes in cluster by hadoop

本文介绍了如何通过配置Hadoop集群的dfs.hosts和dfs.hosts.exclude属性来添加和移除DataNodes。同时,也提到了mapred.hosts和mapred.hosts.exclude用于管理TaskTracker。此外, dfs.balance.bandwidthPerSec参数用于设置每个DataNode进行数据平衡的最大带宽。
摘要由CSDN通过智能技术生成
You may want to remove or add some DataNodes from your HDFS cluster at some point. In fact ,Removing or adding nodes in Hadoop can be straightforward.Like this, we only do some simply operations, in which  we will not affect ongoing other jobs.But in order to removing or adding more safe and   efficient   ,we must note replication of blocks and other points.

I think you should known that:

dfs.xxx       ==>   datanode   ==>   hdfs-site.xml
mapred.xxx    ==> tasktracker    ==> mapred-site.xml
It's means:you can do some operations on datanode and tasktracker   respectively. Because they are hardly in the same of operations.

1.Removing DataNodes
Modify your cluster file of hdfs-site.xml on namenode:add some property like this
  1. <property>  
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值