ccah-500 第31题 Which workloads benefit the most from faster network fabric

31.You are planning a Hadoop cluster and considering implementing 10 Gigabit Ethernet as the network fabric. Which workloads benefit the most from faster network fabric?

A. When your workload generates a large amount of output data, significantly larger than the amount of intermediate data

B. When your workload consumes a large amount of input data, relative to the entire capacity if HDFS

C. When your workload consists of processor-intensive tasks

D. When your workload generates a large amount of intermediate data, on the order of the input data itself

 

Answer: D

A 当负载生成的输出数据显著大于中间数据的量时。

B 当工作负载需要大量输入数据,相对于hdfs整个容量。

C 当工作负载由处理器密集型任务组成。

 

A 有点道理.
Questions enforces more on Network Fabric not I/O bound which are local.
Large data output means, large data shuffle across network for Reducer.

 

D 的依据更明显。

http://blog.cloudera.com/blog/2013/08/how-to-select-the-right-hardware-for-your-new-hadoop-cluster/

“When we encounter applications that produce large amounts of intermediate data — outputting data on the same order as the amount read in — we recommend two ports on a single Ethernet card or two channel-bonded Ethernet cards to provide 2 Gbps per machine.

Cloudera recommends:
Consider 10Gb/sec in the cases:
- Clusters storing very large amounts of data
- Clusters in which typical MapReduce jobs produce large amounts of intermediate data.

please take note that: Intermediate data is transferred across the network to the Reducers

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值