java
/etc/profile
export JAVA_HOME=/usr/local/share/jdk
export JRE_HOME=${JAVA_HOME}/jre
export CLASSPATH=.:${JAVA_HOME}/lib:${JRE_HOME}/lib
export PATH=${JAVA_HOME}/bin:$PATH
ssh
rsync
HDFS NameNode, SecondaryNameNode, and DataNode
YARN ResourceManager, NodeManager, and WebAppProxy
MapReduce MapReduce Job History Server
read-only default configuration - core-default.xml, hdfs-default.xml, yarn-default.xml and mapred-default.xml
Site-specific configuration - etc/hadoop/core-site.xml, etc/hadoop/hdfs-site.xml, etc/hadoop/yarn-site.xml and etc/hadoop/mapred-site.xml
you can control the Hadoop scripts found in the bin/ directory of the distribution, by setting site-specific values via the etc/hadoop/hadoop-env.sh and etc/hadoop/yarn-env.sh.
To configure the Hadoop cluster you will need to configure the environment in which the Hadoop daemons execute as well as the configuration parameters for the Hadoop daemons.
HDFS daemons are NameNode, SecondaryNameNode, and DataNode. YARN damones are ResourceManager, NodeManager, and WebAppProxy. If MapReduce is to be used, then the MapReduce Job History Server will also be running. For large installations, these are generally running on separate hosts.
dfs.namenode.http-address 0.0.0.0:50070
dfs.datanode.http.address 0.0.0.0:50075
dfs.namenode.secondary.http-address 0.0.0.0:50090
mapreduce.jobtracker.http.address 0.0.0.0:50030
mapreduce.tasktracker.http.address 0.0.0.0:50060
mapreduce.jobhistory.address 0.0.0.0:10020
mapreduce.jobhistory.webapp.address 0.0.0.0:19888
yarn.resourcemanager.address ${yarn.resourcemanager.hostname}:8032
yarn.nodemanager.address ${yarn.nodemanager.hostname}:0
yarn.resourcemanager.scheduler.address ${yarn.resourcemanager.hostname}:8030
yarn.resourcemanager.webapp.address ${yarn.resourcemanager.hostname}:8088
yarn.resourcemanager.resource-tracker.address ${yarn.resourcemanager.hostname}:8031
NameNode http://nn_host:port/ Default HTTP port is 50070. dfs.namenode.http-address
ResourceManager http://rm_host:port/ Default HTTP port is 8088. yarn.resourcemanager.webapp.address
MapReduce JobHistory Server <
/etc/profile
export JAVA_HOME=/usr/local/share/jdk
export JRE_HOME=${JAVA_HOME}/jre
export CLASSPATH=.:${JAVA_HOME}/lib:${JRE_HOME}/lib
export PATH=${JAVA_HOME}/bin:$PATH
ssh
rsync
HDFS NameNode, SecondaryNameNode, and DataNode
YARN ResourceManager, NodeManager, and WebAppProxy
MapReduce MapReduce Job History Server
read-only default configuration - core-default.xml, hdfs-default.xml, yarn-default.xml and mapred-default.xml
Site-specific configuration - etc/hadoop/core-site.xml, etc/hadoop/hdfs-site.xml, etc/hadoop/yarn-site.xml and etc/hadoop/mapred-site.xml
you can control the Hadoop scripts found in the bin/ directory of the distribution, by setting site-specific values via the etc/hadoop/hadoop-env.sh and etc/hadoop/yarn-env.sh.
To configure the Hadoop cluster you will need to configure the environment in which the Hadoop daemons execute as well as the configuration parameters for the Hadoop daemons.
HDFS daemons are NameNode, SecondaryNameNode, and DataNode. YARN damones are ResourceManager, NodeManager, and WebAppProxy. If MapReduce is to be used, then the MapReduce Job History Server will also be running. For large installations, these are generally running on separate hosts.
dfs.namenode.http-address 0.0.0.0:50070
dfs.datanode.http.address 0.0.0.0:50075
dfs.namenode.secondary.http-address 0.0.0.0:50090
mapreduce.jobtracker.http.address 0.0.0.0:50030
mapreduce.tasktracker.http.address 0.0.0.0:50060
mapreduce.jobhistory.address 0.0.0.0:10020
mapreduce.jobhistory.webapp.address 0.0.0.0:19888
yarn.resourcemanager.address ${yarn.resourcemanager.hostname}:8032
yarn.nodemanager.address ${yarn.nodemanager.hostname}:0
yarn.resourcemanager.scheduler.address ${yarn.resourcemanager.hostname}:8030
yarn.resourcemanager.webapp.address ${yarn.resourcemanager.hostname}:8088
yarn.resourcemanager.resource-tracker.address ${yarn.resourcemanager.hostname}:8031
NameNode http://nn_host:port/ Default HTTP port is 50070. dfs.namenode.http-address
ResourceManager http://rm_host:port/ Default HTTP port is 8088. yarn.resourcemanager.webapp.address
MapReduce JobHistory Server <