安装完hadoop后,在命令行执行hadoop,会出现如下提示:
Usage: hadoop [--config confdir] COMMAND
where COMMAND is one of:
namenode -format format the DFS filesystem
secondarynamenode run the DFS secondary namenode
namenode run the DFS namenode
datanode run a DFS datanode
dfsadmin run a DFS admin client
mradmin run a Map-Reduce admin client
fsck run a DFS filesystem checking utility
fs run a generic filesystem user client
balancer run a cluster balancing utility
jobtracker run the MapReduce job Tracker node
pipes run a Pipes job
tasktracker run a MapReduce task Tracker node
job manipulate MapReduce jobs
queue get information regarding JobQueues
version print the version
jar <jar> run a jar file
distcp <srcurl> <desturl> copy file or directories recursively
archive -archiveName NAME <src>* <dest> create a hadoop archive
daemonlog get/set the log level for each daemon
or
CLASSNAME run the class named CLASSNAME
Most commands print help when invoked w/o parameters.
基本囊括了hadoop的所有命令,每一个命令会有一个对应的java类作为入口,对应列表如下:
命令 | 功能 | 对应java类入口 |
---|---|---|
namenode | 处理namenode的相关操作 | org.apache.hadoop.hdfs.server.namenode.NameNode |
secondarynamenode | 处理secondarynamenode相关操作 | org.apache.hadoop.hdfs.server.namenode.SecondaryNameNode |
datanode | 处理datanode相关操作 | org.apache.hadoop.hdfs.server.datanode.DataNode |
dfsadmin | 处理dfs管理员客户端操作 | org.apache.hadoop.hdfs.tools.DFSAdmin |
mradmin | 处理MapReduce管理员的客户端操作 | org.apache.hadoop.mapred.tools.MRAdmin |
fsck | 运行DFS文件系统验证工具 | org.apache.hadoop.hdfs.tools.DFSck |
fs | 运行文件系统用户客户端 | org.apache.hadoop.fs.FsShell |
balancer | 执行集群balancing操作 | org.apache.hadoop.hdfs.server.balancer.Balancer |
jobtracker | 操作MapReduce的jobTracker节点 | org.apache.hadoop.mapred.JobTracker |
pipes | 执行pipes任务 | org.apache.hadoop.mapred.pipes.Submitter |
tasktracker | 操作MapReduce任务taskTracker节点 | org.apache.hadoop.mapred.TaskTracker |
job | 操作MapReduce任务 | org.apache.hadoop.mapred.JobClient |
queue | 获取任务队列信息 | org.apache.hadoop.mapred.JobQueueClient |
version | 打印版本信息 | org.apache.hadoop.util.VersionInfo |
jar | 运行jar文件 | org.apache.hadoop.util.RunJar |
distcp | 集群间互相拷贝文件 | org.apache.hadoop.tools.DistCp |
archive | 创建一个hadoop结构 | org.apache.hadoop.tools.HadoopArchives |
daemonlog | 获取或是设置每一个守护进程的日志级别 | org.apache.hadoop.log.LogLevel |