Big Data 2 -- Configuring Hadoop on a Single Machine

1, Pick the Hadoop version you want to install, for example http://hadoop.apache.org/docs/r1.2.1/.

2, Download and extract it: tar -zxvf XXXXX (where XXXXX is the downloaded archive).

3, Install a JDK and ssh (if they are not already installed).

4, Edit conf/hadoop-env.sh and set JAVA_HOME.
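
As a rough illustration of step 4, the relevant line in conf/hadoop-env.sh might look like the following. The JDK path shown here is an assumption, not something taken from the original setup; substitute the directory where your JDK is actually installed.

```shell
# conf/hadoop-env.sh (excerpt)
# Assumption: the JDK is installed under this hypothetical path; adjust it for your machine.
export JAVA_HOME=/usr/lib/jvm/java-7-openjdk-amd64
```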

5, Set up passphraseless ssh

Now check that you can ssh to the localhost without a passphrase:
$ ssh localhost

If you cannot ssh to localhost without a passphrase, execute the following commands:
$ ssh-keygen -t dsa -P '' -f ~/.ssh/id_dsa
$ cat ~/.ssh/id_dsa.pub >> ~/.ssh/authorized_keys
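
The check in step 5 can also be done non-interactively. The sketch below wraps it in a small helper; the function name check_passphraseless_ssh is made up for this example. It relies on the OpenSSH BatchMode option, which makes ssh fail instead of prompting for a password or passphrase. Note that it can fail for other reasons too, for example if the host key for localhost has not been accepted yet.

```shell
# Hypothetical helper: succeeds only if ssh can log in to the host without prompting.
check_passphraseless_ssh() {
  host="${1:-localhost}"
  # BatchMode=yes disables password/passphrase prompts, so ssh fails instead of asking.
  # 'true' is a no-op remote command; only the exit status matters.
  ssh -o BatchMode=yes "$host" true
}

# Usage:
#   check_passphraseless_ssh localhost && echo "passphraseless ssh works"
```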

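6, Run the bundled grep example: copy the conf files to use as input, then find and display every match of the given regular expression. Output is written to the output directory.
$ mkdir input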
$ cp conf/*.xml input
$ bin/hadoop jar hadoop-examples-*.jar grep input output 'dfs[a-z.]+'
$ cat output/*
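
To get a feel for what the grep example computes, the sketch below approximates it with ordinary command-line tools: it extracts every match of the regular expression from the files in a directory and counts how often each distinct match occurs. This is only a local stand-in for illustration, not how Hadoop implements the job; the function name count_matches is hypothetical.

```shell
# Hypothetical helper: count occurrences of each distinct regex match in the files of a directory.
count_matches() {
  dir="$1"
  regex="$2"
  # -o prints only the matched text, -h omits file names, -E enables extended regex.
  # sort | uniq -c counts identical matches; sort -rn puts the most frequent first.
  grep -ohE "$regex" "$dir"/* | sort | uniq -c | sort -rn | awk '{ print $1 "\t" $2 }'
}

# Usage:
#   count_matches input 'dfs[a-z.]+'
```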


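7, The result is shown below. The log contains two local jobs run in sequence: the first searches the input files and saves its output to a temporary grep-temp directory, and the second reads that directory and writes the final result to output.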

14/05/16 07:21:23 INFO util.NativeCodeLoader: Loaded the native-hadoop library
14/05/16 07:21:23 WARN snappy.LoadSnappy: Snappy native library not loaded
14/05/16 07:21:23 INFO mapred.FileInputFormat: Total input paths to process : 7
14/05/16 07:21:24 INFO mapred.JobClient: Running job: job_local1725642642_0001
14/05/16 07:21:24 INFO mapred.LocalJobRunner: Waiting for map tasks
14/05/16 07:21:24 INFO mapred.LocalJobRunner: Starting task: attempt_local1725642642_0001_m_000000_0
14/05/16 07:21:24 INFO util.ProcessTree: setsid exited with exit code 0
14/05/16 07:21:24 INFO mapred.Task:  Using ResourceCalculatorPlugin : org.apache.hadoop.util.LinuxResourceCalculatorPlugin@2ea45536
14/05/16 07:21:24 INFO mapred.MapTask: Processing split: file:/home/wezhao/Downloads/hadoop-1.2.1/input/capacity-scheduler.xml:0+7457
14/05/16 07:21:24 INFO mapred.MapTask: numReduceTasks: 1
14/05/16 07:21:24 INFO mapred.MapTask: io.sort.mb = 100
14/05/16 07:21:24 INFO mapred.MapTask: data buffer = 79691776/99614720
14/05/16 07:21:24 INFO mapred.MapTask: record buffer = 262144/327680
14/05/16 07:21:24 INFO mapred.MapTask: Starting flush of map output
14/05/16 07:21:24 INFO mapred.Task: Task:attempt_local1725642642_0001_m_000000_0 is done. And is in the process of commiting
14/05/16 07:21:24 INFO mapred.LocalJobRunner: file:/home/wezhao/Downloads/hadoop-1.2.1/input/capacity-scheduler.xml:0+7457
14/05/16 07:21:24 INFO mapred.Task: Task 'attempt_local1725642642_0001_m_000000_0' done.
14/05/16 07:21:24 INFO mapred.LocalJobRunner: Finishing task: attempt_local1725642642_0001_m_000000_0
14/05/16 07:21:24 INFO mapred.LocalJobRunner: Starting task: attempt_local1725642642_0001_m_000001_0
14/05/16 07:21:25 INFO mapred.Task:  Using ResourceCalculatorPlugin : org.apache.hadoop.util.LinuxResourceCalculatorPlugin@76933bcb
14/05/16 07:21:25 INFO mapred.MapTask: Processing split: file:/home/wezhao/Downloads/hadoop-1.2.1/input/hadoop-policy.xml:0+4644
14/05/16 07:21:25 INFO mapred.MapTask: numReduceTasks: 1
14/05/16 07:21:25 INFO mapred.MapTask: io.sort.mb = 100
14/05/16 07:21:25 INFO mapred.MapTask: data buffer = 79691776/99614720
14/05/16 07:21:25 INFO mapred.MapTask: record buffer = 262144/327680
14/05/16 07:21:25 INFO mapred.MapTask: Starting flush of map output
14/05/16 07:21:25 INFO mapred.MapTask: Finished spill 0
14/05/16 07:21:25 INFO mapred.Task: Task:attempt_local1725642642_0001_m_000001_0 is done. And is in the process of commiting
14/05/16 07:21:25 INFO mapred.LocalJobRunner: file:/home/wezhao/Downloads/hadoop-1.2.1/input/hadoop-policy.xml:0+4644
14/05/16 07:21:25 INFO mapred.Task: Task 'attempt_local1725642642_0001_m_000001_0' done.
14/05/16 07:21:25 INFO mapred.LocalJobRunner: Finishing task: attempt_local1725642642_0001_m_000001_0
14/05/16 07:21:25 INFO mapred.LocalJobRunner: Starting task: attempt_local1725642642_0001_m_000002_0
14/05/16 07:21:25 INFO mapred.Task:  Using ResourceCalculatorPlugin : org.apache.hadoop.util.LinuxResourceCalculatorPlugin@7ed75415
14/05/16 07:21:25 INFO mapred.MapTask: Processing split: file:/home/wezhao/Downloads/hadoop-1.2.1/input/mapred-queue-acls.xml:0+2033
14/05/16 07:21:25 INFO mapred.MapTask: numReduceTasks: 1
14/05/16 07:21:25 INFO mapred.MapTask: io.sort.mb = 100
14/05/16 07:21:25 INFO mapred.MapTask: data buffer = 79691776/99614720
14/05/16 07:21:25 INFO mapred.MapTask: record buffer = 262144/327680
14/05/16 07:21:25 INFO mapred.MapTask: Starting flush of map output
14/05/16 07:21:25 INFO mapred.Task: Task:attempt_local1725642642_0001_m_000002_0 is done. And is in the process of commiting
14/05/16 07:21:25 INFO mapred.LocalJobRunner: file:/home/wezhao/Downloads/hadoop-1.2.1/input/mapred-queue-acls.xml:0+2033
14/05/16 07:21:25 INFO mapred.Task: Task 'attempt_local1725642642_0001_m_000002_0' done.
14/05/16 07:21:25 INFO mapred.LocalJobRunner: Finishing task: attempt_local1725642642_0001_m_000002_0
14/05/16 07:21:25 INFO mapred.LocalJobRunner: Starting task: attempt_local1725642642_0001_m_000003_0
14/05/16 07:21:25 INFO mapred.Task:  Using ResourceCalculatorPlugin : org.apache.hadoop.util.LinuxResourceCalculatorPlugin@561777b1
14/05/16 07:21:25 INFO mapred.MapTask: Processing split: file:/home/wezhao/Downloads/hadoop-1.2.1/input/fair-scheduler.xml:0+327
14/05/16 07:21:25 INFO mapred.MapTask: numReduceTasks: 1
14/05/16 07:21:25 INFO mapred.MapTask: io.sort.mb = 100
14/05/16 07:21:25 INFO mapred.MapTask: data buffer = 79691776/99614720
14/05/16 07:21:25 INFO mapred.MapTask: record buffer = 262144/327680
14/05/16 07:21:25 INFO mapred.MapTask: Starting flush of map output
14/05/16 07:21:25 INFO mapred.JobClient:  map 42% reduce 0%
14/05/16 07:21:25 INFO mapred.Task: Task:attempt_local1725642642_0001_m_000003_0 is done. And is in the process of commiting
14/05/16 07:21:25 INFO mapred.LocalJobRunner: file:/home/wezhao/Downloads/hadoop-1.2.1/input/fair-scheduler.xml:0+327
14/05/16 07:21:25 INFO mapred.Task: Task 'attempt_local1725642642_0001_m_000003_0' done.
14/05/16 07:21:25 INFO mapred.LocalJobRunner: Finishing task: attempt_local1725642642_0001_m_000003_0
14/05/16 07:21:25 INFO mapred.LocalJobRunner: Starting task: attempt_local1725642642_0001_m_000004_0
14/05/16 07:21:25 INFO mapred.Task:  Using ResourceCalculatorPlugin : org.apache.hadoop.util.LinuxResourceCalculatorPlugin@7c9e67a
14/05/16 07:21:25 INFO mapred.MapTask: Processing split: file:/home/wezhao/Downloads/hadoop-1.2.1/input/core-site.xml:0+296
14/05/16 07:21:25 INFO mapred.MapTask: numReduceTasks: 1
14/05/16 07:21:25 INFO mapred.MapTask: io.sort.mb = 100
14/05/16 07:21:25 INFO mapred.MapTask: data buffer = 79691776/99614720
14/05/16 07:21:25 INFO mapred.MapTask: record buffer = 262144/327680
14/05/16 07:21:25 INFO mapred.MapTask: Starting flush of map output
14/05/16 07:21:25 INFO mapred.Task: Task:attempt_local1725642642_0001_m_000004_0 is done. And is in the process of commiting
14/05/16 07:21:25 INFO mapred.LocalJobRunner: file:/home/wezhao/Downloads/hadoop-1.2.1/input/core-site.xml:0+296
14/05/16 07:21:25 INFO mapred.Task: Task 'attempt_local1725642642_0001_m_000004_0' done.
14/05/16 07:21:25 INFO mapred.LocalJobRunner: Finishing task: attempt_local1725642642_0001_m_000004_0
14/05/16 07:21:25 INFO mapred.LocalJobRunner: Starting task: attempt_local1725642642_0001_m_000005_0
14/05/16 07:21:25 INFO mapred.Task:  Using ResourceCalculatorPlugin : org.apache.hadoop.util.LinuxResourceCalculatorPlugin@7fd88db7
14/05/16 07:21:25 INFO mapred.MapTask: Processing split: file:/home/wezhao/Downloads/hadoop-1.2.1/input/mapred-site.xml:0+292
14/05/16 07:21:25 INFO mapred.MapTask: numReduceTasks: 1
14/05/16 07:21:25 INFO mapred.MapTask: io.sort.mb = 100
14/05/16 07:21:25 INFO mapred.MapTask: data buffer = 79691776/99614720
14/05/16 07:21:25 INFO mapred.MapTask: record buffer = 262144/327680
14/05/16 07:21:25 INFO mapred.MapTask: Starting flush of map output
14/05/16 07:21:25 INFO mapred.Task: Task:attempt_local1725642642_0001_m_000005_0 is done. And is in the process of commiting
14/05/16 07:21:25 INFO mapred.LocalJobRunner: file:/home/wezhao/Downloads/hadoop-1.2.1/input/mapred-site.xml:0+292
14/05/16 07:21:25 INFO mapred.Task: Task 'attempt_local1725642642_0001_m_000005_0' done.
14/05/16 07:21:25 INFO mapred.LocalJobRunner: Finishing task: attempt_local1725642642_0001_m_000005_0
14/05/16 07:21:25 INFO mapred.LocalJobRunner: Starting task: attempt_local1725642642_0001_m_000006_0
14/05/16 07:21:25 INFO mapred.Task:  Using ResourceCalculatorPlugin : org.apache.hadoop.util.LinuxResourceCalculatorPlugin@6d7e845a
14/05/16 07:21:25 INFO mapred.MapTask: Processing split: file:/home/wezhao/Downloads/hadoop-1.2.1/input/hdfs-site.xml:0+274
14/05/16 07:21:25 INFO mapred.MapTask: numReduceTasks: 1
14/05/16 07:21:25 INFO mapred.MapTask: io.sort.mb = 100
14/05/16 07:21:25 INFO mapred.MapTask: data buffer = 79691776/99614720
14/05/16 07:21:25 INFO mapred.MapTask: record buffer = 262144/327680
14/05/16 07:21:25 INFO mapred.MapTask: Starting flush of map output
14/05/16 07:21:25 INFO mapred.MapTask: Finished spill 0
14/05/16 07:21:25 INFO mapred.Task: Task:attempt_local1725642642_0001_m_000006_0 is done. And is in the process of commiting
14/05/16 07:21:25 INFO mapred.LocalJobRunner: file:/home/wezhao/Downloads/hadoop-1.2.1/input/hdfs-site.xml:0+274
14/05/16 07:21:25 INFO mapred.Task: Task 'attempt_local1725642642_0001_m_000006_0' done.
14/05/16 07:21:25 INFO mapred.LocalJobRunner: Finishing task: attempt_local1725642642_0001_m_000006_0
14/05/16 07:21:25 INFO mapred.LocalJobRunner: Map task executor complete.
14/05/16 07:21:25 INFO mapred.Task:  Using ResourceCalculatorPlugin : org.apache.hadoop.util.LinuxResourceCalculatorPlugin@18987a33
14/05/16 07:21:25 INFO mapred.LocalJobRunner:
14/05/16 07:21:25 INFO mapred.Merger: Merging 7 sorted segments
14/05/16 07:21:25 INFO mapred.Merger: Down to the last merge-pass, with 2 segments left of total size: 49 bytes
14/05/16 07:21:25 INFO mapred.LocalJobRunner:
14/05/16 07:21:25 INFO mapred.Task: Task:attempt_local1725642642_0001_r_000000_0 is done. And is in the process of commiting
14/05/16 07:21:25 INFO mapred.LocalJobRunner:
14/05/16 07:21:25 INFO mapred.Task: Task attempt_local1725642642_0001_r_000000_0 is allowed to commit now
14/05/16 07:21:25 INFO mapred.FileOutputCommitter: Saved output of task 'attempt_local1725642642_0001_r_000000_0' to file:/home/wezhao/Downloads/hadoop-1.2.1/grep-temp-1256449998
14/05/16 07:21:25 INFO mapred.LocalJobRunner: reduce > reduce
14/05/16 07:21:25 INFO mapred.Task: Task 'attempt_local1725642642_0001_r_000000_0' done.
14/05/16 07:21:26 INFO mapred.JobClient:  map 100% reduce 100%
14/05/16 07:21:26 INFO mapred.JobClient: Job complete: job_local1725642642_0001
14/05/16 07:21:26 INFO mapred.JobClient: Counters: 21
14/05/16 07:21:26 INFO mapred.JobClient:   File Input Format Counters
14/05/16 07:21:26 INFO mapred.JobClient:     Bytes Read=15323
14/05/16 07:21:26 INFO mapred.JobClient:   File Output Format Counters
14/05/16 07:21:26 INFO mapred.JobClient:     Bytes Written=155
14/05/16 07:21:26 INFO mapred.JobClient:   FileSystemCounters
14/05/16 07:21:26 INFO mapred.JobClient:     FILE_BYTES_READ=1278506
14/05/16 07:21:26 INFO mapred.JobClient:     FILE_BYTES_WRITTEN=1572630
14/05/16 07:21:26 INFO mapred.JobClient:   Map-Reduce Framework
14/05/16 07:21:26 INFO mapred.JobClient:     Map output materialized bytes=87
14/05/16 07:21:26 INFO mapred.JobClient:     Map input records=378
14/05/16 07:21:26 INFO mapred.JobClient:     Reduce shuffle bytes=0
14/05/16 07:21:26 INFO mapred.JobClient:     Spilled Records=4
14/05/16 07:21:26 INFO mapred.JobClient:     Map output bytes=41
14/05/16 07:21:26 INFO mapred.JobClient:     Total committed heap usage (bytes)=1204920320
14/05/16 07:21:26 INFO mapred.JobClient:     CPU time spent (ms)=0
14/05/16 07:21:26 INFO mapred.JobClient:     Map input bytes=15323
14/05/16 07:21:26 INFO mapred.JobClient:     SPLIT_RAW_BYTES=819
14/05/16 07:21:26 INFO mapred.JobClient:     Combine input records=2
14/05/16 07:21:26 INFO mapred.JobClient:     Reduce input records=2
14/05/16 07:21:26 INFO mapred.JobClient:     Reduce input groups=2
14/05/16 07:21:26 INFO mapred.JobClient:     Combine output records=2
14/05/16 07:21:26 INFO mapred.JobClient:     Physical memory (bytes) snapshot=0
14/05/16 07:21:26 INFO mapred.JobClient:     Reduce output records=2
14/05/16 07:21:26 INFO mapred.JobClient:     Virtual memory (bytes) snapshot=0
14/05/16 07:21:26 INFO mapred.JobClient:     Map output records=2
14/05/16 07:21:26 INFO mapred.FileInputFormat: Total input paths to process : 1
14/05/16 07:21:26 INFO mapred.JobClient: Running job: job_local162864386_0002
14/05/16 07:21:26 INFO mapred.LocalJobRunner: Waiting for map tasks
14/05/16 07:21:26 INFO mapred.LocalJobRunner: Starting task: attempt_local162864386_0002_m_000000_0
14/05/16 07:21:26 INFO mapred.Task:  Using ResourceCalculatorPlugin : org.apache.hadoop.util.LinuxResourceCalculatorPlugin@10f102d3
14/05/16 07:21:26 INFO mapred.MapTask: Processing split: file:/home/wezhao/Downloads/hadoop-1.2.1/grep-temp-1256449998/part-00000:0+143
14/05/16 07:21:26 INFO mapred.MapTask: numReduceTasks: 1
14/05/16 07:21:26 INFO mapred.MapTask: io.sort.mb = 100
14/05/16 07:21:26 INFO mapred.MapTask: data buffer = 79691776/99614720
14/05/16 07:21:26 INFO mapred.MapTask: record buffer = 262144/327680
14/05/16 07:21:26 INFO mapred.MapTask: Starting flush of map output
14/05/16 07:21:26 INFO mapred.MapTask: Finished spill 0
14/05/16 07:21:26 INFO mapred.Task: Task:attempt_local162864386_0002_m_000000_0 is done. And is in the process of commiting
14/05/16 07:21:26 INFO mapred.LocalJobRunner: file:/home/wezhao/Downloads/hadoop-1.2.1/grep-temp-1256449998/part-00000:0+143
14/05/16 07:21:26 INFO mapred.Task: Task 'attempt_local162864386_0002_m_000000_0' done.
14/05/16 07:21:26 INFO mapred.LocalJobRunner: Finishing task: attempt_local162864386_0002_m_000000_0
14/05/16 07:21:26 INFO mapred.LocalJobRunner: Map task executor complete.
14/05/16 07:21:26 INFO mapred.Task:  Using ResourceCalculatorPlugin : org.apache.hadoop.util.LinuxResourceCalculatorPlugin@198e261d
14/05/16 07:21:26 INFO mapred.LocalJobRunner:
14/05/16 07:21:26 INFO mapred.Merger: Merging 1 sorted segments
14/05/16 07:21:26 INFO mapred.Merger: Down to the last merge-pass, with 1 segments left of total size: 47 bytes
14/05/16 07:21:26 INFO mapred.LocalJobRunner:
14/05/16 07:21:26 INFO mapred.Task: Task:attempt_local162864386_0002_r_000000_0 is done. And is in the process of commiting
14/05/16 07:21:26 INFO mapred.LocalJobRunner:
14/05/16 07:21:26 INFO mapred.Task: Task attempt_local162864386_0002_r_000000_0 is allowed to commit now
14/05/16 07:21:26 INFO mapred.FileOutputCommitter: Saved output of task 'attempt_local162864386_0002_r_000000_0' to file:/home/wezhao/Downloads/hadoop-1.2.1/output
14/05/16 07:21:26 INFO mapred.LocalJobRunner: reduce > reduce
14/05/16 07:21:26 INFO mapred.Task: Task 'attempt_local162864386_0002_r_000000_0' done.
14/05/16 07:21:27 INFO mapred.JobClient:  map 100% reduce 100%
14/05/16 07:21:27 INFO mapred.JobClient: Job complete: job_local162864386_0002
14/05/16 07:21:27 INFO mapred.JobClient: Counters: 21
14/05/16 07:21:27 INFO mapred.JobClient:   File Input Format Counters
14/05/16 07:21:27 INFO mapred.JobClient:     Bytes Read=155
14/05/16 07:21:27 INFO mapred.JobClient:   File Output Format Counters
14/05/16 07:21:27 INFO mapred.JobClient:     Bytes Written=41
14/05/16 07:21:27 INFO mapred.JobClient:   FileSystemCounters
14/05/16 07:21:27 INFO mapred.JobClient:     FILE_BYTES_READ=612459
14/05/16 07:21:27 INFO mapred.JobClient:     FILE_BYTES_WRITTEN=783045
14/05/16 07:21:27 INFO mapred.JobClient:   Map-Reduce Framework
14/05/16 07:21:27 INFO mapred.JobClient:     Map output materialized bytes=51
14/05/16 07:21:27 INFO mapred.JobClient:     Map input records=2
14/05/16 07:21:27 INFO mapred.JobClient:     Reduce shuffle bytes=0
14/05/16 07:21:27 INFO mapred.JobClient:     Spilled Records=4
14/05/16 07:21:27 INFO mapred.JobClient:     Map output bytes=41
14/05/16 07:21:27 INFO mapred.JobClient:     Total committed heap usage (bytes)=262946816
14/05/16 07:21:27 INFO mapred.JobClient:     CPU time spent (ms)=0
14/05/16 07:21:27 INFO mapred.JobClient:     Map input bytes=57
14/05/16 07:21:27 INFO mapred.JobClient:     SPLIT_RAW_BYTES=125
14/05/16 07:21:27 INFO mapred.JobClient:     Combine input records=0
14/05/16 07:21:27 INFO mapred.JobClient:     Reduce input records=2
14/05/16 07:21:27 INFO mapred.JobClient:     Reduce input groups=1
14/05/16 07:21:27 INFO mapred.JobClient:     Combine output records=0
14/05/16 07:21:27 INFO mapred.JobClient:     Physical memory (bytes) snapshot=0
14/05/16 07:21:27 INFO mapred.JobClient:     Reduce output records=2
14/05/16 07:21:27 INFO mapred.JobClient:     Virtual memory (bytes) snapshot=0
14/05/16 07:21:27 INFO mapred.JobClient:     Map output records=2
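
A log this long is easier to skim once filtered. As a small sketch, assuming the console output was saved to a file (the file name run.log and the helper name summarize_job_log are both made up for this example), the following prints only the lines that report where each task saved its output and when each job completed.

```shell
# Hypothetical helper: print only the job-completion and output-location lines of a saved log.
summarize_job_log() {
  grep -E 'Job complete:|Saved output of task' "$1"
}

# Usage (redirect both stdout and stderr so that all log lines end up in the file):
#   bin/hadoop jar hadoop-examples-*.jar grep input output 'dfs[a-z.]+' > run.log 2>&1
#   summarize_job_log run.log
```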


References:

1, https://www.ibm.com/developerworks/community/blogs/theTechTrek/entry/test_driving_apache_hadoop_standalone_pseudo_distributed_mode1?lang=en

2, http://hadoop.apache.org/docs/r1.2.1/single_node_setup.html


