Running Hadoop On Ubuntu Linux (Single-Node Cluster) @ Michael G. Noll
Running Hadoop On Ubuntu Linux (Single-Node Cluster)
by Michael G. Noll
on August 5, 2007 (last updated: January 19, 2012)In this tutorial, I will describe how to setup a single-node Hadoop cluster.
Table of Contents:
- What we want to do
- Prerequisites
- Sun Java 6
- Adding a dedicated Hadoop system user
- Configuring SSH
- Disabling IPv6
- Alternative
- Hadoop
- Installation
- Update $HOME/.bashrc
- Excursus: Hadoop Distributed File System (HDFS)
- Configuration
- hadoop-env.sh
- conf/*-site.xml
- Formatting the HDFS filesystem via the NameNode
- Starting your single-node cluster
- Stopping your single-node cluster
- Running a MapReduce job
- Download example input data
- Restart the Hadoop cluster
- Copy local example data to HDFS
- Run the MapReduce job
- Retrieve the job result from HDFS
- Hadoop Web Interfaces
- MapReduce Job Tracker Web Interface
- Task Tracker Web Interface
- HDFS Name Node Web Interface
- What’s next?
- Related Links
- Change Log
- Comments (339)