Deploying a Hadoop Cluster with OpenStack Savanna

1. Deploy the OpenStack environment and install the core modules (keystone/glance/nova/neutron/horizon)

Deploying with RDO's packstack is quick; follow the steps at the link below. If an error occurs partway through, the command can simply be re-run.

http://openstack.redhat.com/QuickStartLatest

To use neutron instead of nova-network, run the following command:

packstack --allinone --os-neutron-install=y
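
Once packstack finishes, it is worth a quick sanity check that the services came up. This is a suggested extra step, not part of the original guide; it assumes the keystonerc_admin credentials file that packstack writes to /root, and the openstack-status tool from the openstack-utils package:
# source /root/keystonerc_admin
# openstack-status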

2. After the deployment above succeeds, install Savanna. (When I deployed Savanna the command line client was not yet fully supported, so I used the dashboard.)

See https://savanna.readthedocs.org/en/latest/horizon/installation.guide.html

To install with RDO:

  1. Start by following the Quickstart to install and set up OpenStack.
  2. Install the savanna-api service:
$ yum install openstack-savanna
  3. Configure the savanna-api service to your liking. The configuration file is located in /etc/savanna/savanna.conf.
  4. Start the savanna-api service:
$ service openstack-savanna-api start
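
To have savanna-api come back after a reboot, it can also be enabled at boot. (A suggested extra step; chkconfig applies to the SysV-init RHEL/CentOS releases RDO targeted at the time, while systemd-based systems would use systemctl enable instead.)
$ chkconfig openstack-savanna-api on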

To install into a virtual environment (installing Savanna from a tarball):

  1. First you need to install python-setuptools, python-virtualenv and the Python headers using your OS package manager. The Python headers package name depends on the OS: for Ubuntu it is python-dev, for Red Hat it is python-devel.

For Fedora:

$ sudo yum install gcc python-setuptools python-virtualenv python-devel
  2. Set up a virtual environment for Savanna:
$ virtualenv savanna-venv
This installs a Python virtual environment into the savanna-venv directory in your current working directory. The command does not require superuser privileges and can be executed in any directory the current user has write permission for.
  3. You can install the latest Savanna release version from PyPI:
$ savanna-venv/bin/pip install savanna
Or you can get the Savanna archive from http://tarballs.openstack.org/savanna/ and install it using pip:
$ savanna-venv/bin/pip install 'http://tarballs.openstack.org/savanna/savanna-master.tar.gz'
Note that savanna-master.tar.gz contains the latest changes and might not be stable at the moment. We recommend browsing http://tarballs.openstack.org/savanna/ and selecting the latest stable release.
  4. After installation you should create a configuration file. The sample config file location depends on your OS: for Ubuntu it is /usr/local/share/savanna/savanna.conf.sample, for Red Hat it is /usr/share/savanna/savanna.conf.sample. Below is an example for Ubuntu:
$ mkdir savanna-venv/etc
$ cp savanna-venv/share/savanna/savanna.conf.sample savanna-venv/etc/savanna.conf
Check each option in savanna-venv/etc/savanna.conf and make the necessary changes.
  5. To start Savanna, call:
$ savanna-venv/bin/python savanna-venv/bin/savanna-api --config-file savanna-venv/etc/savanna.conf
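
As a quick check that the API came up, the root endpoint should answer on the port configured in savanna.conf (8386 by default). Version listings like this are typically served unauthenticated in OpenStack services, though the exact response body may vary:
$ curl http://localhost:8386/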

3. Configure Savanna

[root@xianghui workplace]# vi /etc/savanna/savanna.conf
[DEFAULT]
port=8386
os_auth_host=127.0.0.1
os_auth_port=35357
os_admin_username=admin
os_admin_password=openstack1
os_admin_tenant_name=service
use_floating_ips=false
use_neutron=true
debug=true
verbose=true
log_file=savanna.log
log_dir=/var/log/savanna/
plugins=vanilla,hdp
[plugin:vanilla]
plugin_class=savanna.plugins.vanilla.plugin:VanillaProvider
[plugin:hdp]
plugin_class=savanna.plugins.hdp.ambariplugin:AmbariPlugin
[database]
#connection=sqlite:savanna/openstack/common/db/$sqlite_db
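
With the connection line commented out, Savanna falls back to its built-in sqlite database. To use MySQL instead, an SQLAlchemy-style URL of this shape would go under [database] (the credentials here are hypothetical, shown only as an illustration):
connection=mysql://savanna:<password>@localhost/savanna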

Create the log_dir:

# mkdir /var/log/savanna

Create the savanna.log file and make the savanna user its owner:

# vi /var/log/savanna/savanna.log 
# chown savanna:savanna /var/log/savanna/savanna.log

After the configuration is complete, restart Savanna.
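
# service openstack-savanna-api restart

To confirm the configuration took effect, ask the API for its plugin list; it should return the vanilla and hdp plugins enabled above. (An illustration only: <tenant-id> is a placeholder, and the token is fetched with the keystone CLI of that era.)
# TOKEN=$(keystone token-get | awk '/ id / {print $4}')
# curl -H "X-Auth-Token: $TOKEN" http://localhost:8386/v1.1/<tenant-id>/plugins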

4. Configure the Savanna UI

  1. Go to the machine where the Dashboard resides and install the Savanna UI:

    For RDO:

$ sudo yum install python-django-savanna
Otherwise:
$ sudo pip install savanna-dashboard
This will install the latest stable release of the Savanna UI. If you want to install the master branch of the Savanna UI:
$ sudo pip install 'http://tarballs.openstack.org/savanna-dashboard/savanna-dashboard-master.tar.gz'
  2. Configure the OpenStack Dashboard. In settings.py, add savanna to


HORIZON_CONFIG = {
    'dashboards': ('nova', 'syspanel', 'settings', ..., 'savanna'),
and also add savannadashboard to
INSTALLED_APPS = (
    'savannadashboard',
    ....
Note: the settings.py file is located in /usr/share/openstack-dashboard/openstack_dashboard/ by default.
  3. You also have to specify SAVANNA_URL in local_settings.py. For example:
SAVANNA_URL = 'http://localhost:8386/v1.1'

If you are using Neutron instead of Nova Network:

SAVANNA_USE_NEUTRON = True
Note: For RDO, the local_settings.py file is located in /etc/openstack-dashboard/, otherwise it is in /usr/share/openstack-dashboard/openstack_dashboard/local/.

Reload the web server to apply the changes:
$ sudo service httpd reload
You can check that the service has started successfully: go to the Horizon URL, and if the installation is correct you will be able to see the Savanna tab.
5. Download a pre-built image and upload it to Glance


You can download pre-built images with vanilla Apache Hadoop, or build these images yourself:

Download the pre-built Fedora 19 image and upload it:

$ wget http://savanna-files.mirantis.com/savanna-0.3-vanilla-1.2.1-fedora-19.qcow2
$ glance image-create --name=savanna-0.3-vanilla-1.2.1-fedora-19 \
  --disk-format=qcow2 --container-format=bare < ./savanna-0.3-vanilla-1.2.1-fedora-19.qcow2

After logging into the dashboard you will find a new "Savanna" page; the Plugins panel shows that two plugin types are currently supported: vanilla and hdp.


After registering the pre-built image with Savanna through the dashboard, the result looks like this:

Note the tags: '["vanilla", "1.2.1", "fedora"]'
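
The same registration can be done from the command line. Savanna 0.3 keeps the tags and the login user as Glance image properties; a sketch, assuming the _savanna_tag_*/_savanna_username property naming of that release:
$ glance image-update savanna-0.3-vanilla-1.2.1-fedora-19 \
    --property _savanna_tag_vanilla=True \
    --property _savanna_tag_1.2.1=True \
    --property _savanna_username=fedora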

Create the node group templates (data node and name node templates); a REST sketch is shown below:
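
The dashboard forms map onto the REST API. A minimal sketch for the data node group, reusing $TOKEN from above, with <tenant-id> and the flavor id as placeholders:
$ curl -X POST -H "X-Auth-Token: $TOKEN" -H "Content-Type: application/json" \
    http://localhost:8386/v1.1/<tenant-id>/node-group-templates \
    -d '{"name": "data-node-group", "plugin_name": "vanilla", "hadoop_version": "1.2.1",
         "flavor_id": "2", "node_processes": ["datanode", "tasktracker"]}'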

Create a cluster template:



Create the cluster:



6. Run a job

The UI seems to support only Swift as a data source, but since I had not installed or configured Swift, I decided to simply run against HDFS.

The list below shows the three VMs spawned for the cluster. After each VM boots, a cloud-init script automatically applies the Hadoop configuration prepared by the vanilla plugin, so there is no need to configure every VM by hand.

[root@xianghui ~]# nova list
+--------------------------------------+---------------------------------------+--------+------------+-------------+---------------------------------+
| ID                                   | Name                                  | Status | Task State | Power State | Networks                        |
+--------------------------------------+---------------------------------------+--------+------------+-------------+---------------------------------+
| 0019f2e8-9450-45e7-9455-44f51d4029b8 | test-1-DataNodeGroup-001              | ACTIVE | None       | Running     | flat-80=80.0.0.6                |
| 1cd12dc5-86f5-4e06-bf1f-fd7635dca032 | test-1-DataNodeGroup-002              | ACTIVE | None       | Running     | flat-80=80.0.0.7                |
| e746f0ba-b297-4e07-b430-85840b71fa53 | test-1-NameNodeGroup-001              | ACTIVE | None       | Running     | flat-80=80.0.0.3                |
+--------------------------------------+---------------------------------------+--------+------------+-------------+---------------------------------+

Because an SSH key was configured, no password is needed; log in directly:

[root@xianghui ~]# ssh fedora@80.0.0.3
Last login: Wed Oct 30 09:48:07 2013 from 80.0.0.2
[fedora@test-1-namenodegroup-001 ~]$ sudo -i
[root@test-1-namenodegroup-001 ~]# whereis hadoop
hadoop: /usr/bin/hadoop /etc/hadoop /usr/etc/hadoop /usr/include/hadoop /usr/share/hadoop
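
Before running anything it is worth confirming that both data nodes registered with the name node; dfsadmin -report (standard in Hadoop 1.x) prints the live datanode count:
# hadoop dfsadmin -report | grep -i 'datanodes available'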

Create an input file to use as the job's input:

[root@test-1-namenodegroup-001 ~]# cat input
dfs
dfjalkfd
dkaljf
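
wordcount reads its input from HDFS, so the local file has to be copied in first (this step is implied rather than shown in the original session):
# hadoop fs -put input input
# hadoop fs -ls input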

Run the wordcount job from the bundled examples jar. The result is 3 (the input has three distinct words), which is correct!

[root@test-1-namenodegroup-001 ~]# hadoop jar /usr/share/hadoop/hadoop-examples-1.2.1.jar wordcount input test_output
13/11/14 06:15:34 INFO mapred.JobClient: Running job: job_201310210945_0015
13/11/14 06:15:35 INFO mapred.JobClient:  map 0% reduce 0%
13/11/14 06:15:50 INFO mapred.JobClient:  map 100% reduce 0%
13/11/14 06:16:01 INFO mapred.JobClient:  map 100% reduce 33%
13/11/14 06:16:03 INFO mapred.JobClient:  map 100% reduce 100%
13/11/14 06:16:05 INFO mapred.JobClient: Job complete: job_201310210945_0015
13/11/14 06:16:05 INFO mapred.JobClient: Counters: 29
13/11/14 06:16:05 INFO mapred.JobClient:   Job Counters
13/11/14 06:16:05 INFO mapred.JobClient:     Launched reduce tasks=1
13/11/14 06:16:05 INFO mapred.JobClient:     SLOTS_MILLIS_MAPS=15287
13/11/14 06:16:05 INFO mapred.JobClient:     Total time spent by all reduces waiting after reserving slots (ms)=0
13/11/14 06:16:05 INFO mapred.JobClient:     Total time spent by all maps waiting after reserving slots (ms)=0
13/11/14 06:16:05 INFO mapred.JobClient:     Launched map tasks=1
13/11/14 06:16:05 INFO mapred.JobClient:     Data-local map tasks=1
13/11/14 06:16:05 INFO mapred.JobClient:     SLOTS_MILLIS_REDUCES=12207
13/11/14 06:16:05 INFO mapred.JobClient:   File Output Format Counters
13/11/14 06:16:05 INFO mapred.JobClient:     Bytes Written=26
13/11/14 06:16:05 INFO mapred.JobClient:   FileSystemCounters
13/11/14 06:16:05 INFO mapred.JobClient:     FILE_BYTES_READ=44
13/11/14 06:16:05 INFO mapred.JobClient:     HDFS_BYTES_READ=138
13/11/14 06:16:05 INFO mapred.JobClient:     FILE_BYTES_WRITTEN=110578
13/11/14 06:16:05 INFO mapred.JobClient:     HDFS_BYTES_WRITTEN=26
13/11/14 06:16:05 INFO mapred.JobClient:   File Input Format Counters
13/11/14 06:16:05 INFO mapred.JobClient:     Bytes Read=21
13/11/14 06:16:05 INFO mapred.JobClient:   Map-Reduce Framework
13/11/14 06:16:05 INFO mapred.JobClient:     Map output materialized bytes=44
13/11/14 06:16:05 INFO mapred.JobClient:     Map input records=4
13/11/14 06:16:05 INFO mapred.JobClient:     Reduce shuffle bytes=44
13/11/14 06:16:05 INFO mapred.JobClient:     Spilled Records=6
13/11/14 06:16:05 INFO mapred.JobClient:     Map output bytes=32
13/11/14 06:16:05 INFO mapred.JobClient:     Total committed heap usage (bytes)=163254272
13/11/14 06:16:05 INFO mapred.JobClient:     CPU time spent (ms)=2580
13/11/14 06:16:05 INFO mapred.JobClient:     Combine input records=3
13/11/14 06:16:05 INFO mapred.JobClient:     SPLIT_RAW_BYTES=117
13/11/14 06:16:05 INFO mapred.JobClient:     Reduce input records=3
13/11/14 06:16:05 INFO mapred.JobClient:     Reduce input groups=3
13/11/14 06:16:05 INFO mapred.JobClient:     Combine output records=3
13/11/14 06:16:05 INFO mapred.JobClient:     Physical memory (bytes) snapshot=238972928
13/11/14 06:16:05 INFO mapred.JobClient:     Reduce output records=3
13/11/14 06:16:05 INFO mapred.JobClient:     Virtual memory (bytes) snapshot=1652383744
13/11/14 06:16:05 INFO mapred.JobClient:     Map output records=3
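
To see the actual word counts rather than just the counters, read the reducer output back from HDFS (the part-* wildcard avoids assuming the exact output file name):
# hadoop fs -cat test_output/part-*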




