Running the Cloudera Training VM in VirtualBox (Repost)

Reposted 2011-01-11 23:34:00

Cloudera’s Training VM is one of the most popular resources on our website. It was created with VMware Workstation, and plays nicely with the VMware Player for Windows, Linux, and Mac. But VMware isn’t for everyone. Thomas Lockney has managed to get our VM image running on VirtualBox, and has written a step-by-step guide for the community. Thanks Thomas! – Christophe

I was quite pleased when I discovered that Cloudera had created a virtual machine image that could be used while working through their training material. It would make the process simpler, and it looked like a potentially useful environment for general Hadoop experimentation. However, their VM is built for VMware, which I stopped using a while back. As a heavy VirtualBox user, though, I knew that it would not be hard to get it running in my preferred desktop virtualization environment.

Here’s a step-by-step guide for getting Cloudera’s virtual machine image up and running. I’ll include screenshots for most of the steps to make it as clear as possible. I’ll assume you already have at least some familiarity with running VirtualBox (if not, there are plenty of good tutorials and references available online) and some experience with Ubuntu or some other fairly modern Linux desktop system.

1. The first step is to download the virtual machine from the Hadoop Training Virtual Machine page on Cloudera’s site. The version at the time of this writing is 0.3.1, and the filename you’ll end up with is cloudera-training-0.3.1.tar.bz2. Once you have downloaded the file (this may take a little while, since it’s quite large), decompress it somewhere useful. On a Unix-based machine (e.g., Linux or OS X), you can do this by running the following command:

tar xjf cloudera-training-0.3.1.tar.bz2

2. Next, start up VirtualBox. Once it loads, go to the File menu and select Virtual Media Manager.

3. The Virtual Media Manager is where you set up new drive images. An image needs to be created before you can use it with a virtual machine. In this case, you’re creating a new image by pointing to the existing image, which was supplied with the Cloudera VM download. It’s a VMware image (a .vmdk file), which VirtualBox can read.

In the Virtual Media Manager window, click New to create a new image.

4. In the file dialog box that appears, browse to the directory where you extracted the download and select the file cloudera-training-0.2-cl3.vmdk. Please note that this name will likely change with later releases, so you might need to experiment to find the right file. If that is the case, you’ll be looking for files ending in .vmdk. Note that files with the s00# names are generally either snapshots or extensions to the base drive image (you can choose to have the image split up into multiple files).
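If the file name has changed in your download, a quick search of the extracted directory can narrow down the candidates. This is only a minimal sketch: the function name find_base_vmdk is made up for illustration, and it assumes the split pieces follow the usual VMware naming of name-s001.vmdk, name-s002.vmdk, and so on.

```shell
#!/bin/sh
# List candidate base disk images under a directory (default: current
# directory), skipping split extent files such as name-s001.vmdk.
# Assumption: extent files end in "-s" plus three digits.
find_base_vmdk() {
    find "${1:-.}" -type f -name '*.vmdk' ! -name '*-s[0-9][0-9][0-9].vmdk' | sort
}

find_base_vmdk .
```

Whatever it prints is a candidate to select in the file dialog. If more than one file shows up, the one without a snapshot-style suffix is probably the base image.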


5. After closing the Virtual Media Manager window, click the New button in the main VirtualBox window to create a new virtual machine.

6. From the Create New Virtual Machine dialog box, give your new machine a name. Select Linux as the operating system and Ubuntu as the version.

7. On the next screen, set the memory size. The VMware image that Cloudera created has 1024 MB assigned, but I’ve found I can get away with less for basic needs. If you plan to do full development in this VM, set it higher (if you have the space to spare).

8. Next, you’ll select the hard disk image, which we added earlier.

9. Double check the summary before clicking Finish.

10. After closing the Virtual Machine Wizard, you can select the Cloudera machine that you just created and click Start.
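If you prefer the command line, steps 5 through 10 can probably be approximated with VBoxManage. The sketch below only prints the commands so you can review them first. It assumes the VBoxManage syntax of VirtualBox 4 and later (older releases used different option names), and the VM name, the controller name "IDE", and the helper name vbox_commands are all illustrative choices of mine, not something the wizard dictates.

```shell
#!/bin/sh
# Print VBoxManage commands that roughly mirror the wizard steps:
# create and register the VM, set memory, attach the existing .vmdk, start it.
# Assumptions: VirtualBox 4+ option syntax; an IDE controller named "IDE".
vbox_commands() {
    vm_name=$1
    disk=$2
    mem_mb=${3:-1024}   # 1024 MB matches what the VMware image was given
    cat <<EOF
VBoxManage createvm --name "$vm_name" --ostype Ubuntu --register
VBoxManage modifyvm "$vm_name" --memory $mem_mb
VBoxManage storagectl "$vm_name" --name IDE --add ide
VBoxManage storageattach "$vm_name" --storagectl IDE --port 0 --device 0 --type hdd --medium "$disk"
VBoxManage startvm "$vm_name"
EOF
}

vbox_commands "cloudera-training" "cloudera-training-0.2-cl3.vmdk"
```

Once the printed commands look right for your setup, you can paste them into a terminal one at a time. Passing a third argument such as 512 lowers the memory setting.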

11. Assuming you’ve done everything correctly up to this point and your VirtualBox installation is working properly, you should see a window pop up with the boot-up messages for the new virtual machine. Watch this to make sure everything is booting fine. If you see error messages here or if your machine doesn’t boot up correctly, you may have missed a step earlier or selected the wrong file for the hard disk image.

12. After a few moments, you should see the desktop of your new image. You can stop here if you want, but you’ll be missing out on the enhanced functionality that VirtualBox offers, such as better integration with your existing desktop, sharing of files, etc.

13. If you want full integration, open a terminal and run the following command:

sudo apt-get install build-essential linux-headers-`uname -r`

This will install the basics that you need before loading the VirtualBox additions.
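Before moving on, it can be worth confirming that the headers for the running kernel really landed. The following is a rough check, assuming the usual Debian/Ubuntu layout where the headers package provides /lib/modules/<version>/build. The function name have_kernel_headers is just for illustration.

```shell
#!/bin/sh
# Succeed if the kernel build directory exists. With no argument, check the
# directory for the currently running kernel.
# Assumption: headers are exposed at /lib/modules/<version>/build.
have_kernel_headers() {
    [ -d "${1:-/lib/modules/$(uname -r)/build}" ]
}

if have_kernel_headers; then
    echo "kernel headers found for $(uname -r)"
else
    echo "kernel headers missing for $(uname -r); re-run the apt-get command above"
fi
```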


14. Select Install Guest Additions from the Devices menu.

15. You should now see a pop-up window prompting you to run the installer for the guest additions. Click the Run button to continue.

16. If the dependencies installed correctly earlier, you’ll see a terminal window, which will show you the progress as the add-ons are installed.


17. At this point, you can select Shutdown from the system menu in the top menu bar, and then choose Restart to reboot your virtual machine. When the VM restarts and the desktop is fully loaded, you should be able to resize the window, use your mouse seamlessly between the virtual machine window and your desktop, and add a shared folder (see the VirtualBox documentation for instructions on this).
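If resizing or mouse integration doesn’t seem to work after the reboot, one thing to check is whether the guest kernel module actually loaded. Here is a small sketch. The module names are an assumption on my part: recent Guest Additions load vboxguest, and I believe older releases used vboxadd, so the check accepts either.

```shell
#!/bin/sh
# Read `lsmod` output on stdin and succeed if a VirtualBox guest module
# is listed. Assumption: the module is named vboxguest (or vboxadd on
# older Guest Additions releases).
has_vbox_guest_module() {
    grep -Eq '^(vboxguest|vboxadd)[[:space:]]'
}

lsmod 2>/dev/null | has_vbox_guest_module && echo "guest additions kernel module is loaded"
```

If nothing is printed, re-running the Guest Additions installer after checking the build dependencies is the first thing I would try.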

One last thing: there is a call at the very end of /etc/init.d/rc.local to /usr/bin/vmware-user that you might want to remove. It won’t hurt anything if you leave it there, but you will occasionally see error messages at startup or shutdown due to its presence. I finally hunted it down just now after running this VM for a while, so it’s really not a big deal.
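Rather than deleting the line, you could comment it out so it’s easy to restore. This is a minimal sketch with a made-up helper name, comment_out_vmware_user. It keeps a .bak copy next to the file and skips lines that are already commented.

```shell
#!/bin/sh
# Comment out any line that calls vmware-user in the given file,
# leaving a backup copy with a .bak suffix.
# Lines with a '#' before the match are left alone, so it is safe to re-run.
comment_out_vmware_user() {
    sed -i.bak '/^[^#]*vmware-user/ s/^/# /' "$1"
}

# Example (inside the VM, as root):
#   comment_out_vmware_user /etc/init.d/rc.local
```

If you’re not working in a root shell, running the same sed command directly with sudo against /etc/init.d/rc.local should have the same effect.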

That’s all, folks!





