Sqoop is an open-source tool used mainly to transfer data between Hadoop (Hive) and traditional databases such as MySQL and PostgreSQL. The following description is excerpted from the Sqoop user guide:
Sqoop is a tool designed to transfer data between Hadoop and relational databases. You can use Sqoop to import data from a relational database management system (RDBMS) such as MySQL or Oracle into the Hadoop Distributed File System (HDFS), transform the data in Hadoop MapReduce, and then export the data back into an RDBMS.
What follows is how I configured and used the Sqoop client on my own machine.
System: Ubuntu 12.04
Prerequisites:
- The Hadoop environment is already configured on the client.
- The HADOOP_HOME environment variable is set.
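Before going further, it can help to confirm that the second prerequisite really holds in the current shell. The snippet below is only a minimal sketch: `check_hadoop_home` is a hypothetical helper name, not part of Hadoop or Sqoop, and it assumes the Hadoop launcher script lives at `bin/hadoop` under HADOOP_HOME, as it does in a stock hadoop-1.0.3 tarball.

```shell
# Hypothetical helper (not part of Hadoop or Sqoop): checks that HADOOP_HOME
# is set and that it contains an executable bin/hadoop launcher.
check_hadoop_home() {
    # Return 1 if the variable is unset or empty.
    if [ -z "${HADOOP_HOME:-}" ]; then
        echo "HADOOP_HOME is not set" >&2
        return 1
    fi
    # Return 2 if the directory does not look like a Hadoop installation.
    if [ ! -x "$HADOOP_HOME/bin/hadoop" ]; then
        echo "no executable found at $HADOOP_HOME/bin/hadoop" >&2
        return 2
    fi
    echo "HADOOP_HOME=$HADOOP_HOME"
}
```

To use it, export HADOOP_HOME to wherever Hadoop was unpacked on your machine (for example in `~/.bashrc`), then run `check_hadoop_home`; a non-zero exit status means the prerequisite is not yet met.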
Versions:
- hadoop: hadoop-1.0.3
- sqoop: Sqoop 1.4.1-incubating
Client configuration:
- Configure Sqoop