一、下载
wget https://mirrors.tuna.tsinghua.edu.cn/apache/sqoop/1.4.7/sqoop-1.4.7.bin__hadoop-2.6.0.tar.gz
mv sqoop-1.4.7.bin__hadoop-2.6.0.tar.gz /opt
tar -zxvf sqoop-1.4.7.bin__hadoop-2.6.0.tar.gz
mv sqoop-1.4.7.bin__hadoop-2.6.0 sqoop-1.4.7
二、配置环境变量
vim ~/.bashrc
内容如下:
# sqoop
export SQOOP_HOME=/opt/sqoop-1.4.7
export PATH=$PATH:$SQOOP_HOME/bin
export CLASSPATH=$CLASSPATH:${SQOOP_HOME}/lib
如图
刷新生效
source ~/.bashrc
三、修改配置文件
使用sqoop需要搭建好hadoop、hbase、zookeeper、hive,搭建方法
zookeeper:https://blog.csdn.net/qq_39680564/article/details/89500281
hadoop:https://blog.csdn.net/qq_39680564/article/details/89513162
hbase:https://blog.csdn.net/qq_39680564/article/details/89515459
hive:https://blog.csdn.net/qq_39680564/article/details/89714184
改名
mv /opt/sqoop-1.4.7/conf/sqoop-env-template.sh /opt/sqoop-1.4.7/conf/sqoop-env.sh
编辑sqoop-env.sh
文件
vim /opt/sqoop-1.4.7/conf/sqoop-env.sh
内容如下
#Set path to where bin/hadoop is available
export HADOOP_COMMON_HOME=/opt/hadoop-3.0.3
#Set path to where hadoop-*-core.jar is available
export HADOOP_MAPRED_HOME=/opt/hadoop-3.0.3
#set the path to where bin/hbase is available
export HBASE_HOME=/opt/hbase-2.1.0
#Set the path to where bin/hive is available
export HIVE_HOME=/opt/hive
#Set the path for where zookeper config dir is
export ZOOCFGDIR=/opt/zookeeper-3.4.10/conf
如图
查看版本
[root@master ~]# sqoop version
Warning: /opt/sqoop-1.4.7/../hcatalog does not exist! HCatalog jobs will fail.
Please set $HCAT_HOME to the root of your HCatalog installation.
Warning: /opt/sqoop-1.4.7/../accumulo does not exist! Accumulo imports will fail.
Please set $ACCUMULO_HOME to the root of your Accumulo installation.
错误: 找不到或无法加载主类 org.apache.hadoop.hbase.util.GetJavaProperty
2019-08-08 09:54:12,649 INFO sqoop.Sqoop: Running Sqoop version: 1.4.7
Sqoop 1.4.7
git commit id 2328971411f57f0cb683dfb79d19d4d19d185dd8
Compiled by maugli on Thu Dec 21 15:59:58 STD 2017
四、sqoop常用参数
create-hive-table 将表定义导入Hive
eval 评估SQL语句并显示结果
export 将HDFS目录导出到数据库表
import 将表从数据库导入HDFS
import-all-tables 将全部表从数据库导入HDFS
job 使用已保存的作业
list-databases 列出服务器上的可用数据库
list-tables 列出数据库中的可用表
merge 合并增量导入的结果