Spark-submit提交任务,封装shell脚本,shell脚本创建HBase表,批量导hive表

使用Spark-submit提交Spark任务封装shell脚本

# =====集群上local 启动模式======
#!/bin/bash

if [ $# -eq 1 ];then
        spark-submit --master local[4]  --class hx.com.Ods2DwdFilterSql --files /home/etl_admin/spark/config.properties sparkDwdFilter-1.0-SNAPSHOT.jar $1
else
  echo "Please input command. eg: ./$0 filename.sql(hql)"
fi
# =====yarn-client 启动模式=======

#!/bin/bash

if [ $# -eq 1 ];then
        spark-submit \
        --master yarn \
        --deploy-mode client \
        --queue default \
        --driver-memory 2g \
        --num-executors 3 \
        --executor-memory 2g \
        --executor-cores 2 \
        --class hx.com.Ods2DwdFilterSql \
        --files /home/etl_admin/spark/config.properties \
        sparkDwdFilter-1.0-SNAPSHOT.jar /opt/etl/sqlFiles/$1 
else
  echo "Please input command. eg: ./$0 filename.sql(hql)"
fi

# =======yarn-cluster 启动模式=======

#!/bin/bash

if [ $# -eq 1 ];then
        spark-submit \
        --master yarn \
        --deploy-mode cluster \
        --queue default \
        --driver-memory 2g \
        --num-executors 3 \
        --executor-memory 2g \
        --executor-cores 2 \
        --class hx.com.Ods2DwdFilterSql \
        --files /home/etl_admin/spark/config.properties \
        sparkDwdFilter-1.0-SNAPSHOT.jar /opt/etl/sqlFiles/$1 
else
  echo "Please input command. eg: ./$0 filename.sql(hql)"
fi

shell脚本创建hbase表

  • create_hbase_table.sh
#!/bin/bash
#1,判断执行脚本时,是否输入正确的参数
[[ $# < 1 ]] && echo "请输入hbase表名!" &&  exit 1

#2,定义hbase表
#hbase_table="'test111'"
hbase_table="'$1'"
cf="'f'"
echo "输入的hbase表 ==>  $hbase_table $cf"

#3,判断表是否已存在
#echo "exists   $hbase_table  "| hbase shell  |grep 'does exist'
echo `hbase shell <<EOF
exists $hbase_table
EOF` |grep 'does exist' 

status=$?
echo "exists ? ==> status=$status"
if [ $status -eq 0 ];then
        echo "table exists..."; exit 0
fi
#创建表
echo "table not exists ! ===> start creating...."
#echo " create  $hbase_table , $cf "  |hbase shell 
hbase shell <<EOF
create $hbase_table , $cf 
EOF

#获取结果
status=$?
if [ $status -eq 0  ];then 
   echo "create succeed ! "
else
   echo "error,create not done !"
fi

批量导出hive表

#!/bin/bash

# 要批量导出建表语句的数据库 
#DATABASES='ods edw dws'
DATABASES='gmall'
for DATABASE in ${DATABASES}
do 
hive -e "use ${DATABASE}; show tables;" > ${DATABASE}_tables.txt
sed -i '/WARN:/d' ${DATABASE}_tables.txt
#sleep 1
echo "use ${DATABASE};" >> ${DATABASE}_repair_tables.sql
echo "set hive.msck.path.validation=ignore;" >> ${DATABASE}_repair_tables.sql
echo "use ${DATABASE};" >> ${DATABASE}_count.sql

cat ${DATABASE}_tables.txt | while read eachline
do
hive -e "use ${DATABASE}; show create table ${eachline};" >> ${DATABASE}_tables_ddl.sql

echo "msck repair table ${DATABASE}.${eachline};" >> ${DATABASE}_repair_tables.sql
echo "select count(1) from ${DATABASE}.${eachline} union all " >> ${DATABASE}_count.sql

done
sed -i '/WARN:/d'  ${DATABASE}_tables_ddl.sql
sed -i "s/\`/ /g"  ${DATABASE}_tables_ddl.sql
cat ${DATABASE}_tables.txt | while read eachtable
do

sed -i "s/CREATE EXTERNAL TABLE  ${eachtable}/;CREATE EXTERNAL TABLE ${DATABASE}.${eachtable}_bak/g"  ${DATABASE}_tables_ddl.sql
done

echo ";" >> ${DATABASE}_tables_ddl.sql
done

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值