hive 数据导入 es

最新推荐文章于 2023-05-16 16:01:12 发布

海牛大数据_青牛老师

最新推荐文章于 2023-05-16 16:01:12 发布

阅读量1.1k

点赞数

分类专栏： es 文章标签： es

本文链接：https://blog.csdn.net/heqingcool/article/details/117653199

版权

es 专栏收录该内容

4 篇文章 0 订阅

订阅专栏

es-hadoop插件

上传es-hadoop插件到集群

准备hive数据

-- 连接hive
beeline -u "jdbc:hive2://worker-1:10000/;principal=hive/worker-1@HAINIU.COM"
-- 创建临时表
create table if not exists xiniu.hivetable(
    pk string,
    col1 int,
    col2 boolean,
    col3 timestamp,
    col4 string
)
comment 'hive表'
row format delimited fields terminated by '\t'
;
-- 加载数据
load data inpath '/eslib/testfile' into table xiniu.hivetable;

导入hive数据到es

上传es-hadoop jar包

hadoop fs -put /opt/elasticsearch-hadoop-7.13.1.jar /eslib/

加载es-hadoop jar包

add jar hdfs:///eslib/elasticsearch-hadoop-7.13.1.jar

创建es的hive外表

CREATE EXTERNAL TABLE xiniu.hive2es(
pk string,
col1 string,
col2 string,
col3 string,
col4 string
)STORED BY 'org.elasticsearch.hadoop.hive.EsStorageHandler'
TBLPROPERTIES('es.resource'='hivemappinges/_doc',
'es.nodes'='worker-1:9200,worker-2:9200,worker-3:9200',
'es.index.auto.create'='TRUE',
'es.index.refresh_interval' = '-1',
'es.index.number_of_replicas' = '0',
'es.batch.write.retry.count' = '6',
'es.batch.write.retry.wait' = '60s',
'es.mapping.name' = 'pk:pk,col1:col1,col2:col2,col3:col3,col4:col4'
);

插入数据到es的hive外表

INSERT OVERWRITE TABLE xiniu.hive2es SELECT pk,col1,col2,col3,col4 FROM xiniu.hivetable;

file

海汼部落原创文章，原文链接：http://www.hainiubl.com/topics/75637

海牛大数据_青牛老师

关注

0
点赞
踩
8

收藏

觉得还不错? 一键收藏
0
评论
hive 数据导入 es

es-hadoop插件上传es-hadoop插件到集群准备hive数据-- 连接hivebeeline -u "jdbc:hive2://worker-1:10000/;principal=hive/worker-1@HAINIU.COM"-- 创建临时表create table if not exists xiniu.hivetable( pk string, col1 int, col2 boolean, col3 timestamp, col4 st
复制链接

扫一扫