ElasticSearch大批量数据入库

最新推荐文章于 2024-01-25 01:52:19 发布

Soyoger

最新推荐文章于 2024-01-25 01:52:19 发布

阅读量3.7k

点赞数

分类专栏： ElasticSearch 文章标签： elasticsearch

本文链接：https://blog.csdn.net/qq_36330643/article/details/71750886

版权

ElasticSearch 专栏收录该内容

48 篇文章 2 订阅

订阅专栏

1.bulk批量导入数据

 
 
  
  curl 172.17.1.15:9200/_bulk?pretty --data-binary @E:\Bin\Debug\testdata\437714060.json
 
 
 
 
  
  curl 172.17.1.15:9200/_bulk?pretty --data-binary @E:\Bin\Debug\testdata\743719428.json
 
 
 
 
  
  curl 172.17.1.15:9200/_bulk?pretty --data-binary @E:\Bin\Debug\testdata\281679894.json
 
 
 
 
  
  curl 172.17.1.15:9200/_bulk?pretty --data-binary @E:\Bin\Debug\testdata\146257480.json
 
 
 
 
  
  curl 172.17.1.15:9200/_bulk?pretty --data-binary @E:\Bin\Debug\testdata\892018760.json

但问题是这方法是先定义一定格式的json文件，然后再用curl命令去执行Elasticsearch的_bulk来批量插入，所以得把数据写进json文件，然后再通过批处理，执行文件插入数据，另外在生成json文件，文件不能过大，过大会报错，所以建议生成10M一个文件

2. github上找到了一个数据导入导出工具

github上找到了一个数据导入导出工具，算是基本上解决了这个问题，直接把数据从原来的index里面取出来然后重新录入到新的index里面，这样省去了数据解析的过程，节约了不少时间。

下面是问题的解决步骤:

首先安装一下elasticsearch-knapsack(数据导入导出工具)

./bin/plugin -install knapsack -url http://xbib.org/repository/org/xbib/elasticsearch/plugin/elasticsearch-knapsack/1.5.2.0/elasticsearch-knapsack-1.5.2.0-plugin.zip

然后可以通过导出命令导出json格式的数据到系统中的某个文件夹（导出某个index中的数据到指定路径）

curl -XPOST 'localhost:9200/test/_export?path=/tmp/myarchive.zip'

然后运用导入命令将数据导入进elasticsearch里面

[html] view plain copy

curl -XPOST 'localhost:9200/test/test/_import?path=/tmp/myarchive.zip'

以下是github地址:

https://github.com/jprante/elasticsearch-knapsack

Soyoger

关注

0
点赞
踩
3

收藏

觉得还不错? 一键收藏
打赏
0
评论
ElasticSearch大批量数据入库

1.bulk批量导入数据curl 172.17.1.15:9200/_bulk?pretty --data-binary @E:\Bin\Debug\testdata\437714060.jsoncurl 172.17.1.15:9200/_bulk?pretty --data-binary @E:\Bin\Debug\testdata\743719428.jsoncurl 172.17.
复制链接

扫一扫