ElasticSearch重启之后shard未分配问题的解决

最新推荐文章于 2024-04-15 02:50:09 发布

chikoucha6215

最新推荐文章于 2024-04-15 02:50:09 发布

阅读量1.2k

点赞数

文章标签：大数据运维

原文链接：https://my.oschina.net/duanvincent/blog/3075188

版权

环境：

Ubuntu16.04
elasticsearch 6.2.3
3个master节点，10个data节点
每个分片有一个副本

故障：

将一个数据节点的elasticsearch换成docker elasticsearch，分词器没有添加到plugins中。随后把分词器添加到plugins中后，发现有的分片没有被分配，但是ES集群启动正常，只不过一直是yellow状态。而且unassigned分片一直未被分配。

解决办法

首先执行：GET user/_recovery?active_only=true 发现集群并没有进行副本恢复。

点击未进行分配的分片，发现allocation_status: "no_attempt"

原因是：shard 自动分配已经达到最大重试次数5次，仍然失败了，所以导致"shard的分配状态已经是：no_attempt"。这时在Kibana Dev Tools，执行命令：POST /_cluster/reroute?retry_failed=true即可。由index.allocation.max_retries参数来控制最大重试次数。

The cluster will attempt to allocate a shard a maximum of index.allocation.max_retries times in a row (defaults to 5), before giving up and leaving the shard unallocated.

当执行reroute命令对分片重新路由后，ElasticSearch会自动进行负载均衡，负载均衡参数cluster.routing.rebalance.enable默认为true。

It is important to note that after processing any reroute commands Elasticsearch will perform rebalancing as normal (respecting the values of settings such as cluster.routing.rebalance.enable) in order to remain in a balanced state.

总结

一般来说，ElasticSearch会自动分配那些 unassigned shards，当发现某些shards长期未分配时，首先看下是否是因为：为索引指定了过多的primary shard 和 replica 数量，然后集群中机器数量又不够。另一个原因就是本文中提到的：由于故障，shard自动分配达到了最大重试次数了，这时执行 reroute 就可以了。

转载于:https://my.oschina.net/duanvincent/blog/3075188

chikoucha6215

关注

0
点赞
踩
1

收藏

觉得还不错? 一键收藏
0
评论
ElasticSearch重启之后shard未分配问题的解决

环境： Ubuntu16.04 elasticsearch 6.2.3 3个master节点，10个data节点每个分片有一个副本故障：将一个数据节点的elasticsearch换成docker elasticsearch，分词器...
复制链接

扫一扫