Elasticsearch Installation: A Complete Walkthrough

Article link: https://blog.csdn.net/xiaoyixiao_/article/details/122804112

Installing Elasticsearch on Linux (Single Node)

Download the software

Extract the archive

# Extract
tar -zxvf elasticsearch-7.8.0-linux-x86_64.tar.gz -C /opt/module
# Rename the directory
mv /opt/module/elasticsearch-7.8.0 /opt/module/es

Create a user

For security reasons, Elasticsearch refuses to run directly as root, so create a dedicated user while logged in as root.

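useradd es # add the es user
passwd es # set a password for the es user
# userdel -r es # only if you made a mistake: delete the user, then add it again
chown -R es:es /opt/module/es # make es the owner of the directory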
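The es user has to own everything under the installation directory, so it can be worth confirming that the chown above reached every file. A minimal sketch, assuming a POSIX shell; the helper name list_foreign_files is hypothetical and not part of Elasticsearch:

```shell
#!/bin/sh
# Hypothetical helper: print every path under DIR that is NOT owned by USER.
# No output means the whole tree belongs to USER.
list_foreign_files() {
    dir="$1"
    user="$2"
    find "$dir" ! -user "$user"
}

# Example (run after the chown above):
#   list_foreign_files /opt/module/es es
```

If the command prints any paths, rerun the chown before starting Elasticsearch.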

Edit the configuration files

Edit /opt/module/es/config/elasticsearch.yml

# Add the following settings
cluster.name: elasticsearch
node.name: node-1
network.host: 0.0.0.0
http.port: 9200
cluster.initial_master_nodes: ["node-1"]

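Edit /etc/security/limits.d/20-nproc.conf

# Append the following to the end of the file
# Limit on the number of files each process may open
es soft nofile 65536
es hard nofile 65536
# OS-level limit on the number of processes each user may create
* hard nproc 4096
# Note: * stands for all Linux user names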

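Edit /etc/sysctl.conf

# Add the following to the file
# Maximum number of VMAs (virtual memory areas) a process may own; the Linux default is 65530
vm.max_map_count=655360

Reload the kernel parameters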

sysctl -p
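Before starting Elasticsearch, it may help to confirm that both the file-descriptor limit and vm.max_map_count actually took effect. The sketch below assumes a POSIX shell on Linux; the helper names limit_ok and preflight are hypothetical. The threshold 65536 is the nofile value configured above, and 262144 is the minimum vm.max_map_count that Elasticsearch's bootstrap check asks for.

```shell
#!/bin/sh
# Hypothetical helpers for a pre-start check; the names are illustrative only.

# Succeeds when VALUE (a number or the word "unlimited") is at least MIN.
limit_ok() {
    value="$1"
    min="$2"
    [ "$value" = "unlimited" ] && return 0
    [ "$value" -ge "$min" ]
}

# Prints one line per check and returns non-zero if any check fails.
# Both arguments are optional and default to the live values of the
# current shell (ulimit -n) and kernel (/proc/sys/vm/max_map_count).
preflight() {
    nofile="${1:-$(ulimit -n)}"
    map_count="${2:-$(cat /proc/sys/vm/max_map_count)}"
    rc=0
    if limit_ok "$nofile" 65536; then
        echo "nofile ok: $nofile"
    else
        echo "nofile too low: $nofile"
        rc=1
    fi
    if limit_ok "$map_count" 262144; then
        echo "vm.max_map_count ok: $map_count"
    else
        echo "vm.max_map_count too low: $map_count"
        rc=1
    fi
    return "$rc"
}

# Example: define the functions in a shell opened as the es user, then run
#   preflight
```

The limits in /etc/security/limits.d are normally applied when a new login session starts, so run the check from a fresh session of the es user rather than from the root shell used for editing.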

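Start Elasticsearch

Start it as the es user

cd /opt/module/es/
# Start in the foreground
bin/elasticsearch
# Start in the background
bin/elasticsearch -d

Elasticsearch creates files dynamically at startup. If the owner of those files does not match the user running Elasticsearch, startup fails and you need to reset the owner and group with chown. Once the node is running, open http://linux1:9200/ in a browser.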
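The same check can be done from the command line instead of a browser. A minimal sketch, assuming curl is installed and that the host name linux1 resolves as in the URL above; es_version is a hypothetical helper that pulls the version number out of the JSON the root endpoint returns:

```shell
#!/bin/sh
# Hypothetical helper: read the JSON returned by the root endpoint on stdin
# and print the value of its "number" field (the Elasticsearch version).
es_version() {
    sed -n 's/.*"number" *: *"\([^"]*\)".*/\1/p'
}

# Example:
#   curl -s http://linux1:9200/ | es_version
```

An empty result means the node did not answer or returned something other than the usual banner.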

Installing Elasticsearch on Linux (Cluster)

Download the software

Elasticsearch 7.8.0 | Elastic

Extract the archive

Extract the archive on all three machines

# Extract
tar -zxvf elasticsearch-7.8.0-linux-x86_64.tar.gz -C /opt/module
# Rename the directory
mv /opt/module/elasticsearch-7.8.0 /opt/module/es-cluster

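Create a user

For security reasons, Elasticsearch refuses to run directly as root, so create a dedicated user on every node while logged in as root.

useradd es # add the es user
passwd es # set a password for the es user
# userdel -r es # only if you made a mistake: delete the user, then add it again
chown -R es:es /opt/module/es-cluster # make es the owner of the directory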

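Edit the configuration files

Edit /opt/module/es-cluster/config/elasticsearch.yml, then distribute the file to the other nodes and adjust node.name and network.host on each one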

# Add the following settings
# Cluster name
cluster.name: cluster-es
# Node name; must be unique for each node
node.name: node-1
# Host name or IP address; must be unique for each node
network.host: linux1
# Whether this node is master-eligible
node.master: true
node.data: true
http.port: 9200
# The head plugin needs these two settings enabled
http.cors.allow-origin: "*"
http.cors.enabled: true
http.max_content_length: 200mb
# New in ES 7.x: required to elect a master when bootstrapping a new cluster
cluster.initial_master_nodes: ["node-1"]
# New in ES 7.x: node discovery
discovery.seed_hosts: ["linux1:9300","linux2:9300","linux3:9300"]
gateway.recover_after_nodes: 2
network.tcp.keep_alive: true
network.tcp.no_delay: true
transport.tcp.compress: true
# Number of shard rebalancing tasks allowed to run concurrently across the cluster; default 2
cluster.routing.allocation.cluster_concurrent_rebalance: 16
# Number of concurrent recoveries per node when nodes are added or removed or shards are rebalanced; default 2
cluster.routing.allocation.node_concurrent_recoveries: 16
# Number of concurrent primary-shard recoveries per node during initial recovery; default 4
cluster.routing.allocation.node_initial_primaries_recoveries: 16
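Only node.name and network.host differ between nodes, so after copying the file to each machine those two lines can be rewritten in place. The sketch below is one possible way to do that. It assumes the nodes are named node-1 to node-3 and the hosts linux1 to linux3; only node-1 and the linuxN host names appear in the configuration above, so node-2 and node-3 are assumptions, as is the helper name apply_node_settings.

```shell
#!/bin/sh
# Hypothetical helper: rewrite the two per-node lines of an
# elasticsearch.yml FILE for node number INDEX (1, 2 or 3).
# All other lines are copied unchanged.
apply_node_settings() {
    file="$1"
    index="$2"
    sed -e "s/^node\.name:.*/node.name: node-$index/" \
        -e "s/^network\.host:.*/network.host: linux$index/" \
        "$file" > "$file.tmp" && mv "$file.tmp" "$file"
}

# Example (run on linux2 after the file has been copied there):
#   apply_node_settings /opt/module/es-cluster/config/elasticsearch.yml 2
```

Writing to a temporary file and then renaming it avoids relying on sed -i, whose syntax differs between implementations.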

Edit /etc/security/limits.conf, then distribute the file

# Append the following to the end of the file
es soft nofile 65536
es hard nofile 65536

Edit /etc/security/limits.d/20-nproc.conf, then distribute the file

# Append the following to the end of the file
es soft nofile 65536
es hard nofile 65536
* hard nproc 4096
# Note: * stands for all Linux user names

Edit /etc/sysctl.conf

# Add the following to the file
vm.max_map_count=655360

Reload the kernel parameters

sysctl -p

Start Elasticsearch

Start Elasticsearch on each node

cd /opt/module/es-cluster
# Start in the foreground
bin/elasticsearch
# Start in the background
bin/elasticsearch -d

Test the cluster
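One way to test the cluster is to ask any node for the cluster health and the node count through the _cat API. A minimal sketch, assuming curl is installed and the three hosts configured above; cluster_ready is a hypothetical helper that interprets one line of _cat/health output:

```shell
#!/bin/sh
# Hypothetical helper: read one line of "_cat/health?h=status,node.total"
# output (for example "green 3") on stdin and succeed only when the cluster
# is not red and reports the expected number of nodes.
cluster_ready() {
    expected="$1"
    read -r status total || [ -n "$status" ] || return 1
    [ "$status" != "red" ] && [ "$total" -eq "$expected" ]
}

# Example:
#   curl -s 'http://linux1:9200/_cat/health?h=status,node.total' | cluster_ready 3 \
#     && echo "cluster is up with 3 nodes"
#   curl -s 'http://linux1:9200/_cat/nodes?v'    # lists every node that joined
```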

Installing Elasticsearch with Docker

Pull Elasticsearch (storage and search) and Kibana (visual search interface)

The two versions must match

docker pull elasticsearch:7.4.2
docker pull kibana:7.4.2

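Preparation before starting Elasticsearch

Create host directories under /mydata to use as data volumes. They are mounted over the corresponding directories inside the container, so editing files under /mydata changes what the container sees.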

mkdir -p /mydata/elasticsearch/config
mkdir -p /mydata/elasticsearch/data

Allow any remote machine to access ES

echo "http.host: 0.0.0.0" >/mydata/elasticsearch/config/elasticsearch.yml

Change permissions recursively so that ES can access the directories

chmod -R 777 /mydata/elasticsearch/

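Start Elasticsearch

Because the container's files are mapped to host directories, the data survives removing and recreating the container. If the container exits with an access-permission error, check that the permissions on the directories under /mydata/elasticsearch were set as shown above.

# 9200 is the HTTP port used by clients; 9300 is the transport port used for communication between cluster nodes
# The first -e runs Elasticsearch in single-node mode
# The second -e sets the JVM heap size; in production it can be raised, up to about 32G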
docker run --name elasticsearch -p 9200:9200 -p 9300:9300 \
-e  "discovery.type=single-node" \
-e ES_JAVA_OPTS="-Xms64m -Xmx512m" \
-v /mydata/elasticsearch/config/elasticsearch.yml:/usr/share/elasticsearch/config/elasticsearch.yml \
-v /mydata/elasticsearch/data:/usr/share/elasticsearch/data \
-v  /mydata/elasticsearch/plugins:/usr/share/elasticsearch/plugins \
-d elasticsearch:7.4.2
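The container needs some time before Elasticsearch answers requests, so a script that continues right after docker run may fail. A small retry loop is one way to wait for it. This is a sketch in POSIX shell; the helper name retry is hypothetical, and the example assumes curl is installed and is run on the Docker host itself (hence localhost).

```shell
#!/bin/sh
# Hypothetical helper: run a command up to ATTEMPTS times, sleeping DELAY
# seconds between failed attempts. Succeeds as soon as the command succeeds.
retry() {
    attempts="$1"
    delay="$2"
    shift 2
    n=1
    while [ "$n" -le "$attempts" ]; do
        "$@" && return 0
        [ "$n" -lt "$attempts" ] && sleep "$delay"
        n=$((n + 1))
    done
    return 1
}

# Example: wait up to about a minute for port 9200 to answer.
#   retry 30 2 curl -sf -o /dev/null http://localhost:9200/ && echo "Elasticsearch is up"
```

If the loop gives up, docker logs elasticsearch usually shows why the node did not start.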

View the logs

docker logs elasticsearch

Start Elasticsearch automatically on boot

docker update elasticsearch --restart=always

Start Kibana (visual search interface)

docker run --name kibana -e ELASTICSEARCH_HOSTS=http://192.168.56.10:9200 -p 5601:5601 -d kibana:7.4.2

Start Kibana automatically on boot

docker update kibana --restart=always

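Installing the IK Analyzer for Elasticsearch

Analyzer overview

A tokenizer receives a stream of characters, splits it into individual tokens (usually words), and outputs a stream of tokens. It also records the order or position of each term, which is used for phrase and word-proximity queries, and the start and end character offsets of the original word each term represents, which are used to highlight search results. Elasticsearch ships with many built-in tokenizers (such as the standard tokenizer) that can be used to build custom analyzers. They handle Chinese poorly, however, so an additional analyzer usually has to be installed.

**Note:** do not install it automatically with the default elasticsearch-plugin install xxx.zip. Each IK analyzer release corresponds to exactly one ES version; download the matching release from the IK analyzer page.

Installing the analyzer

In the Docker installation above, the container directory /usr/share/elasticsearch/plugins was mapped to /mydata/elasticsearch/plugins on the host. The convenient approach is therefore to download elasticsearch-analysis-ik-7.4.2.zip and extract it into a subdirectory of that folder (for example ik). After installing, restart the elasticsearch container, then test with the following requests.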

GET _analyze
{
   "analyzer": "ik_smart", 
   "text":"我是中国人"
}

GET _analyze
{
   "analyzer": "ik_max_word", 
   "text":"我是中国人"
}
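The two requests above are written for the Kibana console. Without Kibana, the same request can be sent with curl. A minimal sketch, assuming curl is installed and that Elasticsearch is reachable at 192.168.56.10:9200 (the address used in the Kibana command above); analyze_body is a hypothetical helper that builds the JSON body:

```shell
#!/bin/sh
# Hypothetical helper: print the JSON body for an _analyze request.
# Assumes ANALYZER and TEXT contain no double quotes or backslashes.
analyze_body() {
    printf '{"analyzer": "%s", "text": "%s"}\n' "$1" "$2"
}

# Example:
#   analyze_body ik_smart "我是中国人" |
#     curl -s -H 'Content-Type: application/json' \
#          'http://192.168.56.10:9200/_analyze?pretty' -d @-
```

Swapping ik_smart for ik_max_word sends the second request.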

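Custom dictionary

Goal: for example, have 笑一笑 treated as a single word

Edit the configuration file

Edit IKAnalyzer.cfg.xml in /usr/share/elasticsearch/plugins/ik/config (inside the container; on the host this is /mydata/elasticsearch/plugins/ik/config)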

<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE properties SYSTEM "http://java.sun.com/dtd/properties.dtd">
<properties>
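	<comment>IK Analyzer extension configuration</comment>
	<!-- Users can configure their own extension dictionary here -->
	<entry key="ext_dict"></entry>
	<!-- Users can configure their own extension stopword dictionary here -->
	<entry key="ext_stopwords"></entry>
	<!-- Users can configure a remote extension dictionary here -->
	<entry key="remote_ext_dict">http://192.168.56.10/es/fenci.txt</entry>
	<!-- Users can configure a remote extension stopword dictionary here -->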
	<!-- <entry key="remote_ext_stopwords">words_location</entry> -->
</properties>

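After the change, restart the elasticsearch container; otherwise the change does not take effect.

docker restart elasticsearch

After the update, ES applies the new tokenization only to newly indexed data; existing data is not re-tokenized. To re-tokenize existing data, run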

POST my_index/_update_by_query?conflicts=proceed

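Setting up a remote extension dictionary

Goal: serve the remote extension dictionary with nginx

# Remote extension dictionary URL: http://192.168.56.10/es/fenci.txt
mkdir -p /mydata/nginx/html/es
cd /mydata/nginx/html/es
vim fenci.txt
# In the editor, enter the word 笑一笑 on its own line, then save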
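When words are added regularly, editing the file by hand invites duplicates. The sketch below appends a word only if it is not already listed. It assumes the dictionary is a UTF-8 text file with one word per line, and the helper name add_word is hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: append WORD to the dictionary FILE unless an
# identical line is already present (one word per line).
add_word() {
    file="$1"
    word="$2"
    touch "$file"
    grep -qxF -e "$word" "$file" || printf '%s\n' "$word" >> "$file"
}

# Example:
#   add_word /mydata/nginx/html/es/fenci.txt "笑一笑"
```

The -x and -F options make grep compare whole lines literally, so a word that is merely a substring of another entry is still added.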

Test the result

GET _analyze
{
   "analyzer": "ik_max_word", 
   "text":"我是笑一笑"
}
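To check from a script whether the custom word came back as a single token, the token values can be pulled out of the _analyze response. This is a rough text filter rather than a JSON parser, and it assumes tokens contain no commas, braces, or double quotes; the helper names tokens and has_token are hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: read an _analyze response on stdin and print one
# token per line. Works on both compact and pretty-printed JSON.
tokens() {
    tr ',{' '\n\n' | sed -n 's/.*"token" *: *"\([^"]*\)".*/\1/p'
}

# Hypothetical helper: succeed when WORD appears as a complete token.
has_token() {
    tokens | grep -qxF -e "$1"
}

# Example, assuming the response of the request above was saved to response.json:
#   has_token "笑一笑" < response.json && echo "custom word recognised"
```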