安装elasticsearch-analysis-ik中文分词器

1 安装elasticsearch-analysis-ik中文分词器

Ik介绍:ik是一款中文的分词插件,支持自定义词库。

1.1 下载ik分词器

下载地址:https://github.com/medcl/elasticsearch-analysis-ik/releases

下载指定版本的分词器(zip版本)

1.2 解压ik分词器

[es@bigdata1 plugins]$ cd /home/es/
[es@bigdata1 ~]$ ls
elasticsearch-6.2.2                  jdk1.8.0_161                kibana-6.2.2-linux-x86_64.tar.gz
elasticsearch-6.2.2.tar.gz           jdk-8u161-linux-x64.tar.gz  logstash-6.2.2.tar.gz
elasticsearch-analysis-ik-6.2.2.zip  kibana-6.2.2-linux-x86_64
[es@bigdata1 ~]$ unzip elasticsearch-analysis-ik-6.2.2.zip 
Archive:  elasticsearch-analysis-ik-6.2.2.zip
   creating: elasticsearch/
  inflating: elasticsearch/elasticsearch-analysis-ik-6.2.2.jar  
  inflating: elasticsearch/httpclient-4.5.2.jar  
  inflating: elasticsearch/httpcore-4.4.4.jar  
  inflating: elasticsearch/commons-logging-1.2.jar  
  inflating: elasticsearch/commons-codec-1.9.jar  
  inflating: elasticsearch/plugin-descriptor.properties  
   creating: elasticsearch/config/
  inflating: elasticsearch/config/extra_single_word_full.dic  
  inflating: elasticsearch/config/quantifier.dic  
  inflating: elasticsearch/config/IKAnalyzer.cfg.xml  
  inflating: elasticsearch/config/main.dic  
  inflating: elasticsearch/config/extra_single_word_low_freq.dic  
  inflating: elasticsearch/config/extra_stopword.dic  
  inflating: elasticsearch/config/preposition.dic  
  inflating: elasticsearch/config/extra_main.dic  
  inflating: elasticsearch/config/extra_single_word.dic  
  inflating: elasticsearch/config/suffix.dic  
  inflating: elasticsearch/config/surname.dic  
  inflating: elasticsearch/config/stopword.dic  
[es@bigdata1 ~]$ ls
elasticsearch               elasticsearch-analysis-ik-6.2.2.zip  kibana-6.2.2-linux-x86_64
elasticsearch-6.2.2         jdk1.8.0_161                         kibana-6.2.2-linux-x86_64.tar.gz
elasticsearch-6.2.2.tar.gz  jdk-8u161-linux-x64.tar.gz           logstash-6.2.2.tar.gz
[es@bigdata1 ~]$ mv elasticsearch/* ./elasticsearch-6.2.2/plugins/ik/
[es@bigdata1 ~]$ vim ./elasticsearch-6.2.2/config/elasticsearch.yml

1.3 重新启动Elasticsearch服务

ps -ef | grep elasticsearch

然后kill掉相关的进程。

然后按照18.8的方式启动elasticsearch进程

1.4 然后执行以下命令

GET _analyze  
{  
  "analyzer":"ik_smart",  
  "text":"2018年5月全球编程语言排行榜"  
}

运行结果是:

{
  "tokens": [
    {
      "token": "2018年",
      "start_offset": 0,
      "end_offset": 5,
      "type": "TYPE_CQUAN",
      "position": 0
    },
    {
      "token": "5月",
      "start_offset": 5,
      "end_offset": 7,
      "type": "TYPE_CQUAN",
      "position": 1
    },
    {
      "token": "全球",
      "start_offset": 7,
      "end_offset": 9,
      "type": "CN_WORD",
      "position": 2
    },
    {
      "token": "编程",
      "start_offset": 9,
      "end_offset": 11,
      "type": "CN_WORD",
      "position": 3
    },
    {
      "token": "语言",
      "start_offset": 11,
      "end_offset": 13,
      "type": "CN_WORD",
      "position": 4
    },
    {
      "token": "排行榜",
      "start_offset": 13,
      "end_offset": 16,
      "type": "CN_WORD",
      "position": 5
    }
  ]
}

Kibana的效果图
这里写图片描述

  • 0
    点赞
  • 2
    收藏
    觉得还不错? 一键收藏
  • 打赏
    打赏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包

打赏作者

涂作权的博客

你的鼓励将是我创作的最大动力

¥1 ¥2 ¥4 ¥6 ¥10 ¥20
扫码支付:¥1
获取中
扫码支付

您的余额不足,请更换扫码支付或充值

打赏作者

实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值