Elasticsearch Ingest OpenNLP 插件使用教程

最新推荐文章于 2024-08-16 08:24:54 发布

尤琦珺Bess

最新推荐文章于 2024-08-16 08:24:54 发布

阅读量789

点赞数 21

本文链接：https://blog.csdn.net/gitblog_00067/article/details/141236565

版权

Elasticsearch Ingest OpenNLP 插件使用教程

elasticsearch-ingest-opennlpAn Elasticsearch ingest processor to do named entity extraction using Apache OpenNLP项目地址:https://gitcode.com/gh_mirrors/el/elasticsearch-ingest-opennlp

项目介绍

Elasticsearch Ingest OpenNLP 是一个 Elasticsearch 的插件，用于通过 Apache OpenNLP 进行命名实体提取。该插件允许用户在 Elasticsearch 中处理文档时，自动识别和提取文本中的命名实体，如人名、地名、组织名等。

项目快速启动

安装插件

首先，确保你已经安装了 Elasticsearch。然后，通过以下命令安装 Ingest OpenNLP 插件：

bin/elasticsearch-plugin install https://github.com/spinscale/elasticsearch-ingest-opennlp/releases/download/7.15.0.1/ingest-opennlp-7.15.0.1.zip

配置处理器

安装完成后，需要在 Elasticsearch 中配置 Ingest OpenNLP 处理器。以下是一个示例配置：

PUT _ingest/pipeline/opennlp-pipeline
{
  "description": "A pipeline to perform named entity extraction",
  "processors": [
    {
      "opennlp" : {
        "field" : "message",
        "target_field" : "entities",
        "model" : "en-ner-person.bin"
      }
    }
  ]
}

使用处理器

配置好处理器后，可以通过以下命令使用该处理器处理文档：

POST _ingest/pipeline/opennlp-pipeline/_simulate
{
  "docs": [
    {
      "_source": {
        "message": "John Doe works at Google."
      }
    }
  ]
}