A Roundup of Annotation Tools for Natural Language Processing

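I have put together a list of annotation tools for natural language processing that I find fairly easy to use. If I have missed any, additions are welcome.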

| Name | Year | Description | License | Website | GitHub |
| --- | --- | --- | --- | --- | --- |
| doccano | 2019 | doccano is an open source text annotation tool for humans. It provides annotation features for text classification, sequence labeling and sequence to sequence. So, you can create labeled data for sentiment analysis, named entity recognition, text summarization and so on. Just create a project, upload data and start annotating. You can build a dataset in hours. | MIT | https://github.com/chakki-works/doccano | https://github.com/chakki-works/doccano |
| INCEpTION | 2018 | A semantic annotation platform offering intelligent assistance and knowledge management. The annotation of specific semantic phenomena often requires compiling task-specific corpora and creating or extending task-specific knowledge bases. Presently, researchers require a broad range of skills and tools to address such semantic annotation tasks. | Apache | https://inception-project.github.io/ | https://github.com/inception-project/inception |
| NeuroNER | 2017 | NeuroNER is a program that performs named-entity recognition (NER). | | https://github.com/Franck-Dernoncourt/NeuroNER | https://github.com/Franck-Dernoncourt/NeuroNER |
| Prodigy | 2017 | Prodigy is a machine teaching tool so efficient that a single data scientist can create end-to-end prototypes for new functionality without commissioning external annotations, and with a smooth path to production. Whether you're working on entity recognition, intent detection or image classification, Prodigy can help you train and evaluate your models faster. | Commercial | https://prodi.gy/ | |
| Chinese-Annotator | 2017 | Annotator for Chinese Text Corpus | Apache | | https://github.com/deepwel/Chinese-Annotator |
| Chatito | 2017 | Generate datasets for AI chatbots, NLP tasks, named entity recognition or text classification models using a simple DSL! | MIT | https://rodrigopivi.github.io/Chatito/ | https://github.com/rodrigopivi/Chatito |
| YEDDA | 2016 | YEDDA (previously SUTDAnnotator) is developed for annotating chunks, entities and events on text (almost all languages, including English and Chinese), symbols and even emoji. It supports shortcut annotation, which is extremely efficient for annotating text by hand. The user only needs to select a text span and press a shortcut key, and the span will be annotated automatically. It also supports a command annotation model, which annotates multiple entities in batch, and supports exporting annotated text into sequence text. | Apache | https://github.com/jiesutd/YEDDA | https://github.com/jiesutd/YEDDA |
| rasa-nlu-trainer | 2016 | This is a tool to edit your training examples for rasa NLU. Use the online version or install with npm. | Commercial | | https://github.com/RasaHQ/rasa-nlu-trainer |
| TALEN | 2016 | A lightweight web-based tool for annotating word sequences. | Research | https://github.com/CogComp/talen | https://github.com/CogComp/talen |
| WebAnno | 2014 | WebAnno is a general purpose web-based annotation tool for a wide range of linguistic annotations, including various layers of morphological, syntactical, and semantic annotations. Additionally, custom annotation layers can be defined, allowing WebAnno to be used also for non-linguistic annotation tasks. WebAnno is a multi-user tool supporting different roles such as annotator, curator, and project manager. The progress and quality of annotation projects can be monitored and measured in terms of inter-annotator agreement. Multiple annotation projects can be conducted in parallel. | Apache | https://webanno.github.io/webanno/ | https://github.com/webanno/webanno |
| MAE | 2014 | MAE is a lightweight, general-purpose natural language annotation tool. | GPL | https://github.com/keighrim/mae-annotation | https://github.com/keighrim/mae-annotation |
| Anafora | 2013 | Anafora (pronounced "a-nuh-FOUR-uh", /ænəˈfɔɹə/) is a new annotation tool written at the University of Colorado by Wei-te Chen and Will Styler. Anafora is designed to be a lightweight, flexible annotation solution which is easy to deploy for large and small projects. | Apache | https://github.com/weitechen/anafora | https://github.com/weitechen/anafora |
| brat | 2010 | brat is a web-based tool for text annotation; that is, for adding notes to existing text documents. brat is designed in particular for structured annotation, where the notes are not freeform text but have a fixed form that can be automatically processed and interpreted by a computer. | MIT | https://github.com/nlplab/brat | https://github.com/nlplab/brat |
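
To make the "fixed form that can be automatically processed" in the brat entry a bit more concrete, here is a minimal sketch of reading one text-bound annotation line. It assumes brat's standoff layout as I understand it: an ID, then a label with character offsets, then the covered text, separated by tabs, with `;` between the fragments of a discontinuous span. The sample lines, the `TextBound` type and the `parse_text_bound` helper are all made up for illustration and are not part of brat itself.

```python
from typing import List, NamedTuple, Tuple


class TextBound(NamedTuple):
    """One text-bound ("T") annotation read from a standoff-style line."""
    ann_id: str
    label: str
    spans: List[Tuple[int, int]]  # (start, end) character offsets
    text: str


def parse_text_bound(line: str) -> TextBound:
    """Parse a line shaped like 'T1<TAB>Person 0 5<TAB>Alice' (assumed layout)."""
    ann_id, label_and_offsets, text = line.rstrip("\n").split("\t", 2)
    label, offsets = label_and_offsets.split(" ", 1)
    # Assumption: discontinuous annotations separate fragments with ';'.
    spans = [
        (int(start), int(end))
        for start, end in (fragment.split() for fragment in offsets.split(";"))
    ]
    return TextBound(ann_id, label, spans, text)


# Hypothetical sample line, not taken from any real corpus.
ann = parse_text_bound("T1\tPerson 0 5\tAlice")
print(ann.label, ann.spans, ann.text)
```

Because every line has the same shape, a few lines of string splitting are enough to recover the label and offsets, which is the practical benefit of structured annotation over freeform notes.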