Getting Started with NLP
Have you ever tried to build a chatbot and run into blockers around data privacy or supported languages? Do you want to reduce chatbot response time, or run your chatbots without an active data connection?
If that’s the case or if you’re just curious and want to learn more, give NLP.js a try.
Natural Language Processing & NLP.js
Natural Language Processing, or NLP, is a field that combines linguistics, computing, and artificial intelligence. Correctly understanding natural language is critical for virtual assistants, chatbots, voice assistants, and a wide range of applications that rely on a voice or text interface with a machine. These applications typically include a natural language processor whose purpose is to extract the interactions and intention, as well as related information and metadata, from a piece of plain natural language and translate them into something a machine can process.
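To make "translate them into something a machine can process" concrete, here is a minimal sketch of the idea in TypeScript. It is a toy keyword matcher, not how NLP.js works internally and not its API: real engines use trained classifiers. Every name in it (`parseUtterance`, `INTENT_KEYWORDS`, `KNOWN_CITIES`, the intent labels) is hypothetical and chosen only for illustration.

```typescript
// Toy illustration only: real NLP engines use trained models,
// not keyword lookup. All names below are hypothetical.

interface ParsedUtterance {
  intent: string;                    // what the user wants
  entities: Record<string, string>;  // related information found in the text
}

// Hypothetical table mapping an intent label to trigger words.
const INTENT_KEYWORDS: Record<string, string[]> = {
  "weather.query": ["weather", "forecast", "rain"],
  "greeting.hello": ["hello", "hi", "hey"],
};

// Hypothetical list of city names treated as known entities.
const KNOWN_CITIES: string[] = ["madrid", "london", "paris"];

function parseUtterance(text: string): ParsedUtterance {
  // Normalize: lowercase, then split on anything that is not a letter.
  const tokens = text
    .toLowerCase()
    .split(/[^a-z]+/)
    .filter((t) => t.length > 0);

  // Pick the first intent that has a keyword among the tokens.
  let intent = "None";
  const intentNames = Object.keys(INTENT_KEYWORDS);
  for (let i = 0; i < intentNames.length && intent === "None"; i++) {
    const keywords = INTENT_KEYWORDS[intentNames[i]];
    if (keywords.some((k) => tokens.indexOf(k) !== -1)) {
      intent = intentNames[i];
    }
  }

  // Extract the first token that matches a known city, if any.
  const entities: Record<string, string> = {};
  for (const token of tokens) {
    if (KNOWN_CITIES.indexOf(token) !== -1) {
      entities["city"] = token;
      break;
    }
  }

  return { intent, entities };
}

// Plain text in, structured data out:
console.log(JSON.stringify(parseUtterance("What's the weather in Madrid?")));
// prints {"intent":"weather.query","entities":{"city":"madrid"}}
```

The structured result (an intent label plus extracted entities) is the kind of output a chatbot's dialogue logic can act on. A production processor would replace the keyword table with a model trained on example sentences.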
NLP.js is an on-premises, open-source set of more than 70 libraries that tackles the three main areas of NLP: natural language understanding, language generation, and named entity recognition. Its key differentiator is a better user experience through faster response times, broader language support, and, according to some benchmarks, improved accuracy, together with more control over data privacy and security.
Why have an NLP library?
It isn’t easy to understand how existing NLPs process each sentence or why they produce a specific behavior as output. This black-box effect, where you cannot see why the chatbot answered in a particular way or dig into the source of the problem, frustrates chatbot managers. Having the NLP as an open-source library gives more visibility into the low-level natural language processing. It enables technical people to better understand how a conversation is processed and to manage language-specific strategies that reach the expected accuracy level. Although a specific strategy per country isn’t mandatory, it’s highly recommended when you target high-performance chatbots in languages other than the most commonly used ones.
The main features of NLP.js
1. Language support
NLP.js supports up to 104 different languages with the use of BERT embeddings. Without BERT, it natively supports 41 languages.