自然语言处理
Adam婷
笔者在人工智能/机器学习领域中默默探索,时而迷惘,时而欣喜。
展开
-
自然语言处理1(NLP)------NLP--Basic Embedding Model
NLP–Basic Embedding Model1-1. NNLM(Neural Network Language Model) - Predict Next WordPaper下载代码块NNLM-Tensor.py@Tae Hwan Jung @graykodeimport tensorflow as tfimport numpy as nptf.reset_d...原创 2019-02-07 09:01:11 · 615 阅读 · 0 评论 -
使用ULMFiT和Python中的fastai库的文本分类(NLP)教程
Introduction自然语言处理(NLP)在当今世界不需要介绍。 它是最重要的研究和研究领域之一,并且在过去十年中出现了惊人的兴趣增长。 NLP的基础知识广为人知,易于掌握。 但是当文本数据变得庞大且非结构化时,事情开始变得棘手。这就是深度学习变得如此关键的地方。 是的,我正在谈论对NLP任务进行深入学习 - 这仍然是一个相对较少的路径。 DL已经证明了它在图像检测,分类和分割等计算机视觉...原创 2019-04-21 22:59:13 · 1721 阅读 · 3 评论 -
使用NLP预测电影类型 - 多标签分类
IntroductionI was intrigued going through this amazing article on building a multi-label image classification model last week. The data scientist in me started exploring possibilities of transforming...原创 2019-04-24 00:29:47 · 5354 阅读 · 2 评论 -
8个优秀的预训练模型,帮助您开始使用自然语言处理(NLP)
IntroductionNatural Language Processing (NLP) applications have become ubiquitous these days. I seem to stumble across websites and applications regularly that are leveraging NLP in one form or anoth...原创 2019-04-19 20:45:43 · 1352 阅读 · 0 评论 -
A Guide to Building an Intelligent Chatbot for Slack using Dialogflow API
Introduction自然语言处理(NLP)领域的突破近来出现了突然上升。 我们可用的文本数据量巨大,数据科学家正在提出新的创新解决方案来解析它并分析模式。 从编写整本小说到解码古代文本,我们已经看到了NLP的各种应用。最受欢迎的应用程序之一是聊天机器人。 Zomato,Starbucks,Lyft和Spotify等组织正在其网站和移动应用上利用这项技术。 作为用户,我们不再需要担心被搁置 ...原创 2019-04-30 09:32:27 · 636 阅读 · 0 评论 -
使用新闻预测股票走势-----Kaggle经典ph.D操作分析
General informationTwo Sigma Financial News Competition is a unique competitions: not only it is a Kernel-only competition, but we aren’t supposed to download data and during stage two our solutions ...原创 2019-05-05 23:26:28 · 3847 阅读 · 8 评论 -
How do Transformers Work in NLP? A Guide to the Latest State-of-the-Art Models
How do Transformers Work in NLP? A Guide to the Latest State-of-the-Art ModelsOverviewThe Transformer model in NLP has truly changed the way we work with text dataTransformer is behind the recent N...原创 2019-07-08 19:27:40 · 1260 阅读 · 1 评论 -
Attention in Neural Networks Some variations of attention architectures
Attention in Neural Networks Some variations of attention architecturesIn an earlier post on “Introduction to Attention” we saw some of the key challenges that were addressed by the attention archit...原创 2019-07-08 19:39:57 · 378 阅读 · 0 评论 -
Toxic BERT plain vanila
# Version 2 + Bug fix - thanks to @chinhuic# This Python 3 environment comes with many helpful analytics libraries installed# It is defined by the kaggle/python docker image: https://github.com/kag...原创 2019-07-18 10:03:06 · 474 阅读 · 1 评论 -
Review Your NLP Knowledge
Review Your NLP Knowledge1. Abbreviated Words in NLP:LSTM: Long Short Term MemoryBert: Bidirectional Encoder Representations from Transformers.POS: parts of speech.DTM: Document Term Matrix.NER...原创 2019-07-18 10:18:50 · 264 阅读 · 0 评论 -
排名前三的自然语言处理库教程 ---- Adam Studio
前三名NLP库教程Notebook ContentIntroductionImportVersionSetupData setGendered Pronoun Analysisa. Problem Featureb. VariablesNLTKTokenizing sentencesNLTK and arraysNLTK stop wordsNLTK – s...原创 2019-07-17 23:30:19 · 1063 阅读 · 0 评论 -
学习ELMo从文本中提取特征的分步NLP指南
Introduction我从事不同的自然语言处理(NLP)问题(成为数据科学家的好处!)。 每个NLP问题都是以自己的方式面临的独特挑战。 这只是人类语言复杂,美丽和精彩的反映。但有一点一直是NLP从业者心中的荆棘是无法(机器)理解句子的真正含义。 是的,我在谈论背景。 当被要求执行基本任务时,传统的NLP技术和框架非常棒。 当我们试图为这种情况添加背景时,事情很快就消失了。NLP格局在过去...原创 2019-04-20 16:56:15 · 916 阅读 · 0 评论 -
注意神经机器翻译----创建与训练机器翻译模型
This notebook trains a sequence to sequence (seq2seq) model for Spanish to English translation. This is an advanced example that assumes some knowledge of sequence to sequence models.After training t...原创 2019-03-18 20:30:47 · 988 阅读 · 0 评论 -
自然语言处理2------CNN(Convolutional Neural Network)
CNN(Convolutional Neural Network)2-1. TextCNN - Binary Sentiment ClassificationPaper下载:TextCNN-Tensor.py''' code by Tae Hwan Jung(Jeff Jung) @graykode Reference : https://github.com/ioat...原创 2019-02-07 09:19:36 · 2427 阅读 · 0 评论 -
自然语言处理之----RNN(Recurrent Neural Network)
循环神经网络3-1. TextRNN - Predict Next StepPaper Finding Structure In TimeTextRNN-Tensor.py''' code by Tae Hwan Jung(Jeff Jung) @graykode'''import tensorflow as tfimport numpy as nptf.reset...原创 2019-02-07 09:33:33 · 562 阅读 · 1 评论 -
自然语言处理之------Attention Mechanism
Attention Mechanism4-1. Seq2Seq - Change WordPaper Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation(2014)''' code by Tae Hwan Jung(Jeff Jung) @gra...原创 2019-02-07 09:41:05 · 391 阅读 · 0 评论 -
自然语言处理之------ Model based on Transformer
Model based on TransformerDependencies:Python 3.5+Pytorch 0.4.1+5-1. The Transformer - TranslatePaper BERT: Pre-training of Deep Bidirectional Transformers for Language UnderstandingTran...原创 2019-02-07 09:50:46 · 947 阅读 · 0 评论 -
自然语言处理之-----Word2Vec
A Beginner’s Guide to Word2Vec and Neural Word EmbeddingsIntroduction to Word2VecWord2vec是一个处理文本的双层神经网络。它的输入是一个文本语料库,它的输出是一组向量:该语料库中单词的特征向量。虽然Word2vec不是深度神经网络,但它将文本转换为深网可以理解的数字形式。 Deeplearning4j实现...原创 2019-02-22 22:33:06 · 1699 阅读 · 1 评论 -
机器学习算法之------LSTM
Understanding LSTM NetworksRecurrent Neural NetworksHumans don’t start their thinking from scratch every second. As you read this essay, you understand each word based on your understanding of pre...原创 2019-02-23 07:59:11 · 3947 阅读 · 0 评论 -
使用 Doc2Vec & Logistic Regretion 进行多类文本分类
The goal is to classify consumer finance complaints into 12 pre-defined classes using Doc2Vec and Logistic RegressionDoc2vec is an NLP tool for representing documents as a vector and is a generalizin...原创 2019-02-23 11:28:34 · 1337 阅读 · 0 评论 -
使用Python对Anthem的游戏发布进行情感分析
Video game launches are plagued by drama. From misleading pre-order bundles, to games that are far from complete at launch, big publishers have quite a bit of risk to manage when it comes to deciding ...翻译 2019-02-24 23:06:31 · 813 阅读 · 0 评论 -
使用word2vec分析新闻标题并预测文章流行度
Can word embeddings of article titles predict popularity? What can we learn about the relationship between sentiment and shares? word2vec can help us answer these questions, and more.Word embeddings...原创 2019-03-21 15:40:17 · 2210 阅读 · 0 评论 -
解析BERT
什么是BERT?BERT是Transformer的双向编码器表示的缩写。它是由Google在2018年末开发和发布的一种新型语言模型。像BERT这样的预训练语言模型在许多自然语言处理任务中发挥着重要作用,例如问答,命名实体识别,自然语言推理,文本分类等等BERT是一种基于微调的多层双向变压器编码器。此时,介绍Transformer架构非常重要。什么是变压器?2017年,谷歌发表了一篇题为“...原创 2019-07-26 14:38:45 · 4591 阅读 · 0 评论