ELMo
- Published in 2018; the name stands for Embeddings from Language Models.
- Deep contextualized word representations that model both the complex characteristics of word use and how those uses vary across linguistic contexts.
- It enables models to better disambiguate between the different senses of a given word (e.g. river "bank" vs. money "bank").
- ELMo determines word embeddings dynamically in the downstream task: the same word receives a different vector depending on its context.
- ELMo generates three embeddings per token: (1) the context-independent word embedding, (2) the 1st LSTM layer's output, and (3) the 2nd LSTM layer's output.
- Pre-training -> yields three embeddings (v1, v2, v3) per word (big-data environment).
- Fine-tuning -> freeze the embeddings and train weights (w1, w2, w3) for (v1, v2, v3) (local environment).
- The final embedding is w1*v1 + w2*v2 + w3*v3, as sketched below.
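
A minimal PyTorch sketch of this freeze-and-reweight step, with illustrative names and dimensions: v1, v2, v3 stand in for the frozen pre-trained layer outputs, and only the scalar mix weights are trained by the downstream task. The softmax normalization follows the ELMo paper's scalar-mix formulation; the notes' plain w1*v1 + w2*v2 + w3*v3 is the unnormalized version.

```python
import torch
import torch.nn as nn

class ScalarMix(nn.Module):
    """Trainable weighted sum of frozen layer representations."""
    def __init__(self, num_layers=3):
        super().__init__()
        # One trainable scalar per frozen layer (w1, w2, w3).
        self.weights = nn.Parameter(torch.zeros(num_layers))

    def forward(self, layers):
        # Softmax keeps the combination a convex mix of the layers;
        # the frozen vectors themselves receive no gradient updates.
        w = torch.softmax(self.weights, dim=0)
        return sum(wi * vi for wi, vi in zip(w, layers))

# Three frozen per-token "embeddings": batch of 2 sentences, 7 tokens, dim 1024.
v1, v2, v3 = (torch.randn(2, 7, 1024) for _ in range(3))
final = ScalarMix()([v1, v2, v3])  # ~ w1*v1 + w2*v2 + w3*v3, shape (2, 7, 1024)
```

The paper additionally learns a task-specific scale gamma that multiplies the whole sum; it is omitted here for brevity.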
Two-layer bidirectional LSTM backbone
Two-layer - so the layers can learn different aspects of word use (the lower layer tends to capture syntax, the upper layer semantics).
Bidirectional - to learn from context on both sides (the context before and the context after a word); see the sketch below.
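
A simplified sketch of where the three embeddings come from, under two loud assumptions: real ELMo builds its token embedding from a character CNN (not a lookup table), and it trains the forward and backward LSTMs as two separate language models rather than as a single jointly trained bidirectional layer. The class and variable names are made up for illustration.

```python
import torch
import torch.nn as nn

class TwoLayerBiLSTM(nn.Module):
    def __init__(self, vocab_size, dim=512):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, dim)  # v1: context-free token embedding
        # dim // 2 per direction so each biLSTM layer outputs `dim` features.
        self.lstm1 = nn.LSTM(dim, dim // 2, bidirectional=True, batch_first=True)
        self.lstm2 = nn.LSTM(dim, dim // 2, bidirectional=True, batch_first=True)

    def forward(self, token_ids):
        v1 = self.embed(token_ids)  # (batch, seq, dim)
        v2, _ = self.lstm1(v1)      # 1st layer: tends to capture syntactic cues
        v3, _ = self.lstm2(v2)      # 2nd layer: tends to capture semantic cues
        return v1, v2, v3

ids = torch.randint(0, 1000, (2, 7))            # 2 sentences, 7 token ids each
v1, v2, v3 = TwoLayerBiLSTM(vocab_size=1000)(ids)
print(v1.shape, v2.shape, v3.shape)             # all torch.Size([2, 7, 512])
```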
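
For an end-to-end check of the disambiguation claim above, here is a hedged usage sketch with the AllenNLP Elmo module; the S3 URLs (pointing at the small published ELMo weights) and the exact keyword arguments are assumptions based on that library's published interface, not something these notes specify.

```python
import torch
from allennlp.modules.elmo import Elmo, batch_to_ids

# Assumed locations of the small pre-trained ELMo files (may have moved).
OPTIONS = ("https://allennlp.s3.amazonaws.com/elmo/2x1024_128_2048cnn_1xhighway/"
           "elmo_2x1024_128_2048cnn_1xhighway_options.json")
WEIGHTS = ("https://allennlp.s3.amazonaws.com/elmo/2x1024_128_2048cnn_1xhighway/"
           "elmo_2x1024_128_2048cnn_1xhighway_weights.hdf5")

elmo = Elmo(OPTIONS, WEIGHTS, num_output_representations=1, dropout=0.0)

sentences = [["He", "sat", "by", "the", "river", "bank"],
             ["She", "deposited", "cash", "at", "the", "bank"]]
char_ids = batch_to_ids(sentences)                 # character ids per token
out = elmo(char_ids)["elmo_representations"][0]    # (batch, seq_len, dim)

# "bank" is token index 5 in both sentences; its vectors differ by context.
sim = torch.cosine_similarity(out[0, 5], out[1, 5], dim=0)
print(f"cosine similarity of the two 'bank' vectors: {sim.item():.3f}")
```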