Stage 2: TensorFlow Programs Explained with Illustrations (Part 7): Embeddings


Alun_Sun

Category: tensorflow1.4. Tags: tensorflow, machine learning, deep learning, programs

Article link: https://blog.csdn.net/jk981811667/article/details/78911954


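This article explains the concept of embeddings in TensorFlow in detail, provides a simple example of training an embedding, and explains how to view embeddings with the TensorBoard Embedding Projector. It is suited to newcomers to machine learning or TensorFlow, and to Embedding Projector users at all levels. It discusses the value of embeddings as inputs to and outputs of machine learning, and how to create and visualize word embeddings in TensorFlow.

Summary generated automatically by CSDN.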

This document introduces the concept of embeddings, gives a simple example of how to train an embedding in TensorFlow, and explains how to view embeddings with the TensorBoard Embedding Projector. The first two parts target newcomers to machine learning or TensorFlow, and the Embedding Projector how-to is for users at all levels.

An embedding is a mapping from discrete objects, such as words, to vectors of real numbers. For example, a 300-dimensional embedding for English words could include:

blue:  (0.01359, 0.00075997, 0.24608, ..., -0.2524, 1.0048, 0.06259)
blues:  (0.01396, 0.11887, -0.48963, ..., 0.033483, -0.10007, 0.1158)
orange:  (-0.24776, -0.12359, 0.20986, ..., 0.079717, 0.23865, -0.014213)
oranges:  (-0.35609, 0.21854, 0.080944, ..., -0.35413, 0.38511, -0.070976)

The individual dimensions in these vectors typically have no inherent meaning. Instead, it’s the overall patterns of location and distance between vectors that machine learning takes advantage of.
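To make the idea of "location and distance" concrete, here is a minimal sketch in plain Python. It treats an embedding as a lookup table from words to vectors and compares vectors by cosine similarity. The 4-dimensional values, the table name EMBEDDINGS, and the helpers embed, cosine_similarity, and nearest are all invented for illustration; they do not come from a trained model or from the TensorFlow API, and cosine similarity is only one of several reasonable ways to measure closeness.

```python
import math

# Hypothetical 4-dimensional embedding table. The values are made up for
# illustration only; a real embedding would be learned during training.
EMBEDDINGS = {
    "blue":    [0.9, 0.1, 0.0, 0.2],
    "blues":   [0.8, 0.2, 0.1, 0.3],
    "orange":  [0.1, 0.9, 0.2, 0.0],
    "oranges": [0.2, 0.8, 0.3, 0.1],
}


def embed(word):
    """Map a discrete object (a word) to its vector of real numbers."""
    return EMBEDDINGS[word]


def cosine_similarity(u, v):
    """Cosine of the angle between u and v; 1.0 means the same direction."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def nearest(word):
    """Return the other vocabulary word whose vector is most similar."""
    others = (w for w in EMBEDDINGS if w != word)
    return max(others, key=lambda w: cosine_similarity(embed(word), embed(w)))


if __name__ == "__main__":
    for word in EMBEDDINGS:
        print(word, "is closest to", nearest(word))
```

No single coordinate in the toy table "means" colour or fruit. What a downstream model can exploit is that the vectors for "blue" and "blues" lie close together and far from "orange" and "oranges".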

Embeddings are important for input to machine learning. Classifiers, and neural networks more generally, work on vectors of real numbers. They train best on dense vectors, where all values contribute to define an object. However, many important inputs to machine learning, such as words of text, do not have a natural vector representation. Embedding functions are the standard and eff