《Translating Embeddings for Modeling Multi-relational Data》阅读笔记

最新推荐文章于 2022-07-15 14:35:22 发布

yangOvOyang

最新推荐文章于 2022-07-15 14:35:22 发布

阅读量1.4k

点赞数 3

分类专栏：知识图谱

知识图谱专栏收录该内容

2 篇文章 1 订阅

订阅专栏

Abstract

We propose TransE, embedding entities and relationships of multi-relational data in low-dimensional vector space. It significantly outperforms state-of-the-art methods in link prediction on two knowledge bases. It can also be successfully trained on a large scale data set.

Introduction

Our work focuses on modeling multi-relational data from KBs(Knowledge Base), with the goal of providing an efficient tool to complete them by automatically adding new facts, without requiring extra knowledge.

1. Modeling Multi-relational data

In contrast to single-relational data, the difficulty of multi-relational data is that the notion of locality may involve relationships and entities of different types at the same time, so that modeling multi-relational data requires more generic approaches that can choose the appropriate patterns considering all heterogeneous relationships at the same time.

… suggested that even in complex and heterogeneous multi-relational domains simple yet appropriate modeling assumptions can lead to better trade-offs(权衡) between accuracy and scalability(可扩展性).

2. Relationships as translations in the embedding space

TransE, an energy-based model for learning low-dimensional embeddings of entities. In TransE, realtionships are represented as translations in the embedding space: if $(h,l,t)$ holds, then the embedding of the tail entity should be close to embedding of the head entity $h$ plus some vector that depends on the relationship $l$ . This approach relies on a reduced set of parameters as it learns only one low-dimensional vector for each entity and each relationship.

The main motivation behind our translation-based parameterization is that hierarchical relationships are extremely common in KBs and translations are the natural transformations for representing them. Indeed, considering the natural representation of trees(i.e. embeddings of the nodes in dimension 2), the siblings are close to each other and nodes at a given height are organized on the x-axis, the parent-child relationship corresponds to a translation on the y-axis.

ps: a null translation vector corresponds to an equivalence relationship between entities.

Translation-based Model

Given a training set $S$ of triplets $(h,l,t)$ composed of two entities $h,t \in E$ (the set of entities) and a relationship $l \in L$ (the set of relationships), the model learns vector embeddings of the entities and the relationships.

Note that for a given entity, its embedding vector is the same when the entity appears as the head or as the tail of a triplet.

We want that $h+l \approx t$ when $(h,l,t)$ holds ( $t$ should be a nearest neighbor of $h+l$ ), while $h+l$ should be far away from $t$ otherwise.

transe算法

measure d

Following an energy-based framework, the energy of a triplet is equal to $d(h+l,t)$ for some dissimilarity measure $d$ , which we take to be either the $L_1$ or the $L_2$ -norm(曼哈顿或欧几里得距离).
$d(h+l,t) = ||h+t-l||_2^2$

corrupted triplets $S_{(h,l,t)}^{'}$

either the head or tail replaced by a random entity(but not both at the same time)

$S_{(h,l,t)}^{'} = \{(h^{'},l,t)|h^{'}\in E\} \bigcup \{ (h,l,t^{'})|t^{'} \in E\}$ .

loss function $L$

Given margin hyperparameter $\gamma > 0$ 1, $[x]_+$ denotes the positive part of $x$ 2.

$L = \sum_{(h,l,t) \in S} \sum_{(h^{'},l^{'},t^{'}) \in S_{(h,l,t)}^{'}} [\gamma + d(h+l,t)-d(h^{'}+l,t^{'})]_+$

The loss function favors lower values of the energy for training triplets than for corrupted triplets, and thus a natural implementation of the intended criterion.

The optimization is carried out by stochastic gradient descent3 (in minibatch mode), over the possible $h$ , $l$ , $t$ , with the additional constraints that the $L_2-norm$ of the embeddings of the entities is 14(no regularization or norm constraints are given to the label embeddings $l$ ). It prevents the training process to trivially minimize $L$ by artificially increasing entity embeddings norms.

一般设置为1 ↩
当值大于零，取本身；小于零，取0 ↩
SGD, 随机梯度下降。这里是对一个batch求梯度之后就立即更新theta值 ↩
约束节点的嵌入(向量)的欧几里得距离为1，但是关系的嵌入不用约束 ↩

确定要放弃本次机会？
福利倒计时
: :

立减 ¥
普通VIP年卡可用
立即使用

yangOvOyang

关注关注

3
点赞

踩

5

收藏

觉得还不错? 一键收藏

0
评论

《Translating Embeddings for Modeling Multi-relational Data》阅读笔记

AbstractWe propose TransE, embedding entities and relationships of multi-relational data in low-dimensional vector space. It significantly outperforms state-of-the-art methods in link prediction on ...
复制链接

扫一扫

专栏目录

tactics-for-translating-Neologisms资料PPT课件.ppt

07-30

tactics-for-translating-Neologisms资料PPT课件.ppt

active-record-translating-from-orm-to-ar

03-10

从ORM转换为活动记录目标：了解ActiveRecord如何为您抽象功能强大的方法。确定如何继承模型。指示本实验旨在向您展示Active Record的强大功能。在spec/dog_spec.rb查看您的测试套件，现在有八个测试都失败了...

参与评论您还未登录，请先登录后发表或查看评论

论文阅读笔记（3）——Translating Embeddings for Modeling Multi-relational Data

strivequeen的博客

08-23 908

Abstract 我们考虑在低维向量空间中嵌入实体和多维数据关系的问题。目标是提出一种易于训练的规范模型，该模型包含数量减少的参数，并且可以扩展到非常大的数据库。因此提出了TransE，一种通过将关系解释为对实体的低维嵌入进行操作的翻译来建模关系的方法。尽管它很简单，但由于大量实验表明TransE在两个知识库的链接预测中明显优于最新方法，因此这种假设被证明是有效的。此外，它可以在具有1M 实体，25k关系和超过17M 训练样本的大规模数据集上成功进行训练。 1 Introduction 多重关系数据是指有向

Translating Embedding for Modeling Multi-relational Data

BUPT-WT的博客

01-16 779

研究意义: 1、学到了实体(entity)和关系(relation)的embedding表示 2、模型简单而有效，容易训练 3、启发了整个Trans系列知识表示学习，代表性工作有transH，transR，transD 4、引用量较高本文主要结构如下所示: 一、Abstract 提出基于一对多关系数据的建模，将实体和关系映射到低维空间摘要主要说明以下几个主要的关键点: 1、使用低维向量表示三元组数据(知识图谱)的实...

Translating embeddings for modeling multi-relational data

qq_34260382的博客

09-20 585

Translating embeddings for modeling multi-relational data 0. 目的将多关系数据中的实体和关系嵌入到低维空间中去表示 1. 总结提出一种基于翻译的模型 TransE, 将 relation(关系) 看作是在低维空间中由 head entity(头实体) 到 tail entity(尾实体) 的一种翻译 2. 主要思想对于正确的关系 ...

论文阅读：（TransE）Translating Embeddings for Modeling Multi-relational Data多关系数据转换嵌入建模

qq_38667212的博客

01-21 1125

目前研究方向为知识图谱补全，知识图谱补全现有五种常用的方法，分别为：基于知识表示的方法、基于路径查找的方法、基于推理规则的方法、基于强化学习的方法以及基于元学习的方法。寒假的任务是通过多阅读该领域经典论文，开拓思路，找到今后课题的研究方向。本节为阅读知识表示的经典算法——TransE

【论文研读】 Translating Embeddings for Modeling Multi-relational Data论文理解

Bessie_Lee

04-03 2319

文章目录embedding知识图谱 embedding 首先，我们得知道什么是embedding？他有什么优点？他为什么要被开发出来？传统的one-hot编码不可以吗？接下来让我们一一进行解决 1、什么是embedding？ ① 维基百科的官方回答： ② 自己的理解：（这个结合了一下一篇大佬的博客思路进行理解的）首先，embedding在pytorch或者tf中会被经常使用，m代表的是单词的数目，n代表的是词语嵌入的维度。其次，词嵌入（word embedding）是一个大矩阵，一行代表的是一个单词。

translating-orm-to-ar-online-web-sp-000

03-19

目标：了解ActiveRecord如何为您抽象功能强大的方法。确定如何继承模型。本实验旨在向您展示ActiveRecord 。在spec/dog_spec.rb查看您的测试套件，现在有八个测试都失败了。过去，您必须单独编写每种方法才能通过每...

Translating-a-Unity-GameObject-Between-Two-Vectors:一个简单的Unity C＃脚本，可以按照给定的间隔连续地来回转换GameObject

05-08

在两个向量之间转换Unity GameObject 为了提供上下文，我有一个GameObject充当了障碍。它从屏幕的左侧开始，我希望它转换为右侧，然后再次返回。该技工必须连续执行。我们首先定义两个字段，一个布尔值和一个...

translating-divinity-2:这是尝试使神译自动执行

05-15

到目前为止，这些说明仅供程序员阅读，如果将来有人需要，我会在将来对其进行更新。配置您将需要。请注意，翻译文本会花费您很多钱，我相信整个翻译大约要花20美元，但我不能保证。，看看您的Google Translate...

multi-relational data mining

04-12

multi-relational data mining

论文解读：（TransE）Translating Embeddings for Modeling Multi-relational Data

热门推荐

夏栀的博客

11-29 1万+

论文解读：（TransE）Translating Embeddings for Modeling Multi-relational Data 表示学习是深度学习的基石，正式表示学习才能让深度学习可以自由的挖掘更深层次的特征。自word embedding（词嵌入表示）的提出，一种对结构化信息的三元组的表示学习也进入研究视野。TransE模型正是一种基于深度学习的知识表示方法，也是Trans系列...

论文翻译解读：Translating Embeddings for Modeling Multi-relational Data【TransE】

weixin_43923463的博客

07-15 720

我们考虑在低维向量空间中嵌入多关系数据的实体和关系的问题。我们的目标是提出一个权威模型，该模型易于训练，包含较少的参数，并且可以扩展到非常大的数据库。因此，我们提出了TransE，一种通过将关系解释为对实体的低维嵌入操作的平移来建模关系的方法。尽管简单，但这个假设被证明是强大的，因为大量的实验表明，TransE在两个知识库上的链接预测方面明显优于最先进的方法。此外，它可以成功地在拥有1M实体、25k关系和17M以上训练样本的大规模数据集上训练。https。...

Translating Embeddings for Modeling Multi-relational Data 笔记（基于Translation提出了TransE）

test

03-14 1664

更多图神经网络和深度学习内容请移步：论文：Translating Embeddings for Modeling Multi-relational Data 论文链接：https://proceedings.neurips.cc/paper/2013/file/1cecc7a77928ca8133fa24680a88d2f9-Paper.pdf 摘要 We consider the problem of embedding entities and relationships of multi-rela

论文简读-TransE-《Translating Embeddings for Modeling Multi-relational Data》

qq_26623993的博客

06-08 625

TransE: Translating Embeddings for Modeling Multi-relational Data 1. introduction 本文研究的是将知识图谱中实体和关系嵌入(embedding)至低维向量空间的问题，提出了名为TransE的实体与关系表示方法，该方法将实体与实体的关系看成是翻译操作。 2. Translation-based model 给出一个知识图谱中的三元组(h,r,t)(h, r, t)(h,r,t),其中h,t∈E,r∈Rh,t \in E, r \

【论文代码复现】Translating Embeddings for Modeling Multi-relational Data中TransE代码实现+遇到的错误

Bessie_Lee

04-18 1038

文章目录1、相关链接2、代码报错汇总2.1、报错：AssertionError: Torch not compiled with CUDA enabled2.2、TransE部分报错（憨憨报错）2.3、文件找不到3、运行结果截图4、完整代码+注释本文创建初心：想为看Translating Embeddings for Modeling Multi-relational Data这篇文章的人提供一个完整的资源与遇到的情况的一个汇总【注】本文的代码来源于GitHub上面一位优秀的博主，所有的相关链接都会

Translating Embeddings for Modeling Multi-relational Data 论文翻译：多元关系数据嵌入

和而不流

01-12 1万+

摘要 1简介 2transE模型 3相关工作 4实验 1数据集 2实验设置 3链接预测 4用几个例子学习预测新关系 5总结和展望摘要：考虑多元关系数据得实体和关系在低维向量空间的嵌入问题。我们的目标是提出一个权威的模型，该模型比较容易训练，包含一组简化了的参数，并且能够扩展到非常大的数据库。因此，我们提出了TransE，一个将关系作为低维空间实体嵌入的翻译的方法。尽管它很简单，

《Graph Representation Learning》【4】——Multi-relational Data and Knowledge Graphs

智慧的旋风的博客

11-27 749

4 Multi-relational Data and Knowledge Graphs 这一部分，我们将介绍多关系图（multi-relational graph）中的浅层嵌入方法。 Knowledge graph completion 这一章的大多数方法，最初都是为完成知识图谱任务而设计的。在多关系图中，我们一般会定义这样的三元组（tuple）：e=(u,τ,v)e=(u,\tau,v)e=(u,τ,v)，来表示两个节点之间存在某种关系。一般来说，知识图谱任务的目标是预测图中缺失的边，即链接预测（li

知识图谱表示学习 TransE: Translating Embeddings for Modeling Multi-relational Data

Pengwill's Blog

06-24 2048

知识图谱表示学习 TransE: Translating Embeddings for Modeling Multi-relational Data 表示学习是深度学习的基础，将数据用更有效的方式表达出来，才能让深度学习发挥出更强大的作用。表示学习避免了手动提取数据特征的繁琐，允许计算机学习特征的同时，也学习如何提取特征。尽管举例基于翻译（translation）的知识图谱表示学习已经过去了五六年的时间，但是仍不可忽略其重要意义。本文聚焦于TransE模型。 1. 引言多元关系数据（Multi-relat

如何基于TransE或类似模型进行推理？请提供技术细节以及一些例子。

最新发布

04-23

对于基于TransE或类似模型进行推理，通常可以采用以下步骤： 1. 构建知识图谱：将知识库中的实体和关系抽象成节点和边，构建一个图谱。 2. 训练TransE模型：使用知识图谱作为输入，训练TransE模型来学习实体之间的关系。 3. 进行推理：通过查找知识图谱中的实体和关系，进行推理。其中，比较关键的是如何训练TransE模型。TransE模型的核心思想是将实体和关系映射到同一向量空间中，从而在向量空间中计算它们之间的相似度。在训练阶段，需要最小化实体和关系之间的距离，使得真实的三元组距离近，而虚假的三元组距离远。相似度可以使用余弦相似度或点积等函数计算，具体实现可参考论文《TransE: Translating Embeddings for Modeling Multi-relational Data》。下面给出一个简单的例子：假设有一个知识库包含以下三元组：（Tom, hasChild, Harry）（Tom, hasChild, Lily）（Lily, sibling, Harry）使用TransE模型，我们可以将Tom、Harry和Lily分别映射到向量空间中的三个向量，然后通过计算向量之间的距离，来推理Tom是否是Harry的父亲。具体过程如下： 1. 将实体和关系映射到向量空间中： Tom -> (0, 0) Harry -> (2, 0) Lily -> (1, 1) hasChild -> (1, 0) sibling -> (0, 1) 2. 通过向量之间的距离计算相似度： sim(Tom, hasChild, Harry) = cos((0+1-2)/3) ≈ -0.63 sim(Tom, hasChild, Lily) = cos((0+1-1)/3) ≈ 0.33 sim(Tom, sibling, Harry) = cos((0-1-2)/3) ≈ -0.94 由此可见，Tom与Harry之间的相似度较低，因此不能推断Tom是Harry的父亲。而Tom与Lily之间的相似度较高，说明Tom是Lily的父亲。

“相关推荐”对你有帮助么？

非常没帮助

没帮助

一般

有帮助

非常有帮助

提交