1. introduction
(1) the definition of KG
A KG is a multi-relational graph composed of entities (nodes) and relations (different types of edges).
(2) examples
Freebase, DBpedia, YAGO, NELL
(3) key idea of KGE
The key idea is to embed components of a KG including entities and relations into continuous vector spaces, so as to simplify the manipulation while preserving the inherent structure of the KG.
(4) traditional procedure of KGE
Given a KG, such a technique first represents entities and relations in a continuous vector space, and defines a scoring function on each fact to measure its plausibility. Entity and relation embeddings can then be obtained by maximizing the total plausibility of observed facts.
2. notations
…
3. KG Embedding with Facts Alone
(1) facts
Facts observed in the KG are stored as a collection of triples D+={(h,r,t)}. Each triple is composed of a head entity h∈E, a tail entity t∈E, and a relation r∈R between them.
(2) steps
① representing entities and relations
Entities are usually represented as vectors, i.e., deterministic points in the vector space. Relations are typically taken as operations in the vector space.
② defining a scoring function
A scoring function fr(h,t) is defined on each fact (h,r,t) to measure its plausibility. Facts observed in the KG tend to have higher scores than those that have not been observed.
③ learning entity and relation representations
The third step solves an optimization problem that maximizes the total plausibility of observed facts.
(3) embedding techniques
① Translational Distance Models
exploit distance-based scoring functions; measure the plausibility of a fact as the distance between the two entities, usually after a translation carried out by the relation
(i) TransE
-represents both entities and relations as vectors in the same space, say Rd
-Given a fact (h,r,t), the relation is interpreted as a translation vector r so that the embedded entities h and t can be connected by r with low error, i.e., h+r≈t when (h,r,t) holds.
-
-TransE has flaws in dealing with 1-to-N, N-to-1, and N-to-N relations : given a 1-to-N relation, e.g., DirectorOf, TransE might learn very similar vector representations for Psycho, Rebecca, and RearWindow which are all films directed by AlfredHitchcock, even though they are totally different entities
(ii) TransE’s Extensions
Relation-Specific Entity Embeddings: