斯坦福课程Knowledge Graphs-What is a Knowledge Graph?

1. Introduction

Knowledge graphs have emerged as a compelling abstraction for organizing world’s structured knowledge over the internet, and a way to integrate information extracted from multiple data sources. Knowledge graphs have also started to play a central role in machine learning as a method to incorporate world knowledge, as a target knowledge representation for extracted knowledge, and for explaining what is learned.

Our goal here is to explain the basic terminology, concepts and usage of knowledge graphs in a simple to understand manner. We do not intend to give here an exhaustive survey of the past and current work on the topic of knowledge graphs.

We will begin by defining knowledge graphs, some applications that have contributed to the recent surge in the popularity of knowledge graphs, and then use of knowledge graphs in machine learning. We will conclude this chapter by summarizing what is new and different about the recent use of knowledge graphs.

2. Knowledge Graph Definition

A knowledge graph is a directed labeled graph in which the labels have well-defined meanings. A directed labeled graph consists of nodes, edges, and labels. Anything can act as a node, for example, people, company, computer, etc. An edge connects a pair of nodes and captures the relationship of interest between them, for example, friendship relationship between two people, customer relationship between a company and person, or a network connection between two computers. The labels capture the meaning of the relationship, for example, the friendship relationship between two people.

More formally, given a set of nodes N, and a set of labels L, a knowledge graph is a subset of the cross product N × L × N. Each member of this set is referred to as a triple, and can be visualized as shown below.

The directed graph representation is used in a variety of ways depending on the needs of an application. A directed graphs such as the one in which the nodes are people, and the edges capture friendship relationship is also known as a data graph. A directed graph in which the nodes are classes of objects (e.g., Book, Textbook, etc.), and the edges capture the subclass relationship, is also known as a taxonomy. In some data models, A is referred to as subject, B is referred to as predicate, and C is referred to as object.

Many interesting computations over graphs can be reduced to navigation. For example, in a friendship knowledge graph, to calculate the friends of a friends of a person A, we can navigate the knowledge graph from A to all nodes B connected to it by a relation labeled as friend, and then recursively to all nodes C connected by the friend relation to each B.

A path in a graph G is a series of nodes (v1, v2,…, vn) where for any i ∈ N with 1 ≤ i < n, there is an edge from vi to vi+1. A simple path is a path with no repeated nodes. A cycle is a path in which the first and the last nodes are the same. Usually, we are interested in only those paths in which the edge label is the same for every pair of nodes. It is possible to define numerous additional properties over the graphs (e.g., connected components, strongly connected components), and provide different ways to traverse the graphs (e.g., shortest path, Hamiltonian path, etc.).

3. Recent Applications of Knowledge Graphs

There are numerous applications of knowledge graphs both in research and industry. Within computer science, there are many uses of a directed graph representation, for example, data flow graphs, binary decision diagrams, state charts, etc. For our discussion here, we have chosen to focus on two concrete applications that have led to recent surge in popularity of knowledge graphs: organizing information over internet and data integration.

3.1 Knowledge Graphs for organizing Knowledge over the Internet

We will explain the use of a knowledge graph over the web by taking the concrete example of Wikidata. Wikidata acts as the central storage for the structured data for Wikipedia. To show the interplay between the two, and to motivate the use of Wikidata knowledge graph, consider the city of Winterthur in Switzerland which has a page in Wikipedia. The Wikipedia page for Winterthur lists its twin towns: two are in Switerzland, one in Czech Republic, and one in Austria. The city of Ontario in California that has a Wikipedia page titled Ontario, California, lists Winterthur as its sister city. The sister city and twin city relationships are identical as well as reciprocal. Thus, if a city A is a sister city of another city B, then B must be a sister city of A. This inference should be automatic, but because this information is stated in English in Wikipedia, it is not easy to detect this discrepancy. In contrast, in the Wikidata representation of Winterthur, there is a relationship called twinned administrative body that lists the city of Ontario. As this relationship is symmetric, the Wikidata page for the city of Ontario automatically includes Winterthur. Thus, when Wikidata knowledge g

  • 0
    点赞
  • 1
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
Contents Knowledge Graphs: Venturing Out into the Wild . . . . . . . . . . . . . . . . . . . . 1 Gerard de Melo Information Extraction from the Web by Matching Visual Presentation Patterns . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10 Radek Burget Statistical Induction of Coupled Domain/Range Restrictions from RDF Knowledge Bases . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27 Basil Ell, Sherzod Hakimov, and Philipp Cimiano Wikipedia and DBpedia for Media - Managing Audiovisual Resources in Their Semantic Context . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41 Jean-Pierre Evain, Mike Matton, and Tormod Vaervagen Identifying Global Representative Classes of DBpedia Ontology Through Multilingual Analysis: A Rank Aggregation Approach . . . . . . . . . . 57 Eun-kyung Kim and Key-Sun Choi Identifying Poorly-Defined Concepts in WordNet with Graph Metrics . . . . . . 66 John P. McCrae and Narumol Prangnawarat Extracting Process Graphs from Medical Text Data: An Approach Towards a Systematic Framework to Extract and Mine Medical Sequential Processes Descriptions from Large Text Sources. . . . . . . . . . . . . . . . . . . . . 76 Andreas Niekler and Christian Kahmann Chainable and Extendable Knowledge Integration Web Services. . . . . . . . . . 89 Felix Sasaki, Milan Dojchinovski, and Jan Nehring Entity Typing Using Distributional Semantics and DBpedia . . . . . . . . . . . . . 102 Marieke van Erp and Piek Vossen WC3: Analyzing the Style of Metadata Annotation Among Wikipedia Articles by Using Wikipedia Category and the DBpedia Metadata Database . . . 119 Masaharu Yoshioka Author Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 137
知识图谱是一种以图的形式表示知识的技术,广泛应用于知识表示、获取和应用等领域。在知识图谱的表示方面,常用的方法有本体建模、属性图、图神经网络等。本体建模是一种基于概念和关系的形式化知识表示方法,通过定义类、属性和关系等概念来描述知识之间的关系。属性图是一种将实体和属性表示为节点,关系表示为边的图结构,可以通过图分析技术来挖掘实体之间的关联性和特征。图神经网络是一种基于图结构数据进行表示学习的方法,可以通过节点嵌入和图编码等技术来提取知识图谱中的特征和关系。 在知识图谱的获取方面,主要涉及到实体识别、关系抽取和知识补全等任务。实体识别是指从文本中识别出具有指定语义类型的实体,常借助于命名实体识别和实体链接等技术。关系抽取是指从文本中自动抽取出实体之间的关系,可以通过基于规则、基于统计的方法和深度学习等方法来实现。知识补全是指基于已有知识图谱和外部知识源,自动推理和预测未知实体和关系的方法,可以通过图嵌入和图神经网络等技术来实现。 在知识图谱的应用方面,涵盖了很多领域,如自然语言理解、智能推荐和智能问答等。在自然语言理解中,知识图谱可以用于文本的语义表示和语义搜索,提升文本理解和信息检索的效果。在智能推荐中,知识图谱可以通过分析用户行为和之间的关系,为用户推荐具有相关性且符合用户偏好的内容。在智能问答中,知识图谱可以提供丰富的语义信息,帮助机器理解用户的问题并提供准确的答案。 综上所述,知识图谱的研究涵盖了知识图谱的表示、获取和应用等方面,通过各种方法和技术,为知识的表示、获取和应用提供了有效的解决方案。

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值