【转】知识图谱(KG)存储、可视化、公开数据集、图计算、图编程工具分享

知识图谱(Knowledge Graph),在图书情报界称为知识域可视化或知识领域映射地图,是显示知识发展进程与结构关系的一系列各种不同的图形,用可视化技术描述知识资源及其载体,挖掘、分析、构建、绘制和显示知识及它们之间的相互联系。

    知识图谱是通过将应用数学、图形学、信息可视化技术、信息科学等学科的理论与方法与计量学引文分析、共现分析等方法结合,并利用可视化的图谱形象地展示学科的核心结构、发展历史、前沿领域以及整体知识架构达到多学科融合目的的现代理论。它能为学科研究提供切实的、有价值的参考。

    本资源整理了知识图谱相关开源工具,涉及知识图谱公开数据集、知识图谱存储、可视化、知识融合、知识图谱计算相关工具、资料。

目录

    基础架构

        o图数据库工具

        o三元组存储工具

        o图计算框架工具

        o图可视化工具

        o图处理相关编程语言

        o图应用服务工具

    知识工程

        o知识融合

    知识图谱数据集

        o一般数据集

        o语义网络数据集

        o学术数据集

    学习资料

        o官方文档

        o学习社区资料

基础架构

    图数据库工具

    AgensGraph - multi-model graph database with SQL and Cypher support based on PostgreSQL

    ArangoDB - highly available Multi-Model NoSQL database

    Blazegraph - GPU accelerated graph database

    Cayley - open source database written in Go

    CosmosDB - cloud-based multi-model database with support for TinkerPop3

    Dgraph - Fast, Transactional, Distributed Graph Database (open source, written in Go)

    DSE Graph - Graph layer on top of DataStax Enterprise (Cassandra, SolR, Spark)

    Grakn.AI - a distributed hyper-relational database for knowledge-oriented systems, i.e. a distributed knowledge base

    Graphd - the Metaweb/Freebase Graph Repository

    JanusGraph - an open-source, distributed graph database with pluggable storage and indexing backends

    Memgraph - High Performance, In-Memory, Transactional Graph Database

    Neo4j - OLTP graph database

    Sparksee - makes space and performance compatible with a small footprint and a fast analysis of large networks

    Stardog - RDF graph database with OLTP and OLAP support

    OrientDB - Distributed Multi-Model NoSQL Database with a Graph Database Engine

    TigerGraph - a complete, distributed, parallel graph computing platform for enterprise, supporting web-scale data analytics in real-time.

    Nebula Graph - A truly distributed, linear scalable, lightning-fast graph database, using SQL-like query language.

    HugeGraph - An open source TinkerPop 3 compliant OLTP Graph Database with pluggable storage bakcend which is similar to JanusGraph. It also supports OLAP through Spark GraphX.

    三元组存储工具

    AllegroGraph - high-performance, persistent graph database that scales to billions of quads

    Apache Jena - open source Java framework for building Semantic Web and Linked Data applications

    Eclipse RDF4J - (formerly known as Sesame) is an open source Java framework for processing RDF data. This includes parsing, storing, inferencing and querying of/over such data. It offers an easy-to-use API that can be connected to all leading RDF storage solutions. It allows you to connect with SPARQL endpoints and create applications that leverage the power of linked data and Semantic Web.

    GraphDB - enterprise ready Semantic Graph Database, compliant with W3C Standards

    Virtuoso - a "Data Junction Box" that drives enterprise and individual agility by deriving a Semantic Web of Linked Data from existing data silos

    Hoply - explore bigger than RAM relational data in the comfort of Python.

    图计算框架工具

    Apache Giraph - an iterative graph processing system built for high scalability

    Apache TinkerPop - a graph computing framework for both graph databases (OLTP) and graph analytic systems (OLAP)

    Apache Spark - GraphX - Apache Spark's API for graphs and graph-parallel computation

    Tencent Plato - a fast distributed graph computation and machine learning framework used by WeChat.

    图可视化工具

    AntV G6 - Simple, easy and complete high performance graph visualization engine written in JavaScript, from Ant Financial

    Gephi - Graph visualization platform software runs on Windows, Mac and Linux.

    KeyLines & ReGraph - Graph visualization tookits for JavaScript and React developer from Cambridge Intelligence.

    Linkurious - Linkurious is an enterprise ready on-premises graph visualization and analysis platform.

    图处理相关编程语言

    Cypher

    Gremlin

    SPARQL

    GraphQL+- - The query language of Dgraph, which is based on Facebook's GraphQL

    GQL - An initiative to create a standard query language for property graph database, just like SQL for relational database.

    图应用服务工具

    CosmosDB @ Microsoft - Azure Cosmos DB is Microsoft's globally distributed, multi-model (Key-value, Document, Column, Graph) database service.

    JanusGraph @ IBM Compose

    JanusGraph @ Google Cloud Platform - JanusGraph on Google Kubernetes Engine backed by Google Cloud Bigtable

    JanusGraph @ Amazon Web Services Labs - The Amazon DynamoDB Storage Backend for JanusGraph

    Neo4j @ Graphene

    Neo4j @ Graph Story - End-to-end Graph Database hosting for Community and Enterprise Neo4j with expert help for development

    Neptune @ Amazon Web Services - a fast, reliable, fully-managed graph database service that makes it easy to build and run applications that work with highly connected datasets

    Graph Engine Service @ Huawei Cloud - Fully-managed, distributed, at-scale graph query and analysis service that provides a visualized interactive analytics platform.

    Graph Database (beta) @ Aliyun (Alibaba Cloud) - highly reliable and available property graph database that supports ACID and TinkerPop Gremlin query language.

    Tencent Knowledge Graph @ Tencent Cloud - One stop platform for Graph database, computing and visualization. Currently available in beta test and only in Chinese.

    

知识工程

    YAGA-NAGA - Harvesting, Searching, and Ranking Knowledge from the Web

    知识融合

    Dedupe - dedupe is a python library that uses machine learning to perform fuzzy matching, deduplication and entity resolution quickly on structured data.

    LIMES - Link Discovery Framework for Metric Spaces.

    

知识图谱数据集

    一般数据集

    BabelNet - Both a multilingual encyclopedic dictionary, with lexicographic and encyclopedic coverage of terms, and a semantic network which connects concepts and named entities in a very large network of semantic relations, made up of about 16 million entries, called Babel synsets. Each Babel synset represents a given meaning and contains all the synonyms which express that meaning in a range of different languages.

    Wikidata - Wikidata is a free, collaborative, multilingual, secondary database, collecting structured data to provide support for Wikipedia, Wikimedia Commons, the other wikis of the Wikimedia movement, and to anyone in the world.

    Google Knowledge Graph - Google’s Knowledge Graph has millions of entries that describe real-world entities like people, places, and things.

    DBpedia - DBpedia is a crowd-sourced community effort to extract structured content from the information created in various Wikimedia projects.

    XLore - A large-scale English-Chinese bilingual knowledge graph by structuring and integrating Chinese Wikipedia, English Wikipedia, French Wikipedia, and Baidu Baike.

    The GDELT Project - The GDELT Project monitors the world's broadcast, print, and web news from nearly every corner of every country in over 100 languages and identifies the people, locations, organizations, themes, sources, emotions, counts, quotes, images and events driving our global society every second of every day, creating a free open platform for computing on the entire world.

    YAGO - A huge semantic knowledge base, derived from Wikipedia, WordNet and GeoNames. Currently, YAGO has knowledge of more than 10 million entities (like persons, organizations, cities, etc.) and contains more than 120 million facts about these entities. The source code of YAGO is in this Github repo.

    Zhishi.me - Knowledge Graph data extracted from the largest Chinese encyclopedias, Baidu Baike, Hudong Baike and Chinese Wikipedia.

    语义网络数据集

    ConceptNet - ConceptNet is a freely-available semantic network, designed to help computers understand the meanings of words that people use.

    Microsoft Concept Graph - For Short Text Understanding

    OpenHowNet - An Open Sememe-based Lexical Knowledge Base in Chinese.

    WordNet - A free large lexical database of English from Princeton University.

    学术数据集

    AMiner - Aminer aims to provide comprehensive search and mining services for researcher social networks.

    Microsoft Academic - Microsoft Academic (MA) employs advances in machine learning, semantic inference and knowledge discovery to help you explore scholarly information in more powerful ways than ever before.

    AceMap - Academic search engine based on knowledge graph which includes entities like paper, author, institution and etc.

    

学习资料

    官方文档

    Cypher - reference documentation

    Gremlin - reference documentation

    学习社区资料

    Graph Book - TinkerPop3 centric book written by Kelvin R. Lawrence

    SQL2Gremlin - transition from SQL to Gremlin by Daniel Kuppitz

    The Gremlin Compendium - minimum survival kit for any Gremlin user, 10 blog post series by Doan DuyHai

相关会议

    Graph Connect - powered by Neo4j

    Graph Day - an Independent Graph Conference from the Data Day folks

  • 0
    点赞
  • 3
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
【资源说明】 Java开发基于知识图谱的航天数据系统源码+项目使用说明.zip 1、该资源内项目代码都是经过测试运行成功,功能ok的情况下才上传的,请放心下载使用! 2、本项目适合计算机相关专业(如计科、人工智能、通信工程、自动、电子信息等)的在校学生、老师或者企业员工下载使用,也适合小白学习进阶,当然也可作为毕设项目、课程设计、作业、项目初期立项演示等。 3、如果基础还行,也可在此代码基础上进行修改,以实现其他功能。 基于知识图谱的航天数据系统,采用多元数据展示方式,对数据进行分析处理后展示。 该项目是参加全国计算机设计大赛的作品,很遗憾没能够杀出重围,但是通过这个项目我学习到了关于数据知识图谱方面的知识 ## 应用背景 当今航天科普网站存在以下问题: * **科技新闻高深难理解** * **数据表现形式单调,觉冲击感不够** * **方式单一难以激发热情** * **体会感悟不够深刻强烈** 项目背景意义: * **一个理念:**让航天知识深入人心, 感受到祖国航天事业的强大 * **两个观念:**标新立异出类拔萃、信息可靠数据准确 * **五个实现:**知识图谱、你问我答、慧眼世界、慧眼中国、心里有数 知识图谱的构建 1. 利用**webMagic**爬虫框架结合定时任务动态爬取数据并且存储到 **.csv** 后缀的文件中。 2. 从文件中读取数据进行清洗过滤,采用面向对象的思想,建立起核心三元组的概念(实体、属性、关系)。 3. 运用**ORM**框架实现实体对象到**数据库Neo4j**的映射,进行数据持久存储。 知识测试系统构建 1. 航天问题库爬取; 2. 航天问题随机展示; 3. 用户回答问题并提交; 4. 服务器统计热点问题; 5. 答案判别回复。
UMLS数据集是由美国国家医学书馆(National Library of Medicine)开发和维护的一种医学资源,用于支持医学信息检索和语义分析任务。知识图谱是将这个庞大的UMLS数据集以的形式展示出来,以帮助用户更直观地理解和使用其中的知识。 在知识图谱的效果中,可以看到UMLS数据集中的各种实体和它们之间的关系。这些实体包括疾病、药物、治疗方法、解剖部位等,它们以节点的形式呈现在中。不同的实体之间通过边连接起来,表示它们之间的关联关系,比如疾病与症状的关系、药物与治疗方法的关系等。 通过这样的知识图谱,用户可以更方便地了解某个实体的具体信息。当用户点击某个节点时,会显示该节点的详细属性和与之相关的其他实体。用户还可以对进行放大、缩小、平移等操作,以便更好地浏览整个谱。 除了直观地展示实体和关系,知识图谱还可以提供一些辅助功能。比如,用户可以根据自己的兴趣选择只显示特定类型的实体,或者根据关联程度来调整边的显示。这些功能可以帮助用户更快速地找到自己感兴趣的信息。 总之,UMLS数据知识图谱的可效果能够帮助用户更深入地理解和探索其中的知识。它通过直观的形展示以及辅助功能,让用户更方便地获取所需的信息,并发现实体之间的关联。这对于医学信息检索和语义分析任务有着重要的意义。

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值