Will Triple Stores Replace Relational Databases?

People ask me all the time, “Will triple stores replace relational databases in three or five years?” and I usually give two answers:

Answer 1: Yes, because triple stores provide 100 times more flexibility. For example, triple stores make it much easier to add new predicates (think columns in relational databases), write complicated ad hoc queries, and perform inferencing and rule processing. Triple stores will soon be as robust, user-friendly and manageable as relational databases. Relational databases may continue to perform a bit better on simple joins, but triple stores already perform better on complicated queries, rule handling and inferencing. Given comparable robustness and usability, and roughly equal speed, many people will choose to switch to the more flexible solution.

Answer 2: No, triple stores will continue to be used in conjunction with relational databases for the near future. Many installed legacy systems cost millions of dollars to implement, and it is impractical to replace them in the near term. In these cases, triple stores can enable smart integration by adding a layer of intelligent metadata on top of the existing databases. Many companies are already using triple stores as a “smart brain” on top of their legacy systems.
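
To make the "metadata on top of legacy systems" idea concrete, here is a minimal sketch in Python. Everything in it is an assumption for illustration: the two in-memory SQLite databases, their table and column names, the `hr:` and `crm:` subject prefixes, and the predicate names are all made up, and a plain Python set stands in for a real triple store. Matching people by name is deliberately naive.

```python
import sqlite3

# Two hypothetical legacy databases with unrelated schemas (made up for this sketch).
hr = sqlite3.connect(":memory:")
hr.execute("CREATE TABLE employees (emp_id INTEGER, full_name TEXT, dept TEXT)")
hr.execute("INSERT INTO employees VALUES (1, 'Alice Smith', 'Sales')")

crm = sqlite3.connect(":memory:")
crm.execute("CREATE TABLE accounts (acct TEXT, owner_name TEXT)")
crm.execute("INSERT INTO accounts VALUES ('ACME', 'Alice Smith')")

# Metadata layer: describe rows from both systems as (subject, predicate, object)
# triples that share one vocabulary. The legacy databases are left untouched.
triples = set()
for emp_id, full_name, dept in hr.execute(
    "SELECT emp_id, full_name, dept FROM employees"
):
    subject = f"hr:employee/{emp_id}"
    triples.add((subject, "name", full_name))
    triples.add((subject, "department", dept))
for acct, owner_name in crm.execute("SELECT acct, owner_name FROM accounts"):
    triples.add((f"crm:account/{acct}", "ownerName", owner_name))

# Cross-system question: which department is responsible for each account?
names = {s: o for s, p, o in triples if p == "name"}
depts = {s: o for s, p, o in triples if p == "department"}
account_depts = sorted(
    (acct, depts[emp])
    for acct, p, owner in triples if p == "ownerName"
    for emp, name in names.items() if name == owner
)
print(account_depts)
```

Neither legacy schema had to change to answer a question that spans both systems. Only the mapping into triples was added.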

How Do Triple Stores Differ from RDBs?

In a recent conversation with a leading industry research firm, I explained how relational databases differ from triple stores by first showing a set of tables that describe a person with multiple properties, plus a set of link tables to spouses, schools, professions and children.

I then showed the same information as a flat table of triples and explained that you:

  1. Don’t need to define schemas beforehand.
  2. Don’t need link tables because you can express one-to-many relationships directly.
  3. Can add new data attributes (predicates) on the fly that will be instantly available for querying because everything is automatically indexed.
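
The three points above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the person data is invented, the `match` helper is hypothetical, and a plain set scanned linearly stands in for a real triple store, which would index the triples rather than scan them.

```python
# A minimal in-memory "flat table of triples": every fact is one
# (subject, predicate, object) row. All names and data are made up.
triples = {
    ("alice", "name", "Alice Smith"),
    ("alice", "spouse", "bob"),
    ("alice", "child", "carol"),
    ("alice", "child", "dave"),   # one-to-many: just another row, no link table
    ("alice", "school", "MIT"),
}

def match(s=None, p=None, o=None):
    """Return all triples matching the pattern; None acts as a wildcard."""
    return sorted(
        t for t in triples
        if (s is None or t[0] == s)
        and (p is None or t[1] == p)
        and (o is None or t[2] == o)
    )

# Query a one-to-many relationship directly.
children = [o for _, _, o in match(s="alice", p="child")]

# Add a predicate that did not exist before; no schema change is needed,
# and the new predicate is immediately queryable.
triples.add(("alice", "profession", "engineer"))
professions = [o for _, _, o in match(p="profession")]

print(children, professions)
```

Note that no table definition appears anywhere: the set of predicates in use is simply whatever the data contains at the moment of the query.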

The analyst stopped me and said, “Hold on – I’ve heard this argument before. This is just a completely denormalized database with just one table. There were all these database articles about knowledge databases in the seventies, but they never went anywhere. How is this different?”

Standards and Adoption

Standards are often a critical factor in making a leap forward in technology. There is a family of standards for the triple approach (RDF, RDFS, OWL, SPARQL, etc.). If you visit the W3C website, you can see that in just a few years it has become the most important development next to HTML5. A big reason for the initial success of SQL databases was that there was a standard. It allowed companies to train new people easily on the technology, and it made it easy to switch from one database vendor to another. Competition between vendors focused on features and performance rather than on the basic access and storage technology.

Unstructured Data is Valuable

Businesspeople are realizing that in many cases there is far more knowledge in unstructured data (think emails, documents, drawings, spreadsheets, etc.) than in structured information. Everyone will agree that relational databases are not the ideal tool for unstructured data. For a while, people thought that XML might be the solution, but XML is not a flexible, self-describing language that easily deals with graphs. (To some extent it can describe itself through XML Schema and represent graphs by using ID attributes, but doing so is cumbersome.) Most people now agree that RDF is a much better structure for this purpose than XML.

Need for Standardized Metadata

Metadata has become increasingly important now that unstructured data is finally recognized to be as valuable as structured data. There are now metadata standards based on RDF triples, and companies are using triple stores in production, not as a replacement for relational databases but as a complementary technology.

In his book, “Pull: The Power of The Semantic Web to Transform your Business,” David Siegel explains how standardized metadata and meta tagging will change the business world. I'm not going to defend his book here, but he sketches in great detail the importance of metadata for taxes, the SEC, the world of health care, education, etc. If you read this book, you will understand why RDF (and thus triples) is the new language for metadata.

Real applications are using triple stores today. Most of these applications are built on a triple store that runs alongside several relational databases. At some point, triple stores will completely replace relational databases. But today the two can also run in parallel and in cooperation.

For the longest time, Google refused to use the word “semantics” in any of its communications. Its position was that keywords are enough to do whatever you need to do for information retrieval. However, Google bought Metaweb, which offers the biggest RDF-based encyclopedia on the Web, and formally announced that it will use RDF embedded in Web pages to enhance the presentation of products, companies and events in its search results. Both Microsoft and Yahoo! have taken the same step, and we are now seeing the first success stories of companies that use this technology.

I’ll leave you with one final thought: What if you need to buy a new car? The obvious choice is to buy another car just like the one you have in the garage: the type of car that has been around for 30 years, works great on the road, gets excellent mileage, and can be repaired by every auto shop. But there is also a new car on the market that is just as stable and fast and costs about the same, yet is much more flexible because it can also fly and run underwater. Which car would you buy?


Source: http://www.information-management.com/newsletters/database_metadata_unstructured_data_triple_store-10020158-1.html
