来自耶鲁的一篇论文 http://db.cs.yale.edu/hadoopdb/hadoopdb.pdf 学习下。
简要介绍的PPT,很经典http://www.slideshare.net/cloudera/hw09-hadoop-db
附其他资源,来自首页
HadoopDB
An Architectural Hybrid of MapReduce and DBMS Technologies for AnalyticalWorkloads.
DR@Y
Database Research at Yale University
HadoopDB is:
- A hybrid of DBMS and MapReduce technologies that targets analytical workloads
- Designed to run on a shared-nothing cluster of commodity machines, or in the cloud
- An attempt to fill the gap in the market for a free and open source parallel DBMS
- Much more scalable than currently available parallel database systems and DBMS/MapReduce hybrid systems.
- As scalable as Hadoop, while achieving superior performance on structured data analysis workloads
For more detail, check out theDBMS Musings blog post,or the paper below.
The Paper:
HadoopDB: An Architectural Hybrid of MapReduce and DBMSTechnologies for Analytical Workloads. Azza Abouzeid, Kamil Bajda-Pawlikowski,Daniel J. Abadi, Avi Silberschatz, Alex Rasin. In Proceedings of VLDB, 2009. [PDF]
News:
- Here is the latest paper on performance enhancements in HadoopDB published at SIGMOD'2011.
- HadoopDB technology was commercialized into Hadapt.
- Daniel Abadi was invited to give a talk at Hadoop World: NYC 2010.
- Our demonstration of HadoopDB in Action was accepted for SIGMOD 2010.
- We are invited to present HadoopDB at New York Hadoop User group meetup on April 21.
- HadoopDB was presented at New England Database Summit 2010 during the poster session.
- Call for contributions is posted here.
- HadoopDB's presentation at CLuE PI Meeting 2009 in California is now available.
- The Hadoop World: NYC 2009 presentation is now available.
- Our VLDB 2009 slides are here.
Press:
Links:
Project Members
- Azza Abouzeid
- Kamil Bajda-Pawlikowski
- Daniel J. Abadi
- Avi Silberschatz