Cloud Computing(5)_Big Data Infrastructure

One pupolar application of Hadoop is data-warehousing.
Data-warehousing have been implemented through relational databases, particularly those optimized for a specific workload.

Database Workload
  • OLTP(online transaction processing)
    • Typical applications: e-commerce, banking, airline reservations
    • User facing: real-time(process in memory), low latency, highly-concurrent
    • Tasks: relatively small set of “standard” transactional queries
    • Data access pattern: random reads, updates, writes
  • OLAP(online analytical processing)
    • Typical applications: business intelligence, data mining
    • Back-end processing: **(process in disk)**batch workloads, less concurrent
    • Tasks: complex analytical queries, often ad hoc
    • Data access pattern: table scans, large amounts of data per query
Database solution: Separate databases
  • User-facing OLTP database for high-volume transactions
  • Data warehousing for OLAP workloads

How do we connect the OLTP and OLAP?
1

OLTP/OLAP Integration
  • Extract-Transform-Load(ETL)
    • Extract records from sourse(OLTP)
    • Transform: clean data, check integrity, aggregate
    • Load into OLAP database
  • ETL Bottleneck
    • ETL typically a nightly task
    • limited by the speed of communication

Given successful applications of Hadoop to data-warehousing and complex analytical queries that are prevalent in such an environment, it makes sense to examine MapReduce algorithms for manipulating relational data.
2
**Hadoop is not a database!**Hadoop to ETL(actually, first L, then E and T)

What’s changed?
  • Dropping cost of disks: cheaper to store everything
  • Types of data collected: even collect the data whose value is less apparent
  • Rise of social media and user-generated content: large increase in data volume
  • Growing maturity of data mining techniques: demonstrates value of data analytics
  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值