Typical Big Data Architecture

 

       

Any data architecture loosely consists of four major logical components:

 

Typical Big Data Architecture

I don’t think there’s a blueprint for big data architectures. But a diagram like this can give you a good idea of the possible components involved. Then, to make things simple for engineers, you start adding requirements, constraints, and SLAs (service-level agreements) at each level. Once you have some idea of how things will look, you start building it and discover that some of the components you planned to use don’t work well together, or that there’s no way to achieve those SLAs. All in all, it’s a fun job.

 

My English is not very good, but I will try my best to translate it. Don't hesitate to point out any problems in my wording. Thank you!

Learn various commercial and open source products that perform SQL on Big Data platforms. You will understand the architectures of the various SQL engines being used and how the tools work internally in terms of execution, data movement, latency, scalability, performance, and system requirements.

This book consolidates in one place solutions to the challenges associated with the requirements of speed, scalability, and the variety of operations needed for data integration and SQL operations. After discussing the history of the how and why of SQL on Big Data, the book provides in-depth insight into the products, architectures, and innovations happening in this rapidly evolving space.

SQL on Big Data discusses in detail the innovations happening, the capabilities on the horizon, and how they solve the issues of performance and scalability and the ability to handle different data types. The book covers how SQL on Big Data engines are permeating the OLTP, OLAP, and operational analytics space and the rapidly evolving HTAP systems.

You will learn the details of:

Batch Architectures: an understanding of the internals, how the existing Hive engine is built, and how it is evolving continually to support new features and provide lower latency on queries

Interactive Architectures: an understanding of how SQL engines are architected to support low latency on large data sets

Streaming Architectures: an understanding of how SQL engines are architected to support queries on data in motion using in-memory and lock-free data structures

Operational Architectures: an understanding of how SQL engines are architected for transactional and operational systems to support transactions on Big Data platforms

Innovative Architectures: an exploration of the rapidly evolving newer SQL engines on Big Data with innovative ideas and concepts

Table of Contents

Chapter 1: Why SQL on Big Data?

Chapter 2: SQL-on-Big-Data Challenges & Solutions

Chapter 3: Batch SQL - Architecture

Chapter 4: Interactive SQL - Architecture

Chapter 5: SQL for Streaming, Semi-Structured, and Operational Analytics

Chapter 6: Innovations and the Road Ahead