Data Lake: What Is a Data Lake and Its Importance in Big Data 2015

Data Lake

Cloud computing is one of the biggest innovations in computing technology, and it has been a major step forward for organisations that need ever larger datasets to meet their customers' growing needs. After cloud computing, big data was the fastest-emerging technology, used and implemented by approximately 45% of online organisations and big brands according to a 2014 ICT survey. The ability of big data systems to manage large data sets led many to look for ways to respond to data requests more quickly, and the Data Lake concept was engineered to handle these growing data management requirements.

A Data Lake is a very large, easily accessible, centralized repository for great volumes of structured and unstructured information. It is typically a large object-based storage repository that holds raw data in its native format until the data is requested. The lake also keeps data about the data (metadata), so that individual data instances can be picked out and served as an application requires. Demand for data lake technology is emerging because managing the large data sets of big data requires a single collection of data from which requests can be answered.

A Data Lake architecture stores every piece of data in raw, unprocessed form. Incoming data is not classified when it is first stored, so upfront data preparation is eliminated. A data lake is thus unstructured compared with a conventional data warehouse. Only when the data is required is it classified, organized or analysed to answer the request. This is where the workflow differs from a traditional data warehouse: in a warehouse, data is analysed and structured when it first enters and is stored for specific analyses and applications, while data residing in a lake is still waiting for applications to discover ways to produce insights from it. A data lake uses a flat architecture to store data. Each raw data element in a lake is assigned a unique identifier and tagged with metadata, so that unstructured data can be located even within very large data sets.
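The idea of a flat store with unique identifiers, metadata tags and structure applied only at read time can be illustrated with a small sketch. This is a toy in-memory model, not how any particular data lake product works: the `FlatLake` class, its `put`, `find` and `read` methods, and the tag names `source` and `fmt` are all hypothetical names chosen for illustration.

```python
import json
import uuid
from dataclasses import dataclass, field


@dataclass
class FlatLake:
    """Toy in-memory data lake: one flat namespace, no schema enforced on write."""
    objects: dict = field(default_factory=dict)  # object id -> raw bytes
    catalog: dict = field(default_factory=dict)  # object id -> metadata tags

    def put(self, raw: bytes, **tags) -> str:
        """Store raw bytes untouched and return the unique identifier assigned to them."""
        object_id = uuid.uuid4().hex
        self.objects[object_id] = raw
        self.catalog[object_id] = dict(tags)
        return object_id

    def find(self, **tags) -> list:
        """Return ids of objects whose metadata matches every given tag."""
        return [oid for oid, meta in self.catalog.items()
                if all(meta.get(k) == v for k, v in tags.items())]

    def read(self, object_id: str, parser):
        """Apply structure only now, at read time (schema-on-read)."""
        return parser(self.objects[object_id])


lake = FlatLake()
# Different formats land side by side with no upfront preparation.
click_id = lake.put(b'{"user": "u1", "page": "/home"}', source="web", fmt="json")
log_id = lake.put(b"2015-01-01 ERROR disk full", source="syslog", fmt="text")

# An application decides how to interpret the bytes when it needs them.
json_ids = lake.find(fmt="json")
click = lake.read(json_ids[0], lambda raw: json.loads(raw.decode("utf-8")))
print(click["page"])  # prints /home
```

A warehouse-style design would instead parse and validate each record inside `put` and reject anything that did not fit a predefined schema. Here that work is deferred to whichever `parser` the reading application supplies.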

Data Lakes can flexibly serve the growing need of big platforms for large data stores, because the identifiers on unstructured data and the storage mechanism allow different platforms fast access to stored data instances in many ways. The combination of cloud computing, big data and data lakes is growing to serve multiple data channels whose incoming data has value that is not yet recognized. In the quickly growing world of big data, the Data Lake concept is rapidly gaining popularity. A Data Lake is useful for lowering storage costs, storing more types of data, scaling across multiple data types, increasing capacity and supporting data analysis, and it can be designed and deployed to reduce risk in future data management.

Translated from: https://www.eukhost.com/blog/webhosting/what-is-data-lake-and-its-importance-in-big-data-2015/
