大数据通常是指规模大到常用软件工具无法处理的数据集合,我们无法在可接受时间段内,去获取、关联处理这些数据。大数据的具体规模经常是变幻不定的,就2012年来说,这个规模从几十TB到几PB不等。为解决大数据处理上的困难,逐步兴起了大数据工具平台,如Apache Hadoop的大数据平台。
原文:
Big data is usually includes data sets with sizes beyond the ability of
commonly-used software tools to capture, manage, and process the data within a
tolerable elapsed time. Big data sizes are a constantly moving target, as of
2012[update]
ranging from a few dozen terabytes to many petabytes of data in a single data
set. With this difficulty, a new platform of "big data" tools has arisen to
handle sensemaking over large quantities of data, as in the Apache Hadoop Big Data
Platform.
灵玖软件:www.lingjoin.com