What Is Hadoop?
The Apache Hadoop project develops open-source software for reliable, scalable, distributed computing. Hadoop includes these subprojects:
* Hadoop Common: The common utilities that support the other Hadoop subprojects.
* Chukwa: A data collection system for managing large distributed systems.
* HBase: A scalable, distributed database that supports structured data storage for large tables.
* HDFS: A distributed file system that provides high throughput access to application data.
* Hive: A data warehouse infrastructure that provides data summarization and ad hoc querying.
* MapReduce: A software framework for distributed processing of large data sets on compute clusters.
* Pig: A high-level data-flow language and execution framework for parallel computation.
* ZooKeeper: A high-performance coordination service for distributed applications.
Hadoop的是什么?
Apache的Hadoop项目的发展提供可靠,可扩展,分布式计算开放源码软件。 Hadoop的包括以下子项目:
* Hadoop的共同点:即支持其他Hadoop的子项目的共同事业。
* Chukwa:一个管理大型分布式系统的数据收集系统。
* HBase:一种可扩展,分布式数据库,支持结构化数据大表存储。
* HDFS:一个分布式的文件系统,提供了高吞吐量的应用程序数据访问。
* Hive:一种数据仓库基础设施,提供数据汇总和特设查询。
* MapReduce的:一个大型的分布式数据处理软件框架集的计算集群。
* Pig:一种高层次的数据流语言和并行计算的执行框架。
* ZooKeeper:一个分布式应用的高性能的协调服务。
What Is Hadoop?
最新推荐文章于 2018-05-09 11:46:14 发布