hbase首页介绍,感觉写的很好,就翻译了一下,8,9,10不是很理解,英语水平有限,望批评指正
Apache Hbase 是一个 分布式的,可伸缩的,大数据量存储的 hadoop数据库。
Use Apache HBase™ when you need random, realtime read/write access to your Big Data. This project's goal is the hosting of very large tables -- billions of rows X millions of columns -- atop clusters of commodity hardware. Apache HBase is an open-source, distributed, versioned, non-relational database modeled after Google's
Bigtable: A Distributed Storage System for Structured Data
by Chang et al. Just as Bigtable leverages the distributed data storage provided by the Google File System, Apache HBase provides Bigtable-like capabilities on top of Hadoop and HDFS.
Features
- Linear and modular scalability.
- Strictly consistent reads and writes.
- Automatic and configurable sharding of tables
- Automatic failover support between RegionServers.
- Convenient base classes for backing Hadoop MapReduce jobs with Apache HBase tables.
- Easy to use Java API for client access.
- Block cache and Bloom Filters for real-time queries.
- Query predicate push down via server side Filters
- Thrift gateway and a REST-ful Web service that supports XML, Protobuf, and binary data encoding options
- Extensible jruby-based (JIRB) shell
- Support for exporting metrics via the Hadoop metrics subsystem to files or Ganglia; or via JMX
特点
1.线性和可调节的伸缩性
2.严格的一致读写
3.自动的和可配置的表分片
4.在regionserver之间支持自动的失败切换
5.提供方便的基础类,支持hadoop的mapreduce使用Apache Hbase的表
6.使用JAVA API开发客户端去访问Hbase很简单
7.对实时查询支持块的缓存和布隆过滤器
8.查询谓词经服务端过滤器向下推送
9.Thrift网关和支持XML的restful Web服务,protobuf(
Protocol Buffers Google的类XML数据描述语言
)和二进制数据的编码选项
10.可扩展的基于Jruby的外壳
11.支持导出指标经过Hadoop metrics子系统,导出到文件,ganglia,或JMX