Summary
In case you missed something along the way, here is a quick overview of the material
covered in this chapter.
HBase is a database designed for semistructured data and horizontal scalability. It
stores data in tables. Within a table, data is organized over a four-dimensional coordinate
system: rowkey, column family, column qualifier, and version. HBase is schema-less,
requiring only that column families be defined ahead of time. It’s also type-less, storing
all data as uninterpreted arrays of bytes. There are five basic commands for interacting
with data in HBase: Get, Put, Delete, Scan, and Increment. The only way to query
HBase based on non-rowkey values is by a filtered scan.6
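To make the coordinate system and the five commands concrete, here is a minimal in-memory sketch in Python. It is a toy model for illustration only, not the HBase client API: the class name ToyTable, its method signatures, the logical clock used for versions, and the sample rows are all assumptions. The increment method also assumes counters are stored as 8-byte big-endian integers.

```python
import struct
from collections import defaultdict


class ToyTable:
    """In-memory toy model of an HBase-style table (hypothetical, not the real API)."""

    def __init__(self, families):
        # Column families are the only part of the schema fixed ahead of time.
        self.families = set(families)
        # rowkey -> (family, qualifier) -> {version: value bytes}
        self.rows = defaultdict(lambda: defaultdict(dict))
        self.clock = 0  # logical clock standing in for timestamps (assumption)

    def put(self, rowkey, family, qualifier, value, version=None):
        if family not in self.families:
            raise KeyError(f"unknown column family: {family!r}")
        if not isinstance(value, bytes):
            raise TypeError("values are stored as uninterpreted bytes")
        if version is None:
            self.clock += 1
            version = self.clock
        self.rows[rowkey][(family, qualifier)][version] = value
        return version

    def get(self, rowkey, family, qualifier, version=None):
        cell = self.rows.get(rowkey, {}).get((family, qualifier), {})
        if not cell:
            return None
        if version is None:
            version = max(cell)  # default to the newest version
        return cell.get(version)

    def delete(self, rowkey, family, qualifier):
        # Removes every version of the cell; drops the row once it is empty.
        row = self.rows.get(rowkey)
        if row is not None:
            row.pop((family, qualifier), None)
            if not row:
                del self.rows[rowkey]

    def scan(self, start=None, stop=None, row_filter=None):
        # Rows come back in sorted rowkey order; the filter sees the newest
        # value of each cell and is the only way to select on non-rowkey data.
        for rowkey in sorted(self.rows):
            if start is not None and rowkey < start:
                continue
            if stop is not None and rowkey >= stop:
                break
            latest = {col: versions[max(versions)]
                      for col, versions in self.rows[rowkey].items()}
            if row_filter is None or row_filter(latest):
                yield rowkey, latest

    def increment(self, rowkey, family, qualifier, amount=1):
        # Assumes the counter is an 8-byte big-endian signed integer.
        current = self.get(rowkey, family, qualifier)
        total = (struct.unpack(">q", current)[0] if current else 0) + amount
        self.put(rowkey, family, qualifier, struct.pack(">q", total))
        return total


table = ToyTable(families=["info"])
table.put(b"row1", "info", "name", b"Mark")
table.put(b"row1", "info", "name", b"Marcus")  # a second version of the same cell
table.put(b"row2", "info", "name", b"Ana")
table.increment(b"row2", "info", "visits", 5)

latest_name = table.get(b"row1", "info", "name")
first_name = table.get(b"row1", "info", "name", version=1)
matches = [rowkey for rowkey, _ in table.scan(
    row_filter=lambda cells: cells.get(("info", "name")) == b"Ana")]
```

Note that the filtered scan still walks every row in the range and discards the ones that do not match, which is why lookups on non-rowkey values cost far more than a Get by rowkey.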