HBase read tuning tip

Joel asks the following question:
=================

 

Hi All,

I have an application with two HBase tables.

One table is written to frequently, by a crawler writing web pages.

Another table is written to occasionally (the result of some
processing), but end users read data from this table, and I want the
read response times to be as low as possible.

I only have one server on which to host both tables.

What tuning should I consider to minimise the read latency on the second
table (there will be relatively few users, so throughput is less of a
concern, for the time being) ?


Regards,
Joel

Stack answers first
================
Mark the second-table in-memory in the schema.  And for the first,
have it not use cache at all.  This way, cache should only have
content from the table that is read.  Does the second table fit fully
in memory?
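
As a rough illustration of the schema settings Stack describes, here is a minimal sketch assuming the tables are created fresh through the Python happybase client (HBase Thrift gateway). The table names and column family names are hypothetical placeholders, not taken from the thread; `in_memory` and `block_cache_enabled` are the per-family options that happybase's `create_table()` accepts. For tables that already exist, the same attributes would have to be changed by altering the column family instead.

```python
# Column family options as accepted by happybase's Connection.create_table().
# Table and family names below are hypothetical placeholders.
CRAWL_TABLE = "crawl_pages"      # written frequently by the crawler
SERVING_TABLE = "processed"      # read by end users

CRAWL_FAMILIES = {
    # Keep crawler data out of the block cache entirely.
    "content": dict(block_cache_enabled=False),
}
SERVING_FAMILIES = {
    # Give this family's blocks priority in the block cache.
    "result": dict(in_memory=True, block_cache_enabled=True),
}


def create_tables(connection):
    """Create both tables on an open happybase-style connection."""
    connection.create_table(CRAWL_TABLE, CRAWL_FAMILIES)
    connection.create_table(SERVING_TABLE, SERVING_FAMILIES)
```

With a Thrift server running, this could be called as `create_tables(happybase.Connection("localhost"))`. The connection is passed in rather than created inside the function so the settings can be inspected without a live cluster.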

Joel replies
================
No, the second table is too large to fit in memory.

Lars George answers next
================
Hi Joel,

Marking it "in-memory" does *not* make it all stay in, or get loaded
into, memory. It is just a priority flag that tells HBase to prefer
retaining blocks of that CF in the block cache. So the table is cached
up to the max block cache size. The rest may cause some churn, but that
is the best you can do.

Lars
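
To make Lars's point concrete, here is a toy model of a block cache with a priority flag. It is a deliberate simplification written for illustration and is not HBase's actual eviction algorithm; the class name, the capacity, and the block identifiers are all made up. It only shows the two behaviours he describes: priority blocks survive reads of ordinary blocks, and a priority table larger than the cache still churns.

```python
from collections import OrderedDict


class ToyBlockCache:
    """Toy LRU cache in which 'in-memory' blocks are evicted last."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.normal = OrderedDict()     # block id -> None, oldest first
        self.in_memory = OrderedDict()  # same layout, priority tier

    def access(self, block_id, in_memory=False):
        """Touch a block; return True on a cache hit, False on a miss."""
        tier = self.in_memory if in_memory else self.normal
        if block_id in tier:
            tier.move_to_end(block_id)  # mark as most recently used
            return True
        tier[block_id] = None
        while len(self.normal) + len(self.in_memory) > self.capacity:
            # Evict ordinary blocks first; touch the priority tier only
            # when no ordinary block is left.
            victim_tier = self.normal if self.normal else self.in_memory
            victim_tier.popitem(last=False)
        return False
```

For example, with `ToyBlockCache(4)`, loading four priority blocks and then touching any number of ordinary blocks leaves the four priority blocks cached. Reading a fifth priority block, however, pushes out the oldest priority block, which is the churn that appears once the flagged table exceeds the cache size.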
