HBase RegionServer Monitoring

1. RegionServer-level monitoring

 

| Metric | Type (GAUGE, COUNTER) | Unit | Description | Notes |
|---|---|---|---|---|
| regionCount | GAUGE | | The number of regions hosted by the RegionServer | Objects hosted by the RegionServer |
| storeCount | GAUGE | | | |
| storeFileCount | GAUGE | | The number of store files on disk currently managed by the RegionServer | |
| storeFileSize | GAUGE | | Aggregate size of the store files on disk | |
| hlogFileCount | GAUGE | | The number of write-ahead logs not yet archived | |
| totalRequestCount | COUNTER | | The total number of requests received | Load |
| readRequestCount | COUNTER | | The number of read requests received | |
| writeRequestCount | COUNTER | | The number of write requests received | |
| numOpenConnections | GAUGE | | The number of open connections at the RPC layer | Connections and queues |
| numActiveHandler | GAUGE | | The number of RPC handlers actively servicing requests | |
| numCallsInGeneralQueue | GAUGE | | The number of currently enqueued user requests | |
| numCallsInReplicationQueue | GAUGE | | The number of currently enqueued operations received from replication | |
| numCallsInPriorityQueue | GAUGE | | The number of currently enqueued priority (internal housekeeping) requests | |
| flushQueueLength | GAUGE | | Current depth of the memstore flush queue. If increasing, we are falling behind with clearing memstores out to HDFS. | |
| compactionQueueLength | GAUGE | | Current depth of the compaction request queue. If increasing, we are falling behind with storefile compaction. | |
| updatesBlockedTime | COUNTER | ms | Number of milliseconds updates have been blocked so the memstore can be flushed | |
| blockCacheHitCount | COUNTER | | The number of block cache hits | Block cache usage |
| blockCacheMissCount | COUNTER | | The number of block cache misses | |
| blockCacheExpressHitPercent | GAUGE | percent | The percent of the time that requests with the cache turned on hit the cache | |
| percentFilesLocal | GAUGE | percent | Percent of store file data that can be read from the local DataNode, 0-100 | File locality ratio |
| <op>_<measure> | GAUGE | | Operation latencies, where <op> is one of Append, Delete, Mutate, Get, Replay, Increment; and where <measure> is one of min, max, mean, median, 75th_percentile, 95th_percentile, 99th_percentile | Detailed per-operation counters |
| slow<op>Count | COUNTER | | The number of operations we thought were slow, where <op> is one of the list above | |
| GcTimeMillis | COUNTER | ms | Time spent in garbage collection, in milliseconds | GC time |
| GcTimeMillisParNew | COUNTER | ms | Time spent in garbage collection of the young generation, in milliseconds | |
| GcTimeMillisConcurrentMarkSweep | COUNTER | ms | Time spent in garbage collection of the old generation, in milliseconds | |
| authenticationSuccesses | COUNTER | | Number of client connections where authentication succeeded | ACL module statistics |
| authenticationFailures | COUNTER | | Number of client connection authentication failures | |
| mutationsWithoutWALCount | COUNTER | | Count of writes submitted with a flag indicating they should bypass the write-ahead log | |
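The Type column matters when the values are consumed: a GAUGE can be read as-is, while a COUNTER only becomes meaningful as a delta or rate between two scrapes. The following is a minimal sketch of that conversion, and of deriving a block cache hit ratio from blockCacheHitCount and blockCacheMissCount. The `Sample` class, the function names, and the counter-reset handling are illustrative assumptions, not part of HBase or of any particular monitoring agent.

```python
from dataclasses import dataclass


@dataclass
class Sample:
    """One scrape of RegionServer metrics (hypothetical container)."""
    timestamp_s: float
    values: dict


def counter_delta(prev: Sample, curr: Sample, metric: str) -> float:
    """Increase of a COUNTER metric between two scrapes."""
    delta = curr.values[metric] - prev.values[metric]
    # Assumption: a negative delta means the counter was reset (for example
    # after a RegionServer restart), so the current value is the increase.
    return curr.values[metric] if delta < 0 else delta


def counter_rate(prev: Sample, curr: Sample, metric: str) -> float:
    """Per-second rate of a COUNTER metric between two scrapes."""
    elapsed = curr.timestamp_s - prev.timestamp_s
    if elapsed <= 0:
        raise ValueError("samples must be in increasing time order")
    return counter_delta(prev, curr, metric) / elapsed


def block_cache_hit_ratio(prev: Sample, curr: Sample) -> float:
    """Block cache hit ratio (0-100) over the interval between two scrapes."""
    hits = counter_delta(prev, curr, "blockCacheHitCount")
    misses = counter_delta(prev, curr, "blockCacheMissCount")
    total = hits + misses
    return 100.0 * hits / total if total > 0 else 0.0


# Made-up sample values, 60 seconds apart.
prev = Sample(0.0, {"totalRequestCount": 1000,
                    "blockCacheHitCount": 900, "blockCacheMissCount": 100})
curr = Sample(60.0, {"totalRequestCount": 7000,
                     "blockCacheHitCount": 1800, "blockCacheMissCount": 200})

requests_per_second = counter_rate(prev, curr, "totalRequestCount")
hit_ratio = block_cache_hit_ratio(prev, curr)
```

Computing the ratio from deltas reflects only the most recent interval, whereas dividing the raw counters would average over the whole process lifetime.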

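The following metrics are non-core and have not been implemented yet.

| Metric | Type (GAUGE, COUNTER) | Unit | Description | Notes |
|---|---|---|---|---|
| compactedCellsCount | COUNTER | | Number of cells compacted | Cell statistics |
| majorCompactedCellsCount | COUNTER | | Number of cells compacted in major compactions | |
| flushedCellsSize | COUNTER | | Size of data flushed to disk | |
| blockedRequestCount | COUNTER | | Number of flushes triggered because the memstore exceeded its threshold | |
| splitRequestCount | COUNTER | | Number of region split requests | Region splits |
| splitSuccessCount | COUNTER | | Number of successful region splits | |
| receivedBytes | COUNTER | bytes | Amount of data received | Bandwidth |
| sentBytes | COUNTER | bytes | Amount of data sent | SyncTime_mean |
| compactionQueueSize | GAUGE | | Size of the compaction queue | Compaction statistics |
| compactionSize_avg_time | GAUGE | ms | Average data size per compaction | |
| compactionSize_num_ops | COUNTER | | Number of compactions performed | |
| compactionTime_avg_time | GAUGE | ms | Average time per compaction | |
| compactionTime_num_ops | COUNTER | | Number of compactions performed | |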

 

 

2. RegionServer alert settings

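| Metric | Alert strategy | Alert level | Notes |
|---|---|---|---|
| totalRequestCount | all(#3) > 50000 | P1 | Load too high |
| compactionQueueLength | all(#3) > 100 | P1 | Compaction queue too long |
| percentFilesLocal | all(#3) <= 90 | P1 | File locality below 95% |
| blockCacheExpressHitPercent | all(#3) <= 90 | P1 | Block cache hit ratio below 95% |
| GcTimeMillisConcurrentMarkSweep | all(#3) > 200 | P1 | GC time too long |
| storeFileCount | all(#3) > 1000 | P1 | Too many StoreFiles; consider compaction |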
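The strategies above can be sketched in code. The source does not define `all(#3)`; this sketch assumes it means "each of the three most recent samples satisfies the comparison", the reading used by Open-Falcon-style strategy expressions. The `AllLastN` class is a hypothetical helper written for illustration, not an API of any alerting system.

```python
import operator
from collections import deque

# Comparison operators that appear in the alert strategies.
_OPS = {">": operator.gt, ">=": operator.ge,
        "<": operator.lt, "<=": operator.le}


class AllLastN:
    """Fires when each of the last n samples satisfies `value <op> threshold`.

    Assumption: this is the meaning of all(#n) in the alert table.
    """

    def __init__(self, n: int, op: str, threshold: float):
        self.n = n
        self.compare = _OPS[op]
        self.threshold = threshold
        self.window = deque(maxlen=n)  # keeps only the n newest samples

    def observe(self, value: float) -> bool:
        """Record a new sample and report whether the alert should fire."""
        self.window.append(value)
        return len(self.window) == self.n and all(
            self.compare(v, self.threshold) for v in self.window
        )


# compactionQueueLength: all(#3) > 100
rule = AllLastN(3, ">", 100)
fired = [rule.observe(v) for v in [120, 130, 90, 150, 160, 170]]
# fired == [False, False, False, False, False, True]
```

Requiring three consecutive violations keeps a single spike (such as the dip to 90 in the sample data resetting the streak) from triggering or sustaining a P1 alert.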

 

 

3. Table (region)-level monitoring on the RegionServer

 

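| Metric | Type (GAUGE, COUNTER) | Unit | Description | Notes |
|---|---|---|---|---|
| appendCount | COUNTER | | | Region-level per-operation counters |
| deleteCount | COUNTER | | | |
| mutateCount | COUNTER | | | |
| incrementCount | COUNTER | | | |
| scanNext_num_ops | COUNTER | | | |
| get_num_ops | COUNTER | | | |
| numBytesCompactedCount | COUNTER | bytes | Total size of files compacted | Compaction operations |
| numFilesCompactedCount | COUNTER | | Number of files compacted | |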
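Region-level counters are usually more useful when rolled up per table. The sketch below assumes the metrics arrive as a flat dictionary whose keys follow the pattern `Namespace_<namespace>_table_<table>_region_<encodedRegionName>_metric_<metric>`; that key format is an assumption to verify against the cluster's own JMX output, and `sum_by_table` is a hypothetical helper name.

```python
import re
from collections import defaultdict

# Assumed key layout for region-level metrics; check it against real output.
REGION_KEY = re.compile(
    r"^Namespace_(?P<namespace>.+?)_table_(?P<table>.+?)"
    r"_region_(?P<region>[0-9a-f]+)_metric_(?P<metric>.+)$"
)


def sum_by_table(region_metrics: dict) -> dict:
    """Sum region-level COUNTER values into per-table totals."""
    totals = defaultdict(lambda: defaultdict(int))
    for key, value in region_metrics.items():
        match = REGION_KEY.match(key)
        if match is None:
            continue  # not a region-level metric, skip it
        table = f"{match['namespace']}:{match['table']}"
        totals[table][match["metric"]] += value
    return {table: dict(metrics) for table, metrics in totals.items()}


# Made-up sample input with two tables and three regions.
sample = {
    "Namespace_default_table_orders_region_1a2b_metric_get_num_ops": 10,
    "Namespace_default_table_orders_region_3c4d_metric_get_num_ops": 5,
    "Namespace_default_table_orders_region_3c4d_metric_appendCount": 2,
    "Namespace_default_table_user_events_region_5e6f_metric_get_num_ops": 7,
    "numRegions": 3,
}
per_table = sum_by_table(sample)
```

Summing is only appropriate for COUNTER metrics such as the ones in the table above; latency-style gauges would need a different aggregation.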