HMaster监控

一、HMaster监控指标

MetricType(GAUGE,COUNTER)类型业务意义备注
averageLoad
GAUGE
 Average number of regions served by each region server 
numRegionServers
GAUGE
 Number of live regionservers

 

regionserver计数

numDeadRegionServers
GAUGE
 Number of dead regionservers
clusterRequests
COUNTER
 Total number of requests from all region servers to a cluster 
ritCount
GAUGE
 The number of regions in transition

  

rit状态

ritOldestAge
GAUGE
msThe age of the longest region in transition, in milliseconds
ritCountOverThreshold
GAUGE
 The number of regions that have been in transition longer than a threshold time (default: 60 seconds)
如下为非核心指标
HlogSplitTime_num_ops
COUNTER
 
Time to split Write-ahead log files
 
HlogSplitTime_mean
GAUGE
 Average time to split the total size of a Write-ahead log file 
MetaHlogSplitSize_num_ops
COUNTER
   
MetaHlogSplitTime_mean
GAUGE
   
HlogSplitSize_num_ops
COUNTER
 Average time to split the total size of an Hlog file 
HlogSplitSize_mean
GAUGE
 Size of write-ahead log files the were split 
BulkAssign_num_ops
COUNTER
   
BulkAssign_mean
GAUGE
   
Assign_num_ops
COUNTER
   
Assign_mean
GAUGE
   
BalancerCluster_num_ops
COUNTER
   
BalancerCluster_mean
GAUGE
   

 

二、告警策略

Metric报警策略报警级别备注
averageLoad
all(#3) > 300P1每个RegionServer的平均region数目
numDeadRegionServers
all(#3) >= 1P1存在dead的RegionServer
clusterRequests

all(#10) >= 1000000

all(#10) <= 10000

P1

集群的压力超过100w

集群的压力小于1w(可能存在问题)

ritCount
all(#3) >= 1P1存在rit的region
  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值