Prometheus 监控系统的初步了解与系统搭建

cd /opt/prometheus
tar xf prometheus-2.35.0.linux-amd64.tar.gz
mv prometheus-2.35.0.linux-amd64 /usr/local/prometheus
 
cat /usr/local/prometheus/prometheus.yml | grep -v "^#"
global:					#用于prometheus的全局配置，比如采集间隔，抓取超时时间等
  scrape_interval: 15s			#采集目标主机监控数据的时间间隔，默认为1m
  evaluation_interval: 15s 		#触发告警生成alter的时间间隔，默认是1m
  # scrape_timeout is set to the global default (10s).
  scrape_timeout: 10s			#数据采集超时时间，默认10s
 
altering:				#用于altermanager实例的配置，支持静态配置和动态服务发现的机制
  alertmanagers:
    - static_configs:
        - targets:
          # - alertmanager:9093
 
rule_files:				#用于加载告警规则相关的文件路径的配置，可以使用文件名通配机制
  # - "first_rules.yml"
  # - "second_rules.yml"
 
scrape_configs:			#用于采集时序数据源的配置
  # The job name is added as a label `job=<job_name>` to any timeseries scraped from this config.
  - job_name: "prometheus"		#每个被监控实例的集合用job_name命名，支持静态配置（static_configs）和动态服务发现的机制（*_sd_configs）
 
    # metrics_path defaults to '/metrics'
    metrics_path: '/metrics'    #指标数据采集路径，默认为 /metrics
    # scheme defaults to 'http'.
 
    static_configs:				#静态目标配置，固定从某个target拉取数据
      - targets: ["localhost:9090"]

(2) 将Prometheus加入到系统服务

cat > /usr/lib/systemd/system/prometheus.service <<'EOF'
[Unit]
Description=Prometheus Server
Documentation=https://prometheus.io
After=network.target
 
[Service]
Type=simple
ExecStart=/usr/local/prometheus/prometheus \
--config.file=/usr/local/prometheus/prometheus.yml \
--storage.tsdb.path=/usr/local/prometheus/data/ \
--storage.tsdb.retention=15d \
--web.enable-lifecycle
  
ExecReload=/bin/kill -HUP $MAINPID
Restart=on-failure
 
[Install]
WantedBy=multi-user.target
EOF
 
 
systemctl start prometheus
systemctl enable prometheus
 
netstat -natp | grep :9090

（3）进行界面访问

浏览器搜索 ip：9090

部署 Exporters ，添加监控主机

部署 Node Exporter 监控系统级指标（对每一个node节点）

（1）上传 node_exporter-1.3.1.linux-amd64.tar.gz 进行解压

（2）将 node_exporter添加到系统服务中

cat > /usr/lib/systemd/system/node_exporter.service <<'EOF'
[Unit]
Description=node_exporter
Documentation=https://prometheus.io/
After=network.target
 
[Service]
Type=simple
ExecStart=/usr/local/bin/node_exporter \
--collector.ntp \
--collector.mountstats \
--collector.systemd \
--collector.tcpstat
 
ExecReload=/bin/kill -HUP $MAINPID
Restart=on-failure
 
[Install]
WantedBy=multi-user.target
EOF
 
（3）启动 
systemctl start node_exporter
systemctl enable node_exporter
 
netstat -natp | grep :9100

（3）修改 prometheus 配置文件，加入到 prometheus 监控中

vim /usr/local/prometheus/prometheus.yml
#在尾部增加如下内容
  - job_name: nodes
    metrics_path: "/metrics"
    static_configs:
    - targets:
	  - 20.0.0.61:9100
	  - 20.0.0.62:9100
	  - 20.0.0.63:9100
      labels:
        service: kubernetes
		
（5）重新载入配置
curl -X POST http://20.0.0.61:9090/-/reload    或    systemctl reload prometheus
浏览器查看 Prometheus 页面的 Status -> Targets

安装grafana---可视化工具

下载地址：https://grafana.com/grafana/download
          https://mirrors.bfsu.edu.cn/grafana/yum/rpm/
 
yum install -y grafana-7.4.0-1.x86_64.rpm
 
systemctl start grafana-server
systemctl enable grafana-server
 
netstat -natp | grep :3000
 
浏览器访问：http://20.0.0.61:3000 ，默认账号和密码为 admin/admin

添加服务

yucfkyu

关注

28
点赞
踩
14

收藏

觉得还不错? 一键收藏
0
评论
Prometheus 监控系统的初步了解与系统搭建

prometheus是一个开源的系统监控以及报警系统。整合zabbix的功能，系统，网络，设备。promethues可以兼容网络，设备。容器监控。告警系统。因为他和k8s是一个项目基金开发的产品，天生匹配k8s的原生系统。容器化和云原生服务适配性很高。Prometheus是一个服务监控系统和时序数据库，提供了通用的数据模型和快捷数据采集，存储和接口查询。核心组件： Prometheus server定期从静态配置的监控目标或者基于服务发现的自动配置目标中进行拉取数据。
复制链接

扫一扫

专栏目录