hchen-gogogo - CSDN Blog

Original: Extending LVM

No background theory here; straight to the hands-on steps.

2024-07-11 10:45:39 245 1

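In a Kubernetes environment, Pods provide a runtime environment isolated from the host system, but they still share some underlying resources with the host, such as the kernel. As a result, running dmesg inside any Pod shows the host's kernel log rather than a Pod-specific log. Kernel sharing: all containers on the same host (including containers inside Pods) run on the same operating system kernel. dmesg prints messages from the kernel ring buffer, which are system-wide and are not isolated per container or Pod.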

2024-04-27 16:29:57 285

Original: The relationship between node-driver-registrar and csi-plugin

How node-driver-registrar and the CSI plugin relate to each other, and the overall flow of how kubelet calls CSI.

2024-04-02 17:51:31 359

Original: prometheus federate ingesting out-of-order samples error

prometheus federate msg="Error on ingesting out-of-order samples" num_dropped

2024-01-17 15:24:40 967

Original: Deploying an Elastic cluster with Docker across multiple nodes

Deploying an ELK cluster with Docker across multiple nodes.

2024-01-12 11:32:17 824

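Original: Applying for a free wildcard certificate

Applying for a wildcard certificate from Google, with the domain hosted on Cloudflare and the certificate renewed automatically.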

2023-12-06 11:33:26 561

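Original: OverlayFS

This design makes OverlayFS well suited to containerized environments: it provides a lightweight, efficient, and flexible way to manage a container's filesystem.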

2023-12-04 15:19:28 359

Original: grafana alert state error

Handling a Grafana alert whose state is error.

2023-11-27 17:26:18 307

Original: Using an external Ceph cluster from k8s via rook-ceph

Using an external Ceph cluster as volume storage in Kubernetes.

2023-11-14 16:34:55 248

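Original: Creating a Ceph cluster with cephadm

By default, copies of the ceph.conf file and the client.admin keyring are kept in /etc/ceph on every host that has the _admin label. We generally recommend giving the _admin label to one or more additional hosts so that the Ceph CLI (for example, via cephadm shell) is easy to reach from multiple hosts. The fsid can be found with cephadm ls, docker ps, cat /etc/ceph/ceph.conf, and so on. 3 monitors, 3 OSDs, 3 MDSs, 3 RGWs.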

2023-11-09 18:03:53 138

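Original: Grafana UI automatically adds special characters

When editing an existing query in Grafana (the antique version currently in use: v6.5.0), the cursor jumps to the end automatically. Copying the edited query straight into Prometheus fails with the error below. Without looking at the raw markdown, the difference is completely invisible.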

2023-10-30 11:31:17 265 1

Original: Containerd: missing container_fs

containerd lacks the container_fs metrics.

2023-10-25 17:56:05 202 2

Original: Creating a static PVC with Ceph

Static PVC

2023-10-24 18:36:16 139

Original: HEALTH_ERR 1 filesystem is degraded, 1 filesystem is offline, 1 mds daemon damaged - Monitors have

HEALTH_ERR 1 filesystem is degraded, 1 filesystem is offline, 1 mds daemon damaged - Monitors have assigned me to become a standby. Commands: ceph health detail, ceph mds stat, ceph fs dump, ceph fs status, ceph fs ls. The fix: I know neither what it does nor why it works...

2023-10-23 15:50:21 569

Original: rook-ceph rbd image is still being used

Warning FailedMount 5m6s (x208 over 9h) kubelet MountVolume.MountDevice failed for volume "pvc-xxxxxxx" : rpc error: code = Internal desc = rbd image replicapool/csi-vol-xxxxxxxx is still being used

2023-09-04 15:28:30 224

Original: Elastic index deletion error: action [indices:admin/delete] is unauthorized for user [admin] with roles

security_exception: action [indices:admin/delete] is unauthorized for user [admin] with roles [superuser] on indices [.kibana_task_manager_8.2.3_001], this action is granted by the index privileges [delete_index,manage,all]

2023-09-02 11:12:05 1127

Original: Upgrading k8s from 1.19.10 to 1.25.12

A successful k8s upgrade across a large version gap (from v1.19.10 to v1.25.12).

2023-07-26 18:52:56 322

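Original: Grafana - Alert time/state

In Grafana Alert, Evaluate every and Evaluate for are two important configuration options: they define how often an alert rule is evaluated and how long its condition must persist before the alert fires.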

2023-03-20 11:35:53 3385
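
To make the interaction between Evaluate every and Evaluate for in the entry above concrete, here is a minimal sketch that replays a series of evaluation results and prints the state after each one. It is a simplified model, not Grafana's implementation: the function name `simulate_alert_states`, the parameters `every_s` and `for_s`, and the three state labels are assumptions chosen for illustration, and states such as No Data or Error are ignored.

```python
from typing import List, Optional


def simulate_alert_states(breaches: List[bool], every_s: int, for_s: int) -> List[str]:
    """Replay evaluations spaced `every_s` seconds apart.

    breaches[i] is True when the alert condition is met at evaluation i.
    The alert is "Pending" while the condition has held for less than
    `for_s` seconds and "Alerting" once it has held for at least `for_s`.
    (Hypothetical simplified model, not Grafana's actual code.)
    """
    states: List[str] = []
    breached_since: Optional[int] = None  # time the condition first became true
    for i, breached in enumerate(breaches):
        now = i * every_s
        if not breached:
            # Condition cleared: reset the timer and go back to OK.
            breached_since = None
            states.append("OK")
            continue
        if breached_since is None:
            breached_since = now
        if now - breached_since >= for_s:
            states.append("Alerting")
        else:
            states.append("Pending")
    return states


if __name__ == "__main__":
    # Evaluate every 60s, For 120s: the condition must hold for two full
    # intervals after it is first seen before the alert fires.
    print(simulate_alert_states([False, True, True, True, False], 60, 120))
```

In this model, a shorter evaluation interval detects a breach sooner, while a longer For duration filters out short spikes at the cost of a later notification.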

Original: A summary of problems encountered while testing LXCFS

lxcfs kubernetes docker container volumes

2022-11-29 17:07:56 598

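Original: Using Postgres as the Grafana backend and migrating the data

Using PG purely as Grafana's backend storage is simple enough; the hard part is migrating the existing data over. There are countless approaches online, and after N failed attempts I finally found one that works.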

2022-11-14 15:35:05 718

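Original: Node Problem Detector

NPD is an open-source project that monitors node health and detects common node problems (such as hardware, kernel, or container runtime issues).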

2022-10-21 14:35:53 863

Original: Installing Containerd and basic usage

containerd docker kubernetes

2022-10-10 09:42:04 1058

Original: The relationships among Docker, Kubernetes, CRI, OCI, Containerd, and Runc

The relationships among Docker, Kubernetes, CRI, OCI, Containerd, and Runc

2022-10-09 10:48:28 544

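Original: Replacing an etcd member

Replacing an etcd node requires the basic concepts, so read the official documentation first. Many of the operations are described clearly there; they only need to be adapted to the actual situation.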

2022-07-02 14:06:46 836

Original: Grafana error when importing a dashboard from a JSON file: Templating Failed to upgrade legacy queries Datasource xxx not found

Templating Failed to upgrade legacy queries Datasource xxx not found

2022-06-14 19:39:57 8287

Original: Ceph OSD uses too much memory

OSD and MON memory consumption; Ceph OSD Pod memory consumption very high; Ceph Cluster CRD; Ceph Hardware Recommendations

2022-06-09 13:39:18 1923

Original: Migrating rook-ceph mons to other nodes

An optimized, lossless way to migrate Ceph monitors to other nodes in a Kubernetes environment.

2022-05-30 17:59:25 558

Original: Migrating Grafana configuration to a new environment

Migrating Grafana's default sqlite3 database to a new environment.

2022-05-27 16:25:00 1701

Original: Kubernetes CSI

kubernetes csi volume storage linux

2022-05-09 17:30:17 797

Original: zalando postgres-operator backup/clone

This author was too lazy to leave a summary.

2022-03-02 18:13:51 494

Original: prometheus operator servicemonitor: adding a label

serviceMonitorSpec.[]endpoint.relabelconfig https://github.com/prometheus-operator/prometheus-operator/blob/main/Documentation/api.md#servicemonitorspec https://github.com/prometheus-operator/prometheus-operator/blob/main/Documentation/api.md#servicemonito

2022-02-17 16:27:03 2431

Original: prometheus operator federation

Prometheus is configured through prometheus-operator, so how is federation set up? First, the official site on what federation is: Federation | Prometheus https://prometheus.io/docs/prometheus/latest/federation/ Adding Prometheus configuration through the operator: prometheus-operator/additional-scrape-config.md at main · prometheus-operator/prome

2022-02-17 16:11:45 706

Original: POD QoS

Configure Quality of Service for Pods | Kubernetes. This page shows how to configure Pods so that they will be assigned particular Quality of Service (QoS) classes. Kubernetes uses QoS classes to make decisions about scheduling and evicting Pods. Before you b

2022-02-16 18:35:00 408
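
As a rough illustration of how the QoS classes mentioned in the entry above follow from a Pod's resource settings, the sketch below classifies a Pod from its containers' CPU and memory requests and limits. It is a simplified model under stated assumptions: the function name `qos_class` and the plain-dict input shape are hypothetical, quantities are compared as raw strings (so "500m" and "0.5" would not be treated as equal), and only cpu and memory are considered.

```python
from typing import Dict, List

Resources = Dict[str, Dict[str, str]]  # e.g. {"requests": {"cpu": "100m"}, "limits": {...}}


def qos_class(containers: List[Resources]) -> str:
    """Return "Guaranteed", "Burstable", or "BestEffort" for a Pod.

    Hypothetical simplified model of the documented rules:
    - BestEffort: no container sets any cpu/memory request or limit.
    - Guaranteed: every container sets cpu and memory limits, and each
      request equals its limit (an unset request defaults to the limit).
    - Burstable: everything else.
    """
    anything_set = False
    guaranteed = True
    for container in containers:
        requests = container.get("requests", {})
        limits = container.get("limits", {})
        for resource in ("cpu", "memory"):
            limit = limits.get(resource)
            request = requests.get(resource, limit)  # request defaults to limit
            if limit is not None or request is not None:
                anything_set = True
            if limit is None or request != limit:
                guaranteed = False
    if not anything_set:
        return "BestEffort"
    return "Guaranteed" if guaranteed else "Burstable"


if __name__ == "__main__":
    pod = [{"requests": {"cpu": "100m"}, "limits": {"cpu": "200m", "memory": "128Mi"}}]
    print(qos_class(pod))
```

The class matters mostly under node pressure: in general, BestEffort Pods are the first candidates for eviction and Guaranteed Pods the last.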

Original: Kubelet: Pod Lifecycle Event Generator (PLEG)

design-proposals-archive/pod-lifecycle-event-generator.md at main · kubernetes/design-proposals-archive · GitHub; Pod Lifecycle Event Generator: Understanding the "PLEG is not healthy" issue in Kubernetes | Red Hat Developer https://developers.redhat.com/

2022-02-16 17:08:09 447
