Network Security Internet Technology Development Database Servers Mobile Phone Android Software Apple Software Computer Software News IT Information

In addition to Weibo, there is also WeChat

Please pay attention

WeChat public account

Shulou

Will still monitor the health status of ceph clusters

2025-04-02 Update From: SLTechnology News&Howtos shulou NAV: SLTechnology News&Howtos > Servers >

Share

Shulou(Shulou.com)06/01 Report--

Will still monitor the health status of the ceph cluster, many novices are not very clear about this, in order to help you solve this problem, the following editor will explain for you in detail, people with this need can come to learn, I hope you can gain something.

1. Introduction 1.1 introduction

The various service processes running in the cluster that we need to monitor; all the pgs normal status of the cluster is active + clean, and the rest are abnormal.

1.2 Cluster basic monitoring

Cluster basic monitoring mainly includes checking the health status of the cluster, capacity usage, and the running status of monitor and osd daemons (up, down).

2. Cluster health detection

Haha, usually quite lazy, every time to enter so much ceph, a little annoying, the original ceph has an interactive mode (no command memory function, , miscalculation)

2.1 check cluster health in interactive mode # ceph## to view the current status of the cluster, HEALTH_OK, HEALTH_WARN, HEALTH_ERRceph > health## effect is the same as ceph-s ceph > status## cluster mon related information ceph > quorum_statusceph > mon_status2.2 command line input

Haha, it's better to use the command line operation, which can be found by pressing the up and down keystrokes. When the same command is executed continuously, you don't need to keep typing.

# # the effect of the two commands is the same # ceph status# ceph-slots # health: HEALTH_OK, HEALTH_WARN, HEALTH_ERR# ceph health [detail]

Note:

Cluster health status "HEALTH_OK" means that the cluster is healthy and normal. If "HEALTH_WARN XXX num placement group stale" occurs, you can wait for a few minutes and it will automatically return to normal.

2.3 Cluster dynamic monitoring

In some cases, dynamic and continuous attention to cluster event information is required.

# ceph-w3, Cluster capacity Test 3.1 Cluster capacity View

The cluster runs in a healthy state, everything has a limit, and the storage cluster is the same, so it is impossible to write all the time. You need to pay attention to the capacity state at a later stage. After all, the larger the amount of data, the lower the performance of the whole cluster (after all, if the capacity status is not detected properly, the problem caused by data blocking is not so easy to solve.) Delete the data that should be deleted. There is really no capacity, so expand it. In theory, it is unlimited expansion. There is also the problem of data balance.

# # in ceph, all data is written to the datapool (abstract concept) # ceph df3.2 cluster capacity parameters

In general, if the osd is used more than 85%, the data will not be written to the osd; if the overall capacity of the cluster exceeds 95%, the cluster cannot be written; you can adjust the configuration to control the capacity of the cluster, and it is not recommended to adjust too much; if the osd exceeds the default alarm value, consider whether the data can be balanced; if the cluster exceeds the alarm value, expand the capacity.

# # add capacity configuration parameters to the configuration file, and remember to restart relevant services to make the configuration take effect # # , you can also modify the configuration parameters online Then write a separate file to introduce # vim / etc/ceph/ceph.conf...## cluster overall capacity usage limit mon_osd_full_ratio = "0.950000" # # single osd capacity usage limit mon_osd_nearfull_ratio = "0.850000".. 4, mon detection

In general, multiple mon; are deployed in online environments, so when reading and writing data to the cluster, you need to check the mon status.

# # dump is more detailed than stat, quorum_status is more detailed than dump # ceph mon stat# ceph mon dump# ceph quorum_status-f json-pretty5, osd detects 5.1osd status

In:osd joins the cluster

Out:osd did not join the cluster

Down:osd joined the cluster, but the service stopped

Up:osd joins the cluster and the service is running

5.2 osd status Detection # # check all osd status # is it helpful for ceph osd stat# ceph osd dump# ceph osd tree to read the above content? If you want to know more about the relevant knowledge or read more related articles, please follow the industry information channel, thank you for your support.

Welcome to subscribe "Shulou Technology Information " to get latest news, interesting things and hot topics in the IT industry, and controls the hottest and latest Internet news, technology news and IT industry trends.

Views: 0

*The comments in the above article only represent the author's personal views and do not represent the views and positions of this website. If you have more insights, please feel free to contribute and share.

Share To

Servers

Wechat

© 2024 shulou.com SLNews company. All rights reserved.

12
Report