Ceph is a very resilient, highly available storage system. Once a Ceph cluster is configured, for the most part, it can run maintenance free. In most cases, lack of knowledge on how Ceph works leads to major issues, causing cluster-side interference. In this section, we will highlight some of the most common issues and how to combat them in a Ceph cluster.
The following are a few best practices to keep a Ceph cluster running healthy:
- If possible, keep all settings to default for a healthy cluster.
- Use Ceph pool only to implement a different OSD type policy and not for multitenancy, such as one pool for SSDs and another for HDDs.
- Do not make frequent Ceph configuration changes. It adds extra workload on the cluster OSDs, reducing the life of HDDs. After each change, let the cluster rebalance data before making new changes.
- Always keep in mind the core count of Ceph nodes when adjusting Ceph threads. Do not let the number of...