Mastering Ceph

By: Nick Fisk
Overview of this book

Mastering Ceph covers all that you need to know to use Ceph effectively. Starting with the design goals and planning steps that should be undertaken to ensure successful deployments, you will be guided through setting up and deploying the Ceph cluster with the help of orchestration tools. Key areas of Ceph, including BlueStore, erasure coding, and cache tiering, are covered with the help of examples. Developing applications that use librados, and performing distributed computations with shared object classes, are also covered. A section on tuning takes you through the process of optimizing both Ceph and its supporting infrastructure. Finally, you will learn to troubleshoot issues and handle various scenarios where Ceph is unlikely to recover on its own. By the end of the book, you will be able to successfully deploy and operate a resilient, high-performance Ceph cluster.

How does Ceph work?

The core storage layer in Ceph is RADOS, which, as the name suggests, provides a distributed object store on which the higher-level storage protocols are built. The RADOS layer in Ceph comprises a number of OSDs. Each OSD is completely independent and forms peer-to-peer relationships with the other OSDs to build the cluster. Each OSD is typically mapped to a single physical disk via a basic host bus adapter (HBA), in contrast to the traditional approach of presenting a number of disks to the OS via a Redundant Array of Independent Disks (RAID) controller.

The other key components in a Ceph cluster are the monitors; these are responsible for forming cluster quorum via the use of Paxos. By forming quorum, the monitors can be sure that they are in a state where they are allowed to make authoritative decisions for the cluster, avoiding split-brain scenarios. The monitors are not directly involved in the data path and do not have the same performance requirements as OSDs. They are mainly used to provide a known cluster state, including membership, configuration, and statistics, via the use of various cluster maps. These cluster maps are used by both Ceph cluster components and clients to describe the cluster topology and enable data to be safely stored in the right location.
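
As a rough illustration of how clients consume this cluster state, the following sketch uses the Python rados bindings to ask the monitors for the current cluster status; the parameter-free status query and the /etc/ceph/ceph.conf path are assumptions about a typical deployment rather than anything this chapter prescribes:

    import json
    import rados

    # Connecting first contacts the monitors listed in ceph.conf to fetch
    # the cluster maps; no OSD is involved until actual I/O takes place.
    cluster = rados.Rados(conffile='/etc/ceph/ceph.conf')
    cluster.connect()

    # Ask the monitors for the cluster status (health, membership,
    # OSD and PG statistics) as JSON.
    cmd = json.dumps({'prefix': 'status', 'format': 'json'})
    ret, outbuf, errs = cluster.mon_command(cmd, b'')
    status = json.loads(outbuf)
    print(sorted(status))   # top-level sections: health, monmap, osdmap, pgmap, ...
    print(status['health'])

    cluster.shutdown()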

Due to the scale at which Ceph is intended to operate, one can appreciate that tracking the state and placement of every single object in the cluster would become computationally very expensive. Ceph solves this problem by using CRUSH to place objects into groups of objects named placement groups (PGs). This reduces the need to track millions of objects down to a much more manageable number of PGs, typically in the thousands.
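
To make this concrete, the following is a minimal sketch of the object-to-PG step; it is not Ceph's actual implementation (Ceph hashes the object name with the rjenkins hash and uses a stable modulo against the pool's PG count), but it shows how an arbitrarily large object namespace collapses onto a fixed set of PGs, which CRUSH then maps onto OSDs:

    import hashlib

    def object_to_pg(object_name, pg_num):
        # Hash the object name and fold it onto one of pg_num placement
        # groups. Real Ceph uses the rjenkins hash and a "stable mod" so
        # the PG count can grow without remapping every object, but the
        # principle is the same.
        digest = hashlib.sha1(object_name.encode()).digest()
        return int.from_bytes(digest[:4], 'little') % pg_num

    # Millions of objects map onto a few thousand PGs that Ceph tracks.
    for name in ('rbd_data.1', 'rbd_data.2', 'backup-2017-01-01'):
        print(name, '-> pg', object_to_pg(name, pg_num=2048))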

Librados is a Ceph library that can be used to build applications that interact directly with the RADOS cluster to store and retrieve objects.
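
A minimal example of such an application, using the Python rados bindings, is sketched below; the pool name rbd and the object name hello_object are arbitrary placeholders, and the pool is assumed to already exist:

    import rados

    # Connect to the cluster using the standard configuration file.
    cluster = rados.Rados(conffile='/etc/ceph/ceph.conf')
    cluster.connect()

    # Open an I/O context against an existing pool.
    ioctx = cluster.open_ioctx('rbd')

    # Write an object into the pool, then read it back.
    ioctx.write_full('hello_object', b'Hello RADOS')
    print(ioctx.read('hello_object'))

    ioctx.close()
    cluster.shutdown()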

For more information on how the internals of Ceph work, it is strongly recommended that you read the official Ceph documentation and also the thesis written by Sage Weil, the creator and primary architect of Ceph.