Mastering Ceph - Second Edition

By : Nick Fisk

Mastering Ceph - Second Edition

By: Nick Fisk

Overview of this book

Ceph is an open source distributed storage system that is scalable to Exabyte deployments. This second edition of Mastering Ceph takes you a step closer to becoming an expert on Ceph. You’ll get started by understanding the design goals and planning steps that should be undertaken to ensure successful deployments. In the next sections, you’ll be guided through setting up and deploying the Ceph cluster with the help of orchestration tools. This will allow you to witness Ceph’s scalability, erasure coding (data protective) mechanism, and automated data backup features on multiple servers. You’ll then discover more about the key areas of Ceph including BlueStore, erasure coding and cache tiering with the help of examples. Next, you’ll also learn some of the ways to export Ceph into non-native environments and understand some of the pitfalls that you may encounter. The book features a section on tuning that will take you through the process of optimizing both Ceph and its supporting infrastructure. You’ll also learn to develop applications, which use Librados and distributed computations with shared object classes. Toward the concluding chapters, you’ll learn to troubleshoot issues and handle various scenarios where Ceph is not likely to recover on its own. By the end of this book, you’ll be able to master storage management with Ceph and generate solutions for managing your infrastructure.

Preface

Who this book is for

What this book covers

To get the most out of this book

Get in touch

Free Chapter

Section 1: Planning And Deployment

Planning for Ceph

What is Ceph?

How Ceph works

Ceph use cases

Infrastructure design

How to plan a successful Ceph implementation

Summary

Questions

Deploying Ceph with Containers

Technical requirements

Preparing your environment with Vagrant and VirtualBox

Orchestration

Ansible

A very simple playbook

Adding the Ceph Ansible modules

Change and configuration management

Summary

BlueStore

Summary

Ceph and Non-Native Protocols

Block

File

Summary

Section 2: Operating and Tuning

RADOS Pools and Client Access

Pools

Ceph storage types

Summary

Questions

Developing with Librados

What is librados?

How to use librados

Example librados application

Summary

Questions

Distributed Computation with Ceph RADOS Classes

Example applications and the benefits of using RADOS classes

Writing a simple RADOS class in Lua

Writing a RADOS class that simulates distributed computing

RADOS class caveats

Summary

Questions

Monitoring Ceph

Why it is important to monitor Ceph

What should be monitored

The Ceph dashboard

PG states – the good, the bad, and the ugly

Monitoring Ceph with collectd

Summary

Questions

Tuning Ceph

Latency

Benchmarking

Recommended tunings

Summary

Questions

Tiering with Ceph

Tiering versus caching

What is a bloom filter?

Tiering modes

Uses cases

Creating tiers in Ceph

Tuning tiering

Promotion throttling

Summary

Questions

Section 3: Troubleshooting and Recovery

Troubleshooting

Repairing inconsistent objects

Full OSDs

Ceph logging

Slow performance

Extremely slow performance or no IO

Investigating PGs in a down state

Large monitor databases

Summary

Questions

Disaster Recovery

What is a disaster?

Avoiding data loss

What can cause an outage or data loss?

Lost objects and inactive PGs

Recovering from a complete monitor failure

Using the Ceph object-store tool

Investigating asserts

Summary

Questions

Assessments

Chapter 1, Planning for Ceph

Chapter 2, Deploying Ceph with Containers

Chapter 3, BlueStore

Chapter 4, Ceph and Non-Native Protocols

Chapter 5, RADOS Pools and Client Access

Chapter 6, Developing with Librados

Chapter 7, Distributed Computation with Ceph RADOS Classes

Chapter 8, Monitoring Ceph

Chapter 9, Tuning Ceph

Chapter 10, Tiering with Ceph

Chapter 11, Troubleshooting

Chapter 12, Disaster Recovery

Other Books You May Enjoy

Leave a review - let other readers know what you think

Customer Reviews

5 star

4 star

3 star

2 star

1 star

Investigating PGs in a down state

A PG in a down state will not service any client operations, and any object contained within the PG will be unavailable. This will cause slow requests to build up across the cluster as clients try to access these objects. The most common reason for a PG to be in a down state is when a number of OSDs are offline, which means that there are no valid copies of the PGs on any active OSDs. However, to find out why a PG is down, you can run the following command:

ceph pg x.y query

This will produce a large amount of output; the section we are interested in shows the peering status. The example here was taken from a PG whose pool was set to min_size 1 and had data written to it when only OSD 0 was up and running. OSD 0 was then stopped and OSDs 1 and 2 were started:

We can see that the peering process is being blocked, as Ceph knows that the PG has...

Mastering Ceph - Second Edition

By : Nick Fisk

Mastering Ceph - Second Edition

By: Nick Fisk

Overview of this book

Related Content you might be interested in

Current Title:

Mastering Ceph - Second Edition

Ceph Cookbook

Learning Ceph

Mastering Proxmox