Ceph: Designing and Implementing Scalable Storage Systems

By: Michael Hackett, Vikhyat Umrao, Karan Singh, Nick Fisk, Anthony D'Atri, Vaibhav Bhembre

Overview of this book

This Learning Path takes you from the basics of Ceph all the way to an in-depth understanding of its advanced features. You’ll gather the skills to plan, deploy, and manage your Ceph cluster. After an introduction to the Ceph architecture and its core projects, you’ll be able to set up a Ceph cluster and learn how to monitor its health, improve its performance, and troubleshoot any issues. By following the step-by-step approach of this Learning Path, you’ll learn how Ceph integrates with the OpenStack services Glance, Manila, Swift, and Cinder. With knowledge of federated architecture and CephFS, you’ll use Calamari and VSM to monitor the Ceph environment. In the upcoming chapters, you’ll study the key areas of Ceph, including BlueStore, erasure coding, and cache tiering, and discover what they can do for your storage system. In the concluding chapters, you will develop applications that use Librados and distributed computations with shared object classes, and see how Ceph and its supporting infrastructure can be optimized. By the end of this Learning Path, you'll have the practical knowledge needed to operate Ceph in a production environment.

This Learning Path includes content from the following Packt products:

• Ceph Cookbook by Michael Hackett, Vikhyat Umrao and Karan Singh
• Mastering Ceph by Nick Fisk
• Learning Ceph, Second Edition by Anthony D'Atri, Vaibhav Bhembre and Karan Singh

Title Page

About Packt

Contributors

Preface

Ceph - Introduction and Beyond

Introduction

Ceph – the beginning of a new era

RAID – the end of an era

Ceph – the architectural overview

Planning a Ceph deployment

Setting up a virtual infrastructure

Installing and configuring Ceph

Scaling up your Ceph cluster

Using the Ceph cluster with a hands-on approach

Working with Ceph Block Device

Introduction

Configuring Ceph client

Creating Ceph Block Device

Mapping Ceph Block Device

Resizing Ceph RBD

Working with RBD snapshots

Working with RBD clones

Disaster recovery replication using RBD mirroring

Configuring pools for RBD mirroring with one way replication

Configuring image mirroring

Configuring two-way mirroring

Recovering from a disaster!

Working with Ceph and OpenStack

Introduction

Ceph – the best match for OpenStack

Setting up OpenStack

Configuring OpenStack as Ceph clients

Configuring Glance for Ceph backend

Configuring Cinder for Ceph backend

Configuring Nova to boot instances from Ceph RBD

Configuring Nova to attach Ceph RBD

Working with Ceph Object Storage

Introduction

Understanding Ceph object storage

RADOS Gateway standard setup, installation, and configuration

Creating the radosgw user

Accessing the Ceph object storage using S3 API

Accessing the Ceph object storage using the Swift API

Integrating RADOS Gateway with OpenStack Keystone

Integrating RADOS Gateway with Hadoop S3A plugin

Working with Ceph Object Storage Multi-Site v2

Introduction

Functional changes from Hammer federated configuration

RGW multi-site v2 requirement

Installing the Ceph RGW multi-site v2 environment

Configuring Ceph RGW multi-site v2

Testing user, bucket, and object sync between master and secondary sites

Working with the Ceph Filesystem

Introduction

Understanding the Ceph Filesystem and MDS

Deploying Ceph MDS

Accessing Ceph FS through kernel driver

Accessing Ceph FS through FUSE client

Exporting the Ceph Filesystem as NFS

Ceph FS – a drop-in replacement for HDFS

Operating and Managing a Ceph Cluster

Introduction

Understanding Ceph service management

Managing the cluster configuration file

Running Ceph with systemd

Scale-up versus scale-out

Scaling out your Ceph cluster

Scaling down your Ceph cluster

Replacing a failed disk in the Ceph cluster

Upgrading your Ceph cluster

Maintaining a Ceph cluster

Ceph under the Hood

Introduction

Ceph scalability and high availability

Understanding the CRUSH mechanism

CRUSH map internals

CRUSH tunables

Ceph cluster map

High availability monitors

Ceph authentication and authorization

I/O path from a Ceph client to a Ceph cluster

Ceph Placement Group

Placement Group states

Creating Ceph pools on specific OSDs

The Virtual Storage Manager for Ceph

Introduction

Understanding the VSM architecture

Setting up the VSM environment

Getting ready for VSM

Installing VSM

Creating a Ceph cluster using VSM

Exploring the VSM dashboard

Upgrading the Ceph cluster using VSM

Disk performance baseline

Baseline network performance

Ceph rados bench

RADOS load-gen

Benchmarking the Ceph Block Device

Benchmarking Ceph RBD using FIO

Ceph admin socket

Using the ceph tell command

Ceph REST API

Profiling Ceph memory

The ceph-objectstore-tool

Using ceph-medic

Deploying the experimental Ceph BlueStore

Deploying Ceph

Preparing your environment with Vagrant and VirtualBox

Orchestration

Ansible

A very simple playbook

Adding the Ceph Ansible modules

Change and configuration management

Summary

BlueStore

Summary

Erasure Coding for Better Storage Efficiency

What is erasure coding?

How does erasure coding work in Ceph?

Algorithms and profiles

Where can I use erasure coding?

Creating an erasure-coded pool

Summary

Developing with Librados

What is librados?

How to use librados?

Example librados application

Summary

Distributed Computation with Ceph RADOS Classes

Example applications and the benefits of using RADOS classes

Writing a simple RADOS class in Lua

Writing a RADOS class that simulates distributed computing

RADOS class caveats

Summary

Tiering with Ceph

Tiering versus caching

What is a bloom filter

Tiering modes

Use cases

Creating tiers in Ceph

Tuning tiering

Promotion throttling

Summary

Troubleshooting

Repairing inconsistent objects

Full OSDs

Ceph logging

Slow performance

Extremely slow performance or no IO

Investigating PGs in a down state

Large monitor databases

Summary

Disaster Recovery

What is a disaster?

Avoiding data loss

What can cause an outage or data loss?

RBD mirroring

RBD recovery

Lost objects and inactive PGs

Recovering from a complete monitor failure

Using the Ceph object store tool

Investigating asserts

Summary

Operations and Maintenance

Topology

Configuration

Scrubs

Logs

Common tasks

Working with remote hands

Summary

Monitoring Ceph

Monitoring Ceph clusters

Monitoring Ceph MONs

Monitoring Ceph OSDs

Monitoring Ceph placement groups

Monitoring Ceph MDS

Open source dashboards and tools

Summary

Performance and Stability Tuning

Ceph performance overview

Summary

Other Books You May Enjoy

Leave a review - let other readers know what you think

Index

What can cause an outage or data loss?

The majority of outages and cases of data loss are caused directly by losing, within a short period of time, more OSDs than the replication level can tolerate. If these OSDs do not come back online, whether because of a software or a hardware failure, and Ceph was unable to recover the affected objects between the OSD failures, then those objects are lost.
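The failure condition described above can be illustrated with a small Python sketch. It is a simplified model, not part of Ceph: the placement group to OSD mapping, the PG IDs, and the function names are all hypothetical, and the model assumes that no recovery took place between the failures. A placement group is treated as lost when every OSD holding one of its replicas has failed, and as degraded when only some of them have.

```python
from typing import Dict, List, Set


def lost_placement_groups(pg_to_osds: Dict[str, List[int]],
                          failed_osds: Set[int]) -> List[str]:
    """Return the PGs for which every replica sits on a failed OSD."""
    return sorted(
        pg for pg, osds in pg_to_osds.items()
        if osds and set(osds) <= failed_osds
    )


def degraded_placement_groups(pg_to_osds: Dict[str, List[int]],
                              failed_osds: Set[int]) -> List[str]:
    """Return the PGs that lost some, but not all, of their replicas."""
    result = []
    for pg, osds in pg_to_osds.items():
        failed_here = set(osds) & failed_osds
        if failed_here and failed_here != set(osds):
            result.append(pg)
    return sorted(result)


# Hypothetical mapping for a pool with a replication level of 3.
# On a real cluster this information comes from the cluster maps;
# the values here are made up for illustration only.
pg_to_osds = {
    "1.0": [0, 1, 2],
    "1.1": [1, 2, 3],
    "1.2": [3, 4, 5],
    "1.3": [0, 4, 5],
}

# Three OSDs fail before Ceph can recover any objects.
failed = {0, 1, 2}

lost = lost_placement_groups(pg_to_osds, failed)
degraded = degraded_placement_groups(pg_to_osds, failed)

print("lost PGs:", lost)
print("degraded PGs:", degraded)
```

In this made-up example, only the placement group whose three replicas all lived on the failed OSDs is lost; the others still have at least one surviving copy from which Ceph could recover.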

If an OSD has failed because of a failed disk, recovery is unlikely to be possible unless costly disk recovery services are used, and there is no guarantee that any recovered data will be in a consistent state. This chapter does not cover recovery from physical disk failures; it simply recommends using the default replication level of 3 to protect against multiple disk failures.

If an OSD has failed due to a software bug, the outcome is possibly a lot more positive, but the process is complex and time-consuming. Usually, an OSD, which, although the physical disk is in a good...
