VMware Performance and Capacity Management, Second Edition

By: Sunny Dua

Overview of this book

Performance management and capacity management are the two top issues enterprise IT faces when virtualizing. Until the first edition of this book, there was no in-depth coverage of the topic that tackled these issues systematically. The second edition expands on the first, adding new material and reorganizing the book into three logical parts.

The first part provides the technical foundation of SDDC management. It explains the differences between a software-defined data center and a classic physical data center, and how they impact both architecture and operations. From this strategic view, it zooms into the two most common challenges: performance management and capacity management. It introduces a new concept, the Performance SLA, along with a new way of doing capacity management.

The second part provides the actual solution that you can implement in your environment. It puts the theories together and walks through real-life examples created together with customers. It explains the reasoning behind each dashboard, so that you understand why it is required and what problem it solves.

The last part acts as a reference section. It provides a complete reference to the vSphere and vRealize Operations counters, explaining their dependencies and giving practical guidance on the values you should expect in a healthy environment.

Physical server versus Virtual Machine


Hopefully, I've driven home the point that a VM is different from a physical server. Let's now look at the differences from a management point of view; the following table shows those that impact how you manage your infrastructure, beginning with the core properties:

BIOS

Physical server: Every brand and model has a unique BIOS. Even the same model (for example, the HP ProLiant DL380 Gen9) can have multiple BIOS versions. The BIOS needs updates and management, often with physical access to the data center, and this requires downtime.

Virtual Machine: The BIOS is standardized in a VM. There is only one type, the VMware motherboard, and it is independent of the motherboard of the underlying ESXi host. The VM BIOS needs far fewer updates and far less management; the inventory management system no longer needs a BIOS management module.

Virtual HW

Physical server: Not applicable.

Virtual Machine: This is a new layer below the BIOS. It needs an update after every vSphere release, and a data center management system needs to be aware of it, as managing it requires deep knowledge of vSphere. For example, to upgrade the virtual hardware, the VM has to be in the powered-off state (see the sketch after this table).

Drivers

Physical server: Many drivers are loaded and bundled with the OS, and you often need to get the latest drivers from the respective hardware vendors. All these drivers need to be managed. This can be a complex operation, as they vary from model to model and brand to brand. The management tool needs rich functionality, such as the ability to check compatibility, roll out drivers, roll them back if there is an issue, and so on.

Virtual Machine: Relatively few drivers are loaded with the Guest OS; some are replaced by the ones provided by VMware Tools. Even with NPIV, the VM does not need an FC HBA driver. VMware Tools itself needs to be managed, with vCenter being the most common management tool.
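If you want to see what this means operationally, the following is a minimal pyVmomi sketch of a virtual hardware upgrade. The vCenter address, credentials, and the VM name app01 are hypothetical placeholders; treat this as an illustration of the powered-off requirement, not a production procedure.

import ssl
from pyVim.connect import SmartConnect, Disconnect
from pyVim.task import WaitForTask
from pyVmomi import vim

# Connect to vCenter (hypothetical address and credentials; lab-only TLS bypass).
si = SmartConnect(host="vcenter.example.com", user="administrator@vsphere.local",
                  pwd="secret", sslContext=ssl._create_unverified_context())
content = si.RetrieveContent()

# Find the VM by name using a container view.
view = content.viewManager.CreateContainerView(
    content.rootFolder, [vim.VirtualMachine], True)
vm = next(v for v in view.view if v.name == "app01")
print("Current virtual hardware:", vm.config.version)  # for example, vmx-11

# The virtual hardware can only be upgraded while the VM is powered off.
# (In practice, you would shut down the Guest OS gracefully first.)
if vm.runtime.powerState != vim.VirtualMachinePowerState.poweredOff:
    WaitForTask(vm.PowerOffVM_Task())

# Upgrade to the latest version the host supports, then power the VM back on.
WaitForTask(vm.UpgradeVM_Task())
WaitForTask(vm.PowerOnVM_Task())

Disconnect(si)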

How do all these differences impact the hardware upgrade process? Let's take a look:

Physical server: Downtime is required. The upgrade is done offline and is complex. OS reinstallation and updates are often required, so a hardware upgrade is a complex project in the physical world. Sometimes, a hardware upgrade is not even possible without upgrading the application.

Virtual Machine: The upgrade is done online and is simple. Virtualization decouples the application from its hardware dependencies. A VM can be upgraded from 5-year-old hardware to brand-new hardware, moving from a local SCSI disk to 10 Gigabit Fibre Channel over Ethernet (FCoE) and from a dual-core to an 18-core CPU. So yes, MS-DOS can run over 10 Gigabit Ethernet, accessing SSD storage via the PCIe lane. You just need to migrate to the new hardware with vMotion (see the sketch that follows). As a result, the operation is drastically simplified.
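The vMotion step itself is a single API call. Here is a minimal pyVmomi sketch that live-migrates a VM's compute and storage in one operation; the vCenter, host, datastore, and VM names are hypothetical placeholders.

import ssl
from pyVim.connect import SmartConnect, Disconnect
from pyVim.task import WaitForTask
from pyVmomi import vim

si = SmartConnect(host="vcenter.example.com", user="administrator@vsphere.local",
                  pwd="secret", sslContext=ssl._create_unverified_context())
content = si.RetrieveContent()

def find(vimtype, name):
    # Look up a managed object by name using a container view.
    view = content.viewManager.CreateContainerView(content.rootFolder, [vimtype], True)
    return next(o for o in view.view if o.name == name)

vm = find(vim.VirtualMachine, "app01")
host = find(vim.HostSystem, "esxi-new-01.example.com")   # the new hardware
ds = find(vim.Datastore, "ssd-datastore-01")             # the new storage

# vMotion plus Storage vMotion in one step: new host, new datastore, no downtime.
spec = vim.vm.RelocateSpec()
spec.host = host
spec.pool = host.parent.resourcePool  # resource pool of the target host's cluster
spec.datastore = ds
WaitForTask(vm.RelocateVM_Task(spec))

Disconnect(si)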

So far, we have compared the core properties and the upgrade process of a physical server with those of a VM. Every server needs storage, so let's compare their storage properties:

Physical server: Servers connected to a SAN can see the SAN and the FC fabric. They need HBA drivers, have FC PCI cards, and have multipathing software installed.

Virtual Machine: No VM is connected to the FC fabric or the SAN. The VM only sees its local disks. Even with N_Port ID Virtualization (NPIV) and physical Raw Device Mapping (RDM), the VM does not send FC frames. Multipathing is provided by vSphere, transparently to the VM.

Physical server: They normally need an advanced file system or volume manager to RAID (Redundant Array of Inexpensive Disks) their local disks.

Virtual Machine: There is no need to RAID the local disks. A virtual disk is one disk, not two. Availability is provided at the hardware layer.

Physical server: A backup agent and a backup LAN are required in the majority of cases.

Virtual Machine: These are not needed in the majority of cases, as backup is done via VADP (the VMware vStorage APIs for Data Protection), which backs up and restores VMs at the vSphere layer (see the sketch after this table). An agent is only required for application-level backup.
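Under the hood, a VADP-style backup brackets the disk reads with a snapshot. The following minimal pyVmomi sketch shows just that bracket; the names are hypothetical placeholders, and the actual disk reads are the backup product's job.

import ssl
from pyVim.connect import SmartConnect, Disconnect
from pyVim.task import WaitForTask
from pyVmomi import vim

si = SmartConnect(host="vcenter.example.com", user="administrator@vsphere.local",
                  pwd="secret", sslContext=ssl._create_unverified_context())
content = si.RetrieveContent()
view = content.viewManager.CreateContainerView(
    content.rootFolder, [vim.VirtualMachine], True)
vm = next(v for v in view.view if v.name == "app01")

# Step 1: take a quiesced, memoryless snapshot. VMware Tools flushes the
# in-Guest buffers so the frozen base disks are consistent.
WaitForTask(vm.CreateSnapshot_Task(name="backup-temp",
                                   description="temporary snapshot for backup",
                                   memory=False, quiesce=True))

# Step 2: the backup product reads the frozen base disks here
# (for example, over the SAN or NBD transport). Omitted in this sketch.

# Step 3: remove the snapshot; the accumulated delta is rolled back
# into the base disks while the VM keeps running.
WaitForTask(vm.snapshot.currentSnapshot.RemoveSnapshot_Task(removeChildren=False))

Disconnect(si)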

There's a big difference in storage. How about network and security? Let's see:

Physical server: NIC teaming is common. This typically requires two cables per server.

Virtual Machine: NIC teaming is provided by ESXi. The VM is not aware of it and sees only one vNIC.

Physical server: The Guest OS is VLAN-aware. The VLAN is configured inside the OS, so moving to another VLAN requires reconfiguration.

Virtual Machine: The VLAN is generally provided by vSphere and is not configured inside the Guest OS. This means the VM can be moved from one VLAN to another with no downtime (see the sketch after this table). With network virtualization, the VM moves from a VLAN to a VXLAN.

Physical server: The AV agent is installed in the Guest and can be seen by an attacker.

Virtual Machine: The AV agent runs on the ESXi host as a VM (one per ESXi host). It cannot be seen by an attacker from inside the Guest OS.

Physical server: The AV agent consumes OS resources, and AV signature updates cause high storage I/O.

Virtual Machine: The AV agent consumes minimal Guest OS resources, as the work is offloaded to the ESXi Agent VM. AV signature updates do not require high Input/Output Operations Per Second (IOPS) inside the Guest OS, and the total IOPS at the ESXi host level is also lower, as updates are not done per VM.
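Moving a VM to another VLAN is just an edit of its vNIC's port group, and it can be done live. The following minimal pyVmomi sketch covers the standard vSwitch case; the VM and port group names are hypothetical placeholders, and a distributed switch would use a different backing type.

import ssl
from pyVim.connect import SmartConnect, Disconnect
from pyVim.task import WaitForTask
from pyVmomi import vim

si = SmartConnect(host="vcenter.example.com", user="administrator@vsphere.local",
                  pwd="secret", sslContext=ssl._create_unverified_context())
content = si.RetrieveContent()

def find(vimtype, name):
    # Look up a managed object by name using a container view.
    view = content.viewManager.CreateContainerView(content.rootFolder, [vimtype], True)
    return next(o for o in view.view if o.name == name)

vm = find(vim.VirtualMachine, "app01")
target = find(vim.Network, "VLAN-200")  # the destination port group

# Take the VM's first vNIC and repoint its backing at the new port group.
nic = next(d for d in vm.config.hardware.device
           if isinstance(d, vim.vm.device.VirtualEthernetCard))
nic.backing = vim.vm.device.VirtualEthernetCard.NetworkBackingInfo(
    network=target, deviceName=target.name)

change = vim.vm.device.VirtualDeviceSpec(
    operation=vim.vm.device.VirtualDeviceSpec.Operation.edit, device=nic)
WaitForTask(vm.ReconfigVM_Task(vim.vm.ConfigSpec(deviceChange=[change])))
# The VM keeps running throughout; the Guest OS sees nothing.

Disconnect(si)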

Finally, let's take a look at the impact on management. As can be seen here, even the way we manage a server changes once it is converted into a VM:

Monitoring approach

Physical server: An agent is commonly deployed, and it is typical for a server to have multiple agents. In-Guest counters are accurate, as the OS sees the physical hardware. A physical server averages around 5 percent CPU utilization, a result of today's multicore chips, so there is little need to monitor it closely.

Virtual Machine: An agent is typically not deployed, although certain areas, such as application and Guest OS monitoring, are still best served by one. The key in-Guest counters are not accurate, as the Guest OS does not see the physical hardware (see the sketch after this table). A rightsized VM averages around 50 percent CPU utilization, 10 times higher than a physical server, so it needs to be monitored closely, especially when physical resources are oversubscribed. Capacity management becomes a discipline in itself.

Availability approach

Physical server: HA is provided by clusterware, such as Microsoft Windows Server Failover Clustering (WSFC) and Veritas Cluster Server (VCS). Clusterware tends to be complex and expensive. Cloning a physical server is a complex task and requires the boot drive to be on the SAN or LAN, which is not typical. Snapshots are rarely taken due to cost and complexity; typically, only very large IT departments snapshot physical servers.

Virtual Machine: HA is a built-in core component of vSphere. From what I see, most clustered physical servers end up as a single VM, since vSphere HA is good enough. Cloning can be done easily, even live; the drawback is that the clones become a new area of management. Snapshots can also be made easily; in fact, one is typically taken every time a backup runs. Snapshots, too, become a new area of management.

Company asset

Physical server: A physical server is a company asset with book value in the accounting system. It needs proper asset management, as components vary from server to server. An annual stock-take process is required.

Virtual Machine: A VM is not an asset, as it has no accounting value. It is like a document: technically, it is a folder with files in it. A stock-take process is no longer required, as a VM cannot exist outside vSphere.
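Because the in-Guest counters cannot be fully trusted, the numbers should come from the hypervisor. Here is a minimal pyVmomi sketch that pulls a VM's real-time CPU usage from vCenter's performance manager; the connection details and VM name are hypothetical placeholders.

import ssl
from pyVim.connect import SmartConnect, Disconnect
from pyVmomi import vim

si = SmartConnect(host="vcenter.example.com", user="administrator@vsphere.local",
                  pwd="secret", sslContext=ssl._create_unverified_context())
content = si.RetrieveContent()
view = content.viewManager.CreateContainerView(
    content.rootFolder, [vim.VirtualMachine], True)
vm = next(v for v in view.view if v.name == "app01")

pm = content.perfManager

# Map "group.name.rollup" strings to counter IDs, then pick cpu.usage.average.
ids = {"%s.%s.%s" % (c.groupInfo.key, c.nameInfo.key, c.rollupType): c.key
       for c in pm.perfCounter}
metric = vim.PerformanceManager.MetricId(counterId=ids["cpu.usage.average"],
                                         instance="")  # aggregate of all vCPUs

# Real-time statistics come in 20-second samples; fetch the last 15 (5 minutes).
spec = vim.PerformanceManager.QuerySpec(entity=vm, metricId=[metric],
                                        intervalId=20, maxSample=15)
result = pm.QueryPerf(querySpec=[spec])

# cpu.usage is reported in hundredths of a percent.
for sample in result[0].value[0].value:
    print("%.1f%%" % (sample / 100.0))

Disconnect(si)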