As mentioned earlier in this chapter, in the threads section, there is a special on-chip memory that (again, as of the time of writing) provides only 48 KB per block, visible to all the threads in that block. That 48 KB is enough memory to hold, say, an array of float values in which each thread stores the result of its own execution. That way, at the end of execution, one of the threads (typically thread 0) can walk over that shared array and sum all the values.
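To make this concrete, here is a minimal sketch of that pattern. The kernel name blockSum, the 256-thread block size, and the buffer names are illustrative assumptions, not code from the book:

```cpp
// Minimal sketch: each thread writes a partial value into shared memory,
// then thread 0 walks the shared array and sums the partials.
// Assumes blockDim.x <= 256 (hypothetical size), well within 48 KB.
__global__ void blockSum(const float* in, float* blockResults, int n)
{
    __shared__ float partials[256];   // one slot per thread

    int tid = threadIdx.x;
    int i   = blockIdx.x * blockDim.x + tid;

    // Each thread stores its own partial result.
    partials[tid] = (i < n) ? in[i] : 0.0f;

    // Wait until every thread in the block has written its slot.
    __syncthreads();

    // Thread 0 accumulates the block's total.
    if (tid == 0) {
        float sum = 0.0f;
        for (int t = 0; t < blockDim.x; ++t)
            sum += partials[t];
        blockResults[blockIdx.x] = sum;
    }
}
```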
Another typical use for shared memory is pre-fetching data from global memory and processing it locally. Say we need to compute the dot product of one vector against many other vectors. We could load vector A into shared memory once and reuse it while accessing the elements of each row of matrix M from global memory. Although simple, this saves global memory accesses, since every thread reads A from the fast on-chip copy instead of fetching it from global memory again for each row.
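A sketch of this pre-fetch pattern follows, assuming the vector length K fits in shared memory (here, at most 1,024 floats, or 4 KB). The kernel name dotRows and the parameter names are assumptions for illustration:

```cpp
// Sketch: each block stages vector A in shared memory once, then each
// thread computes the dot product of A against one row of M.
__global__ void dotRows(const float* A, const float* M,
                        float* out, int numRows, int K)
{
    __shared__ float sA[1024];        // assumes K <= 1024 (4 KB of floats)

    // Cooperatively copy A from global to shared memory.
    for (int j = threadIdx.x; j < K; j += blockDim.x)
        sA[j] = A[j];
    __syncthreads();                  // A is now fully staged on-chip

    int row = blockIdx.x * blockDim.x + threadIdx.x;
    if (row < numRows) {
        float acc = 0.0f;
        const float* rowPtr = M + (size_t)row * K;
        for (int j = 0; j < K; ++j)
            acc += sA[j] * rowPtr[j]; // A comes from fast shared memory
        out[row] = acc;
    }
}
```

Each element of A is read from global memory only once per block, rather than once per row, which is exactly the saving the paragraph above describes.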