This chapter requires us to use a CUDA version later than 9.x, and the GPU architecture should be Volta or Turing. If you use a GPU with Pascal architecture, skip the Grid-level cooperative groups section because this feature is introduced for Volta architecture.
Learn CUDA Programming
By :
Learn CUDA Programming
By:
Overview of this book
<p>Compute Unified Device Architecture (CUDA) is NVIDIA's GPU computing platform and application programming interface. It's designed to work with programming languages such as C, C++, and Python. With CUDA, you can leverage a GPU's parallel computing power for a range of high-performance computing applications in the fields of science, healthcare, and deep learning. </p><p> </p><p>Learn CUDA Programming will help you learn GPU parallel programming and understand its modern applications. In this book, you'll discover CUDA programming approaches for modern GPU architectures. You'll not only be guided through GPU features, tools, and APIs, you'll also learn how to analyze performance with sample parallel programming algorithms. This book will help you optimize the performance of your apps by giving insights into CUDA programming platforms with various libraries, compiler directives (OpenACC), and other languages. As you progress, you'll learn how additional computing power can be generated using multiple GPUs in a box or in multiple boxes. Finally, you'll explore how CUDA accelerates deep learning algorithms, including convolutional neural networks (CNNs) and recurrent neural networks (RNNs). </p><p> </p><p>By the end of this CUDA book, you'll be equipped with the skills you need to integrate the power of GPU computing in your applications.</p>
Table of Contents (18 chapters)
Title Page
Copyright and Credits
Dedication
About Packt
Contributors
Preface
Free Chapter
Introduction to CUDA Programming
CUDA Memory Management
CUDA Thread Programming
Kernel Execution Model and Optimization Strategies
CUDA Application Profiling and Debugging
Scalable Multi-GPU Programming
Parallel Programming Patterns in CUDA
Programming with Libraries and Other Languages
GPU Programming Using OpenACC
Deep Learning Acceleration with CUDA
Appendix
Another Book You May Enjoy
Customer Reviews