This chapter recommends an NVIDIA GPU with the Pascal architecture or later. In other words, your GPU's compute capability should be 6.0 or higher (written as 60 in compiler flags such as sm_60). If you are unsure of your GPU's architecture, visit NVIDIA's GPU site at https://developer.nvidia.com/cuda-gpus and confirm your GPU's compute capability.
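If you would rather check the compute capability from code than look it up on the website, the CUDA runtime can report it. The following is a minimal sketch, not part of the book's sample code: the helper names `capabilityCode` and `meetsMinimum` and the constant `kMinCapability` are hypothetical and were introduced only for this illustration, while `cudaGetDeviceCount` and `cudaGetDeviceProperties` are standard CUDA runtime API calls.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Assumption for this sketch: the chapter's minimum is compute capability 6.0,
// encoded as 60 (major * 10 + minor), matching the sm_60 naming.
static const int kMinCapability = 60;

// Hypothetical helper: encode (major, minor) as a two-digit number, e.g. 7.5 -> 75.
static int capabilityCode(int major, int minor) {
    return major * 10 + minor;
}

// Hypothetical helper: true if the device meets the chapter's recommendation.
static bool meetsMinimum(int major, int minor) {
    return capabilityCode(major, minor) >= kMinCapability;
}

int main() {
    int deviceCount = 0;
    cudaError_t err = cudaGetDeviceCount(&deviceCount);
    if (err != cudaSuccess) {
        fprintf(stderr, "cudaGetDeviceCount failed: %s\n", cudaGetErrorString(err));
        return 1;
    }

    // Report every visible GPU and whether it meets the recommendation.
    for (int dev = 0; dev < deviceCount; ++dev) {
        cudaDeviceProp prop;
        err = cudaGetDeviceProperties(&prop, dev);
        if (err != cudaSuccess) {
            fprintf(stderr, "cudaGetDeviceProperties failed: %s\n", cudaGetErrorString(err));
            return 1;
        }
        printf("Device %d: %s, compute capability %d.%d (%s)\n",
               dev, prop.name, prop.major, prop.minor,
               meetsMinimum(prop.major, prop.minor) ? "meets recommendation"
                                                    : "below recommendation");
    }
    return 0;
}
```

Assuming the file is saved as `check_cc.cu` (a name chosen here for illustration), it can be built with `nvcc check_cc.cu -o check_cc` and run on the target machine.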
The sample code was developed and tested with CUDA 10.1 when we wrote this book. In general, we recommend using the latest CUDA version that your GPU and driver support.
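To see which CUDA version your own environment provides, you can query the runtime and driver versions. This is a minimal sketch under stated assumptions: the helpers `versionMajor` and `versionMinor` are hypothetical names introduced for this illustration, and they rely on CUDA's documented integer encoding of versions as 1000 * major + 10 * minor (so 10.1 is reported as 10010). `cudaRuntimeGetVersion` and `cudaDriverGetVersion` are standard CUDA runtime API calls.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical helpers: decode CUDA's integer version encoding
// (1000 * major + 10 * minor), e.g. 10010 -> 10.1.
static int versionMajor(int version) { return version / 1000; }
static int versionMinor(int version) { return (version % 1000) / 10; }

int main() {
    int runtimeVersion = 0;
    int driverVersion = 0;

    // Version of the CUDA runtime library this program was built against.
    cudaError_t err = cudaRuntimeGetVersion(&runtimeVersion);
    if (err != cudaSuccess) {
        fprintf(stderr, "cudaRuntimeGetVersion failed: %s\n", cudaGetErrorString(err));
        return 1;
    }

    // Latest CUDA version supported by the installed driver.
    err = cudaDriverGetVersion(&driverVersion);
    if (err != cudaSuccess) {
        fprintf(stderr, "cudaDriverGetVersion failed: %s\n", cudaGetErrorString(err));
        return 1;
    }

    printf("CUDA runtime version: %d.%d\n",
           versionMajor(runtimeVersion), versionMinor(runtimeVersion));
    printf("CUDA driver supports up to: %d.%d\n",
           versionMajor(driverVersion), versionMinor(driverVersion));
    return 0;
}
```

If the runtime version printed is older than 10.1, some of the sample code may not build as written.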
In this chapter, we'll study CUDA programming by profiling the code. If your GPU architecture is Turing, we recommend installing Nsight Compute to profile the code. It is free, and you can download it from https://developer.nvidia.com/nsight-compute. When we wrote this book, NVIDIA's profiling tools were in a period of transition. You can learn about the basic usage of Nsight Compute in...