HOME ABOUT CONTACT

C/C++ CUDA Learning Notes - Kernels

Rain September 6, 2025

Introduction

CUDA (Compute Unified Device Architecture) is a parallel computing platform and application programming interface (API) model created by NVIDIA. It allows developers to use NVIDIA GPUs for general purpose processing (an approach known as GPGPU, General-Purpose computing on Graphics Processing Units).

This article records some key learning points about designing a simple CUDA kernel, along with the basic workflow and core concepts.

Here’s a quick overview with a flowchart:

CUDA Flowchart

Github Repository:

Kernels

When it comes to kernels, there are a few important points to note:

Thread Hierarchy

Last updated:

Related Posts

  1. Python - Accelerating Brain Mask Generation with CUDA
  2. C/C++ - Retrieving GPU Device Information with CUDA Driver API