GitHub - HROlive/Fundamentals-of-Accelerated-Computing-with-CUDA-Python: Fundamental tools and techniques for running GPU-accelerated Python applications using CUDA® GPUs and the Numba compiler.

Description

This course explores how to use Numba—the just-in-time, type-specializing Python function compiler—to accelerate Python programs to run on massively parallel NVIDIA GPUs.

You’ll learn how to:

Use Numba to compile CUDA kernels from NumPy universal functions (ufuncs);
Use Numba to create and launch custom CUDA kernels;
Apply key GPU memory management techniques.

Upon completion, you’ll be able to use Numba to compile and launch CUDA kernels to accelerate your Python applications on NVIDIA GPUs.

Information

At the conclusion of the workshop, you’ll have an understanding of the fundamental tools and techniques for GPU-accelerated Python applications with CUDA and Numba:

GPU-accelerate NumPy ufuncs with a few lines of code.

Configure code parallelization using the CUDA thread hierarchy.

Write custom CUDA device kernels for maximum performance and flexibility.

Use memory coalescing and on-device shared memory to increase the effective memory bandwidth of CUDA kernels.
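
To make the thread-hierarchy item above more concrete, the following sketch emulates on the CPU, in plain Python, how a 1D launch configuration maps threads to array elements. The helper names `launch_config` and `global_indices` and the choice of 128 threads per block are hypothetical values picked for illustration, not taken from the course material.

```python
import math

def launch_config(n_elements, threads_per_block=128):
    """Return (blocks_per_grid, threads_per_block) covering n_elements.

    threads_per_block=128 is an assumed default for this sketch.
    """
    blocks_per_grid = math.ceil(n_elements / threads_per_block)
    return blocks_per_grid, threads_per_block

def global_indices(blocks_per_grid, threads_per_block):
    """Emulate the 1D global index each thread would compute:
    blockIdx.x * blockDim.x + threadIdx.x (cuda.grid(1) in Numba)."""
    return [
        block * threads_per_block + thread
        for block in range(blocks_per_grid)
        for thread in range(threads_per_block)
    ]

n = 1000
blocks, threads = launch_config(n)
indices = global_indices(blocks, threads)

# The grid is rounded up to whole blocks, so some threads fall past
# the end of the data and must be masked out by a bounds check.
in_bounds = [i for i in indices if i < n]
idle_threads = len(indices) - len(in_bounds)
```

In a Numba kernel, the same pair would be passed at launch as `kernel[blocks, threads](...)`, and the kernel body would guard its work with an `if i < n` check for the same reason.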

More detailed information and links for the course can be found on the course website.

Certificate

The certificate for the course can be found below:

"Fundamentals of Accelerated Computing with CUDA Python" - NVIDIA Deep Learning Institute (Issued On: January 2025)

Folders and files

assessment
debug
img
slides
Custom CUDA Kernels in Python with Numba.ipynb
Effective Memory Use.ipynb
Introduction to CUDA Python with Numba.ipynb
README.md
