## High Quality Resources on GPU Programming/Architecture

### Articles/Blogs

- [GPU Programming](https://enccs.github.io/gpu-programming/)
- [The CUDA Parallel Programming Model](https://fabiensanglard.net/cuda/)
- [A HISTORY OF NVIDIA STREAM MULTIPROCESSOR](https://fabiensanglard.net/cuda/index.html)
- [Parallel Thread Execution](https://docs.nvidia.com/cuda/parallel-thread-execution/index.html)
- [How to Optimize a CUDA Matmul Kernel for cuBLAS-like Performance: a Worklog](https://siboehm.com/articles/22/CUDA-MMM)
- [Making Deep Learning Go Brrrr From First Principles](https://horace.io/brrr_intro.html)
- [CUDA Matrix Multiplication Optimization](https://leimao.github.io/article/CUDA-Matrix-Multiplication-Optimization/)
- [What Every Developer Should Know About GPU Computing](https://codeconfessions.substack.com/p/gpu-computing)
- [A minimal GPU design in Verilog to learn how GPUs work from the ground up](https://github.com/adam-maj/tiny-gpu)
- [GPU Programming: When, Why and How?](https://enccs.github.io/gpu-programming/)
- [Understanding GPU internals](https://cmeraki.github.io/gpu-part1.html)
- [Understanding the GPU programming model](https://cmeraki.github.io/gpu-part2.html)

### Tutorials
- [Intro to Parallel Programming](https://developer.nvidia.com/udacity-cs344-intro-parallel-programming)

### Notebooks
- [GPU Puzzles](https://github.com/srush/GPU-Puzzles)

### Videos
- [How GPU Computing Works](https://www.youtube.com/watch?v=3l10o0DYJXg)
- [Getting Started With CUDA for Python Programmers](https://youtu.be/nOxKexn3iBo?si=nung2_X-TXsnK4YK)
- [Programming Massively Parallel Processors - Lecture Series by the Book Author](https://www.youtube.com/playlist?list=PLRRuQYjFhpmubuwx-w8X964ofVkW1T8O4)
- [Programming Massively Parallel Processors: A Hands-on Approach and then this YT series](https://m.youtube.com/playlist?list=PL6RdenZrxrw-zNX7uuGppWETdxt_JxdMj&si=ZqKCQgFef-v3JBv8)
- [Programming Parallel Computers](https://youtube.com/playlist?list=PL2RY7P3JxZN-Pz1nwvnoJ9uEHmOmv4jmi&si=-7hc_4fQfFrMc8VZ)
- [GPU Programming Lectures](https://youtube.com/playlist?list=PL3xCBlatwrsXCGW4SfEoLzKiMSUCE7S_X&si=2vIw6R0JpZjBt8pR)
- [From Scratch CUDA](https://youtube.com/playlist?list=PLxNPSjHT5qvvwoy6KXzUbLaF5A8NdJvuo&si=rvc52nc-VAPVwhNh)
- [CUDA Programming](https://www.youtube.com/watch?v=xwbD6fL5qC8)
- [CUDA MODE Lectures](https://www.youtube.com/@CUDAMODE/videos)

### Books
- [RDNA3 Instruction Set Architecture Reference Guide](https://www.amd.com/content/dam/amd/en/documents/radeon-tech-docs/instruction-set-architectures/rdna3-shader-instruction-set-architecture-feb-2023_0.pdf)

### Community
- [CUDA MODE Discord](https://discord.com/invite/cudamode)