├── 00_googleDrive_CUDAExam.ipynb ├── 01_PyCUDA_simple_example.ipynb ├── 01_cuda_lab ├── 01_simple.ipynb ├── 02_openmp.ipynb ├── 03_simple_avx.ipynb ├── 04_helloCUDA.ipynb ├── 05_vectorAdd.ipynb ├── 06_2DIndex.ipynb ├── 07_memoryType.ipynb ├── 08_DeviceQuery_Bandwidth.ipynb ├── 09_coalMemory.ipynb ├── README.md └── clock.cu ├── 02_cuda_lab ├── 00_UnifiedMemory_SharedMem.ipynb ├── 00_googleDrive_CUDAExam.ipynb ├── 01_Transpose.ipynb ├── 03_reduction.ipynb ├── 04_atomic.ipynb ├── README.md ├── atomicAdd.cu ├── clock.cu ├── gpu_timer.h └── reduction_all.cu ├── 03_cuda_lab ├── 01_matmul.cu ├── 02_matmul_tile.cu ├── 03_matmul_tile2_mem_coel.cu ├── 04_matmul_tile3_noBankConflict.cu ├── 05_matmul_tile4_unroll.cu ├── 07_async_streams.cu ├── 08_stream_n_event.ipynb ├── README.md └── clock.cu ├── 03_numba_vectorize.ipynb ├── PPTs ├── 001_Intro. Parallel Computing.pptx ├── README.md ├── Robot_02_CUDA I - Basic Programming.pdf ├── Robot_03_CUDA II - Optimization - Transpose.pdf ├── Robot_04_CUDA III - Optimization - Reductions.pdf ├── Robot_05_CUDA IV - Optimization - Mat-Mat Multiplication.pdf └── Robot_06_CUDA V - Synchronization Stream.pdf ├── README.md ├── colab_gdrive.ipynb └── hello_CUDA.ipynb /00_googleDrive_CUDAExam.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jeonggunlee/Parallel_Programming/HEAD/00_googleDrive_CUDAExam.ipynb -------------------------------------------------------------------------------- /01_PyCUDA_simple_example.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jeonggunlee/Parallel_Programming/HEAD/01_PyCUDA_simple_example.ipynb -------------------------------------------------------------------------------- /01_cuda_lab/01_simple.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jeonggunlee/Parallel_Programming/HEAD/01_cuda_lab/01_simple.ipynb -------------------------------------------------------------------------------- /01_cuda_lab/02_openmp.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jeonggunlee/Parallel_Programming/HEAD/01_cuda_lab/02_openmp.ipynb -------------------------------------------------------------------------------- /01_cuda_lab/03_simple_avx.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jeonggunlee/Parallel_Programming/HEAD/01_cuda_lab/03_simple_avx.ipynb -------------------------------------------------------------------------------- /01_cuda_lab/04_helloCUDA.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jeonggunlee/Parallel_Programming/HEAD/01_cuda_lab/04_helloCUDA.ipynb -------------------------------------------------------------------------------- /01_cuda_lab/05_vectorAdd.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jeonggunlee/Parallel_Programming/HEAD/01_cuda_lab/05_vectorAdd.ipynb -------------------------------------------------------------------------------- /01_cuda_lab/06_2DIndex.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jeonggunlee/Parallel_Programming/HEAD/01_cuda_lab/06_2DIndex.ipynb -------------------------------------------------------------------------------- /01_cuda_lab/07_memoryType.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jeonggunlee/Parallel_Programming/HEAD/01_cuda_lab/07_memoryType.ipynb -------------------------------------------------------------------------------- /01_cuda_lab/08_DeviceQuery_Bandwidth.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jeonggunlee/Parallel_Programming/HEAD/01_cuda_lab/08_DeviceQuery_Bandwidth.ipynb -------------------------------------------------------------------------------- /01_cuda_lab/09_coalMemory.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jeonggunlee/Parallel_Programming/HEAD/01_cuda_lab/09_coalMemory.ipynb -------------------------------------------------------------------------------- /01_cuda_lab/README.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jeonggunlee/Parallel_Programming/HEAD/01_cuda_lab/README.md -------------------------------------------------------------------------------- /01_cuda_lab/clock.cu: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jeonggunlee/Parallel_Programming/HEAD/01_cuda_lab/clock.cu -------------------------------------------------------------------------------- /02_cuda_lab/00_UnifiedMemory_SharedMem.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jeonggunlee/Parallel_Programming/HEAD/02_cuda_lab/00_UnifiedMemory_SharedMem.ipynb -------------------------------------------------------------------------------- /02_cuda_lab/00_googleDrive_CUDAExam.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jeonggunlee/Parallel_Programming/HEAD/02_cuda_lab/00_googleDrive_CUDAExam.ipynb -------------------------------------------------------------------------------- /02_cuda_lab/01_Transpose.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jeonggunlee/Parallel_Programming/HEAD/02_cuda_lab/01_Transpose.ipynb -------------------------------------------------------------------------------- /02_cuda_lab/03_reduction.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jeonggunlee/Parallel_Programming/HEAD/02_cuda_lab/03_reduction.ipynb -------------------------------------------------------------------------------- /02_cuda_lab/04_atomic.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jeonggunlee/Parallel_Programming/HEAD/02_cuda_lab/04_atomic.ipynb -------------------------------------------------------------------------------- /02_cuda_lab/README.md: -------------------------------------------------------------------------------- 1 | ## 02_cuda_lab 2 | 3 | -------------------------------------------------------------------------------- /02_cuda_lab/atomicAdd.cu: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jeonggunlee/Parallel_Programming/HEAD/02_cuda_lab/atomicAdd.cu -------------------------------------------------------------------------------- /02_cuda_lab/clock.cu: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jeonggunlee/Parallel_Programming/HEAD/02_cuda_lab/clock.cu -------------------------------------------------------------------------------- /02_cuda_lab/gpu_timer.h: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jeonggunlee/Parallel_Programming/HEAD/02_cuda_lab/gpu_timer.h -------------------------------------------------------------------------------- /02_cuda_lab/reduction_all.cu: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jeonggunlee/Parallel_Programming/HEAD/02_cuda_lab/reduction_all.cu -------------------------------------------------------------------------------- /03_cuda_lab/01_matmul.cu: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jeonggunlee/Parallel_Programming/HEAD/03_cuda_lab/01_matmul.cu -------------------------------------------------------------------------------- /03_cuda_lab/02_matmul_tile.cu: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jeonggunlee/Parallel_Programming/HEAD/03_cuda_lab/02_matmul_tile.cu -------------------------------------------------------------------------------- /03_cuda_lab/03_matmul_tile2_mem_coel.cu: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jeonggunlee/Parallel_Programming/HEAD/03_cuda_lab/03_matmul_tile2_mem_coel.cu -------------------------------------------------------------------------------- /03_cuda_lab/04_matmul_tile3_noBankConflict.cu: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jeonggunlee/Parallel_Programming/HEAD/03_cuda_lab/04_matmul_tile3_noBankConflict.cu -------------------------------------------------------------------------------- /03_cuda_lab/05_matmul_tile4_unroll.cu: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jeonggunlee/Parallel_Programming/HEAD/03_cuda_lab/05_matmul_tile4_unroll.cu -------------------------------------------------------------------------------- /03_cuda_lab/07_async_streams.cu: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jeonggunlee/Parallel_Programming/HEAD/03_cuda_lab/07_async_streams.cu -------------------------------------------------------------------------------- /03_cuda_lab/08_stream_n_event.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jeonggunlee/Parallel_Programming/HEAD/03_cuda_lab/08_stream_n_event.ipynb -------------------------------------------------------------------------------- /03_cuda_lab/README.md: -------------------------------------------------------------------------------- 1 | ## 03_cuda_lab 2 | 3 | -------------------------------------------------------------------------------- /03_cuda_lab/clock.cu: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jeonggunlee/Parallel_Programming/HEAD/03_cuda_lab/clock.cu -------------------------------------------------------------------------------- /03_numba_vectorize.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jeonggunlee/Parallel_Programming/HEAD/03_numba_vectorize.ipynb -------------------------------------------------------------------------------- /PPTs/001_Intro. Parallel Computing.pptx: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jeonggunlee/Parallel_Programming/HEAD/PPTs/001_Intro. Parallel Computing.pptx -------------------------------------------------------------------------------- /PPTs/README.md: -------------------------------------------------------------------------------- 1 | ## Powerpoint course material 2 | ## 파워포인트 수업 자료 3 | -------------------------------------------------------------------------------- /PPTs/Robot_02_CUDA I - Basic Programming.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jeonggunlee/Parallel_Programming/HEAD/PPTs/Robot_02_CUDA I - Basic Programming.pdf -------------------------------------------------------------------------------- /PPTs/Robot_03_CUDA II - Optimization - Transpose.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jeonggunlee/Parallel_Programming/HEAD/PPTs/Robot_03_CUDA II - Optimization - Transpose.pdf -------------------------------------------------------------------------------- /PPTs/Robot_04_CUDA III - Optimization - Reductions.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jeonggunlee/Parallel_Programming/HEAD/PPTs/Robot_04_CUDA III - Optimization - Reductions.pdf -------------------------------------------------------------------------------- /PPTs/Robot_05_CUDA IV - Optimization - Mat-Mat Multiplication.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jeonggunlee/Parallel_Programming/HEAD/PPTs/Robot_05_CUDA IV - Optimization - Mat-Mat Multiplication.pdf -------------------------------------------------------------------------------- /PPTs/Robot_06_CUDA V - Synchronization Stream.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jeonggunlee/Parallel_Programming/HEAD/PPTs/Robot_06_CUDA V - Synchronization Stream.pdf -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jeonggunlee/Parallel_Programming/HEAD/README.md -------------------------------------------------------------------------------- /colab_gdrive.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jeonggunlee/Parallel_Programming/HEAD/colab_gdrive.ipynb -------------------------------------------------------------------------------- /hello_CUDA.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/jeonggunlee/Parallel_Programming/HEAD/hello_CUDA.ipynb --------------------------------------------------------------------------------